GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:46:26 Sequence gi568815596f:46658531_46860138 : 201608 bp : 41.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3169 3296 128 0 2 73 68 83 0.524 4.48 1.02 Intr + 4329 4519 191 0 2 69 60 144 0.249 8.01 1.03 Term + 22527 22672 146 1 2 47 40 164 0.418 4.59 1.04 PlyA + 22702 22707 6 1.05 2.03 PlyA - 23369 23364 6 1.05 2.02 Term - 41322 41142 181 2 1 77 53 169 0.657 8.40 2.01 Init - 52859 52801 59 2 2 70 63 64 0.566 2.93 2.00 Prom - 59728 59689 40 -3.95 3.00 Prom + 97596 97635 40 -1.35 3.01 Sngl + 100019 101611 1593 1 0 65 37 862 0.908 73.89 3.02 PlyA + 102074 102079 6 1.05 4.06 PlyA - 102270 102265 6 1.05 4.05 Term - 120208 120003 206 2 2 99 43 149 0.994 8.05 4.04 Intr - 121338 121074 265 0 1 11 79 165 0.872 4.06 4.03 Intr - 121772 121665 108 2 0 29 65 132 0.765 4.56 4.02 Intr - 122644 122454 191 2 2 76 -37 163 0.900 0.98 4.01 Init - 123234 122976 259 0 1 53 53 153 0.927 5.65 4.00 Prom - 123580 123541 40 -3.75 5.00 Prom + 131757 131796 40 -5.95 5.01 Init + 132003 132069 67 2 1 52 89 88 0.939 6.59 5.02 Term + 136634 136833 200 0 2 26 38 154 0.779 0.78 5.03 PlyA + 137573 137578 6 1.05 6.04 PlyA - 141414 141409 6 1.05 6.03 Term - 144061 143658 404 2 2 62 52 309 0.834 19.03 6.02 Intr - 145705 145533 173 0 2 108 32 99 0.691 4.96 6.01 Init - 147107 146887 221 2 2 103 83 119 0.745 11.05 6.00 Prom - 149951 149912 40 -6.35 7.00 Prom + 152788 152827 40 -6.25 7.01 Init + 153065 153082 18 1 0 87 83 -5 0.196 -1.00 7.02 Intr + 154906 154972 67 0 1 91 56 66 0.616 1.26 7.03 Intr + 155658 155827 170 1 2 127 -12 161 0.044 8.74 7.04 Intr + 158307 158382 76 2 1 53 99 91 0.226 4.87 7.05 Intr + 158718 158903 186 1 0 75 94 30 0.327 1.04 7.06 Intr + 159650 159701 52 0 1 133 72 23 0.583 2.25 7.07 Intr + 159994 160093 100 1 1 103 55 65 0.692 3.89 7.08 Term + 160333 160527 195 0 0 29 40 122 0.542 -2.07 7.09 PlyA + 160748 160753 6 1.05 8.07 PlyA - 161249 161244 6 -0.45 8.06 Term - 162143 161424 720 2 0 -6 42 263 0.484 4.79 8.05 Intr - 163378 163277 102 1 0 52 66 102 0.579 3.85 8.04 Intr - 167383 167211 173 2 2 11 105 165 0.626 9.24 8.03 Intr - 168629 168441 189 0 0 121 38 216 0.703 18.54 8.02 Intr - 169210 169108 103 1 1 20 24 56 0.429 -8.77 8.01 Init - 169765 169358 408 1 0 59 64 277 0.456 19.00 8.00 Prom - 175758 175719 40 -4.85 9.06 PlyA - 175780 175775 6 1.05 9.05 Term - 177150 176998 153 0 0 107 50 26 0.171 -2.36 9.04 Intr - 179581 179389 193 0 1 50 46 146 0.099 5.17 9.03 Intr - 187212 187110 103 1 1 48 90 56 0.089 0.11 9.02 Intr - 188493 188396 98 2 2 127 81 42 0.206 6.13 9.01 Init - 194933 194860 74 2 2 69 92 74 0.742 6.49 9.00 Prom - 195595 195556 40 -6.55 10.02 PlyA - 196316 196311 6 1.05 10.01 Term - 199759 199595 165 1 0 -5 44 239 0.489 7.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 155658 155881 224 1 2 127 48 191 0.917 15.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_1|154_aa MSIYVQINNAGAAMLGQSRGQTMQSSVRDHSTSAVQGKATPCSSPPCIALPIKSLLPGRA ICVSANPLPQPQESQDGNWLLLRLQVVTLLVEKVNLPILPLANWAGATPKTHMAPENKRT TAELLFRHQTAVLVDERNSFPTLKKMSRDRGAGM >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_1|465_bp atgagcatctacgtgcagatcaataatgccggtgcagcaatgctggggcagagtaggggc cagaccatgcaaagcagtgtcagagaccacagcacgtctgcagtccagggcaaggccacc ccatgtagttctcctccctgcatcgctttgcctattaaatccctgctaccaggcagagcc atctgcgtgtcagccaatccactgccgcagcctcaggagagccaggatggaaactggctt ctattacgacttcaggttgtgaccttgcttgtggaaaaagtaaatcttcctatcttacct ctggctaattgggcaggagcaactcccaaaacacacatggccccagagaataagaggacc acagctgagttgcttttcagacatcagacagctgtccttgtggatgaaagaaactccttt ccaactttgaagaaaatgtccagggatcggggagcaggcatgtga >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_2|79_aa MHDIDTHKENANSNHSEISHLCNCFSLRWPPRDPPRPRSRMPSYWPRTTPFPLGTPRSSN QKNNSTEMHREPTKRKLPL >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_2|240_bp atgcatgacatcgatactcacaaggaaaatgcaaattcaaaccacagtgagatatcccac ctgtgtaactgcttcagtctccggtggcctccccgcgacccaccacgaccccgatcacgg atgccctcctactggccccggacgacccccttccccttaggaaccccacgttcctccaac cagaaaaacaattccacagaaatgcatcgagagcctactaagcgcaagctcccactgtga >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_3|530_aa MWNNFKYRCQNLFGHEGGSRSENVDMNSNRCLSVKEKNISIGDSTPQQQSSPLRENIALQ LGLSPSKNSSRRNQNCATEIPQIVEISIEKDNDSCVTPGTRLARRDSYSRHAPWGGKKKH SCSTKTQSSLDADKKFGRTRSGLQRRERRYGVSSVHDMDSVSSRTVGSRSLRQRLQDTVG LCFPMRTYSKQSKPLFSNKRKIHLSELMLEKCPFPAGSDLAQKWHLIKQHTAPVSPHSTF FDTFDPSLVSTEDEEDRLRERRRLSIEEGVDPPPNAQIHTFEATAQVNPLYKLGPKLAPG MTEISGDSSAIPQANCDSEEDTTTLCLQSRRQKQRQISGDSHTHVSRQGAWKVHTQIDYI HCLVPDLLQITGNPCYWGVMDRYEAEALLEGKPEGTFLLRDSAQEDYLFSVSFRRYNRSL HARIEQWNHNFSFDAHDPCVFHSSTVTGLLEHYKDPSSCMFFEPLLTISLNRTFPFSLQY ICRAVICRCTTYDGIDGLPLPSMLQDFLKEYHYKQKVRVRWLEREPVKAK >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_3|1593_bp atgtggaataacttcaaatacaggtgtcagaatctcttcggtcatgagggaggaagccgt agtgaaaatgtggacatgaactccaacagatgtttgtctgtcaaagagaaaaacatcagc ataggagactcaactcctcagcaacaaagcagtcccttaagagaaaatattgccttacaa ctgggattaagcccttcgaagaattcttcaaggagaaatcaaaattgtgccacagaaatc cctcaaattgttgaaataagcatcgaaaaggataatgattcttgtgttaccccaggaaca agacttgcacgaagagattcctactctcgacatgctccatggggtgggaagaaaaaacat tcctgttctacaaagacccagagttcattggatgctgataaaaagtttggtagaactcga agtggacttcaaaggagagagaggcgctacggcgtaagttctgtacacgacatggacagt gtttccagcagaactgtaggaagtcgctctctaagacagaggttgcaggatactgtgggc ttgtgttttcccatgagaacttacagcaagcagtcaaagcctctcttttccaataaaaga aaaatccatctctctgaattaatgcttgagaaatgcccttttcctgctggctcagattta gcccaaaaatggcatttgattaaacagcatacagctcctgtgagcccacattcaacattt tttgatacatttgatccatctttggtttctacagaagatgaagaagataggcttagagag agaaggcggcttagtattgaagaaggggttgatccccctcccaatgcacaaatacataca tttgaagctactgcacaggttaatccattatataaactgggaccaaaattagctcctgga atgactgaaataagtggggacagttctgcaattccacaagctaattgtgactcggaagag gatacaaccaccctgtgtttgcagtcacggaggcagaagcagcgtcagatatctggagac agccatacccatgttagcagacagggagcttggaaagtccacacacagattgattacata cactgcctcgtgcctgatttgcttcaaattacagggaatccctgttactggggagtgatg gaccgttatgaagcagaagcccttctcgaagggaaacctgaaggcacgtttttgctcagg gactctgcgcaagaggactacctcttctctgtgagcttccgccgctacaacagatccctg catgcccgaattgagcagtggaatcacaactttagtttcgacgcccatgacccgtgtgta tttcactcctccactgtaacgggacttttagaacattataaagatcccagttcgtgcatg ttttttgaaccattgcttactatatcactaaataggactttcccttttagcctgcagtat atctgtcgcgcggtaatctgcaggtgcactacgtatgatggaattgatgggctccctcta ccctcaatgttacaggattttttaaaagagtatcattataaacaaaaagttagagttcgc tggttggaacgagaaccagtcaaggcaaagtaa >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_4|342_aa MGLSRTEQRGQAWKDLESVEDGWGRVPASGSTASLARKAWSCSLALTTPGEQQAKGSEPG HRCCRKAPSILPELNLTWPPNSPPPQESMEKNRTWKAQPAAVGAREDRWVEESCIYRRWL SAPRGASPAYSWNSSESASRGLSPLGPAAGDGIAVFLGVPPEADPETRIRVQVVYSGADS WNHGEVTNGKCAGHKQHHTDQGPRGALELAARRSVLAQVKACRIQSRRSEWESWLCSRAN LEQLPFPLSLTFFVEMGNSYFLHRIAVSPEAKIPGCKLPPEEHDLVQGSSLSKAALCREA ALCNKAGPERADSWGLAADQAFSGWAASPLEEGDLGCASLRL >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_4|1029_bp atgggactgagcaggactgagcagcggggccaggcctggaaggacttggagtctgtggag gatggatggggcagggtccccgcttccggctccacagctagccttgccaggaaggcttgg agctgctccctggccttgaccacccctggggagcaacaagcaaaaggttctgagccaggg catcggtgctgccggaaggcccccagcatcctgccagaactgaatctgacatggccaccc aacagcccacctccccaagaatcaatggaaaaaaacaggacttggaaagcccagccggct gcagtgggggctcgagaggacagatgggtagaagagagctgcatctataggcgctggctt tctgcccctcgtggggccagcccagcctactcctggaatagctctgagtccgccagcaga gggctgtcacccttggggcctgccgcaggagacgggattgctgtcttcctcggtgtgccc ccagaagcagaccctgagacaaggattcgggtgcaagtcgtttattcgggagctgattcc tggaatcacggagaggtgaccaacggaaagtgtgctggacataaacaacaccacactgac caaggccccagaggagctctggagctggcagccaggcgctcagtgctcgcacaggtgaag gcctgcaggattcagagtcgaagatctgagtgggaatcctggctctgttccagagcaaat cttgagcaacttcctttccctctaagcctcaccttctttgttgaaatgggaaactcatac tttctccacaggattgctgtgagccctgaagccaagataccaggatgcaagctgccacca gaagagcatgacctagtgcaaggcagctctttgagcaaggcagctctttgcagggaggcg gccctttgcaacaaggcaggccctgaaagagccgatagctgggggctggctgctgaccaa gctttcagtggatgggcagcaagtcctttagaggaaggggatctaggctgtgcatccctg cgtctataa >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_5|88_aa MTDLKEETDKSTKVAGDPKIPLSRRIALQWNATQYYEGTSTDTCNDMDEFQKYAEPNLPN MKDHLLQDSIYVNSRKSKPHLQCWKVDQ >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_5|267_bp atgacagacctgaaagaagaaacagacaaatccacaaaagtagctggagatcccaaaatt cctctcagcaggaggatagccttgcaatggaatgctactcagtattacgaaggaacatct actgatacatgcaacgacatggatgaatttcagaaatatgctgagccaaacttgccaaac atgaaagaccacttactacaggattctatttatgtgaattctagaaaaagcaaacctcat ctacagtgttggaaagtagatcagtga >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_6|265_aa MGSSDSSWHSAPESKMGPSHHSSHELKEQAVKIQAMPREPRGLPPGLCKTLIGITCHTGI QQPERAVSKRLTFTILLSLVACAQLAVRVRKVPTKCHTVSWVILPQPHEAGCVPCSTPED AVAQRGIGSTRGELALPSSRQMSSGFQASALTDEGASSGQAVGTTGGEMLMRSDPTHGAC PQTDEHVGKCAQVWQEMPVEMPWTGAADDWPRGHWRTSPGLCLQAGHRLAGDRSLFTRTN EWPRRGLPKGHLQRVVLAIGKKSDQ >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_6|798_bp atgggctccagcgacagctcctggcacagcgccccagaaagcaagatggggccatcccat catagcagtcatgagctcaaggagcaggctgtgaagatccaggcaatgccacgggaaccc aggggcctccctcctgggctgtgcaagactctgataggaatcacctgccacacggggatt cagcagccagagagagcagtctccaaaaggctgacatttacaattcttctttccttagtg gcctgtgctcagctggctgtcagagtgaggaaggtgcccactaaatgtcacacggtgagc tgggtaatattgccacagccccacgaggcagggtgtgtgccctgctccaccccggaggat gctgtggctcagcgaggaattggctccacacgtggagagctcgccctgccatcatccagg cagatgagctcaggcttccaggcttctgcactcacagatgagggggcgagctcaggacag gcagtgggaaccacagggggagaaatgctaatgagaagtgaccccacccatggagcctgt ccccaaactgatgagcatgttgggaaatgtgcccaagtgtggcaggaaatgcctgtggaa atgccttggacaggggcagccgatgactggccacggggacactggaggacatcgcctggc ctctgcctccaagcagggcaccgtctggctggggacagaagcctcttcacacgaaccaat gagtggcccaggcggggtcttcccaagggtcacctgcagcgtgtggttctggccatcgga aagaagagtgaccagtga >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_7|287_aa MDPGCELSSLRATWHLYEKTQSALISDAGFGKGCFTRGLSWSWKQCEFILARQRGPDSPS LDASVGRGLENVAQAAVSSRYSPQHCTFLPLKVQEERGNPQVELTMRTVTALRGKSRNQP SSFLYQEDWRLQPWLCGASLALSSDCPTWLQVRVRGSWRWKMQLGELTRQVSGNSLNKRT PDHACSSSPGAVGGFWERSGMPGGGIGEAKKRQKPTERSNEPQVKHMENHQVLKQRSSEQ SRPELVWGLVLWGTMADSSLWSLLGCLVSPWLSSADGKQPYITKCGQ >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_7|864_bp atggacccaggatgtgagctttccagcctccgagcaacttggcatctgtatgagaaaacc cagtcagccctcatctcagatgcagggtttggcaaaggctgcttcactcgaggcctctca tggagctggaaacaatgcgagtttattttggcacggcaaagagggcccgattccccttct ctggacgcctctgtgggaagaggactagagaatgtggcccaggccgctgtcagttcccgt tactctccccagcactgcacctttctgccgctgaaagtacaagaggagaggggaaaccca caggtggagctgaccatgaggactgtcacagctttgagaggcaaatccagaaatcagccc tccagcttcctctaccaagaagactggcggttacaaccctggctctgtggggcttctctt gccctctcttctgactgccctacttggctccaggtgagggtgagaggaagctggagatgg aagatgcagctgggggaactcaccaggcaggtctcagggaactctttgaataagaggaca ccagatcatgcctgcagttccagcccaggggctgtgggagggttctgggagaggtcggga atgccaggaggaggaattggtgaagcaaagaaaaggcaaaagcccactgagaggtctaat gagccccaggtaaagcacatggaaaaccaccaagtcctaaagcagagaagttctgagcag agcaggcctgagcttgtatggggcctggtgctgtggggcaccatggcagattccagcctg tggtcattattgggttgcctggtgtccccatggctctctagcgctgatggaaaacagcct tatatcactaagtgtggacaataa >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_8|564_aa MQRALEASEAGQGGRGGRGGWGIGEGEAKGMEASSISEVPSSTQPIPREAETAAPRAELP TGHPCLPTLLPGCLSLPAQHLITSELEDQGLSEEDCRTPRYQLTGWLISWVQYGQAYRLG MRTSEAPRSPQAAWTAVAVKHAHPEADQWPGRSLRAVPAVPRVQTWRGQGGECRSEALGT GSEPSWRVTGAALAPPQEASLEQGKIREAGADRSSEIGCSDPCSVQRGAGTCPGQGILET ERQGPEFINHLQTCRSGLGAQIPFGASWARALTRRTLEKEGRPRGEAQGWPDPSPEPQSD AVDLLSSSLQSSWVCETIEESQQRQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIRHIN RTKYKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDNPTANIILNGQK LEAFPLKTGTRQGCPLSPLLFNIVLEVLARAVGQEKEIKGIQLGKEEVKLSLFADDMIVY LENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLHTNNRQTENQIMSELPFTIASKRIK YLGIQLTRDLKDLFKETINHCSVK >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_8|1695_bp atgcagagagcacttgaggcaagtgaggcaggacaaggaggacgaggaggacgaggagga tggggcatcggggaaggggaggcaaagggcatggaggcctcctccatctcagaggtccct tcctcaacccaacccatcccaagagaggcagaaacagcagcacccagggcagagctgccc actgggcatccgtgtctccccacactcttacctggctgcctgtctcttccagcccaacac ctcatcacctccgagttagaggaccagggcttgagcgaggaggactgtaggaccccgagg taccagctgacagggtggctgatctcatgggtccagtatggccaggcctacaggctgggg atgaggacctctgaagcccctcgtagcccccaggctgcctggacggcagtggctgtgaaa catgctcaccctgaagcagatcagtggccaggccgttccctgagagcagtccctgcagtg cccagggtccagacctggcgggggcagggaggtgagtgtcggagcgaggcccttggcact ggctctgagccatcatggcgagtgaccggagctgctctggcacctcctcaggaagcttcc ctggagcaggggaagataagagaggctggggctgacaggtcctccgagatcggctgctct gacccttgcagtgtacagagaggggcaggcacttgcccaggccaggggatcctggaaaca gaacggcagggccctgagttcatcaaccacctgcaaacctgccggtctgggttgggtgcc caaatcccgtttggggccagctgggccagggccctcacacggagaacactcgagaaggaa gggcggcccagaggagaggcccaagggtggccggacccgagccctgaaccacagtctgat gccgtggacctgctttcaagcagcttacagtctagttgggtctgtgagaccattgaagaa tcccagcagaggcagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatc cctgggatgcaaggctggttcaacatatgcaaatcaataaacgtaatccggcatataaac agaaccaaatacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaa attcaacagcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctc aaaataataagagctatttatgacaaccccacagccaatatcatactgaatgggcaaaaa ctggaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactccta ttcaacatagtgttggaagttctggccagggcagtcgggcaggagaaagaaataaagggt attcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatat ctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttacacaccaataacaga caaacagagaaccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaa tacctaggaatccaacttacaagggatctgaaggacctcttcaaggagactataaaccac tgctcagtgaaataa >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_9|206_aa MKIKNKDNKENSRQEQDLGSQISGGTQSGGALQVSRNCNYHCPRKASTGVAEGLGYQGYS LLVRSTQRPFPLPRLHLAGTKGSYAIVSWEDGGNFRFNMMLHPLLRPEELEPKQPSQMAS PGGQTRPSLKRETSGAPPKPSSACSADSPIAVECCVTDGRQVSNSCPQVTLPPWPLKILG FQAQGPVRCRQGNQGRETTEFEKLKF >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_9|621_bp atgaaaataaagaacaaagacaacaaagaaaacagccgacaggaacaggatcttgggagc caaatatcaggtgggacacagagcggaggggcactccaagtcagcaggaactgcaactat cactgtcccaggaaggcaagcacgggagtagcagagggactaggataccaagggtatagt ctcctggtccgctcaacccaaaggcccttcccacttcctcggctgcatttggcaggaaca aagggttcatatgccattgtgtcctgggaagacggcggaaacttccgttttaacatgatg ttacaccctctcctaaggcctgaggaactggagcccaagcagccttcacagatggcctcc ccaggtggtcagaccagaccttccctgaagagagagacctctggagctccccctaagccc agctctgcctgctctgcggactcccctatagcagtagaatgctgtgtgacagatggacgg caggtctcaaactcttgccctcaagtgaccctcccaccttggcctctcaaaattctggga tttcaggctcagggcccagtccgatgcaggcaaggaaaccagggaagagaaactactgag tttgagaaacttaagttttga >gi568815596f:46658531_46860138|GENSCAN_predicted_peptide_10|54_aa VVLAQDRRSDGLTQTRDALLRPWSSSEREHLMDLLNVKSMPKEPRQEELCHQTF >gi568815596f:46658531_46860138|GENSCAN_predicted_CDS_10|165_bp gtggttcttgctcaggaccgtcgttcagacggactcacgcagactagagatgctctctta agaccttggagctcatccgaaagggaacacctcatggatctgctgaatgtgaagtccatg ccgaaggaaccacgacaggaggagctgtgtcaccaaacattctga