GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:26:05 Sequence gi568815584r:52543573_52795481 : 251909 bp : 38.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 8530 8855 326 2 2 48 42 204 0.053 5.74 1.02 PlyA + 8954 8959 6 -0.45 2.04 PlyA - 9331 9326 6 -0.45 2.03 Term - 10010 9684 327 2 0 80 54 411 0.885 30.62 2.02 Intr - 10228 10159 70 0 1 63 69 31 0.779 -3.03 2.01 Init - 10454 10288 167 2 2 76 17 178 0.898 8.67 2.00 Prom - 10845 10806 40 -6.95 3.00 Prom + 20401 20440 40 -2.65 3.01 Init + 23356 23358 3 0 0 98 53 0 0.401 -2.45 3.02 Intr + 25335 25736 402 2 0 51 100 216 0.744 12.80 3.03 Term + 26746 26889 144 0 0 30 39 173 0.539 3.53 3.04 PlyA + 27061 27066 6 1.05 4.00 Prom + 27222 27261 40 -6.15 4.01 Sngl + 29394 30344 951 2 0 31 47 261 0.347 12.63 4.02 PlyA + 30391 30396 6 -0.45 5.00 Prom + 30674 30713 40 -4.25 5.01 Init + 37777 37779 3 0 0 113 81 0 0.161 1.85 5.02 Term + 46399 46593 195 0 0 55 47 168 0.353 5.83 5.03 PlyA + 46654 46659 6 1.05 6.05 PlyA - 46837 46832 6 1.05 6.04 Term - 49386 48768 619 2 1 27 39 280 0.609 9.92 6.03 Intr - 50084 49853 232 0 1 50 72 159 0.207 6.41 6.02 Intr - 51869 51385 485 1 2 12 38 195 0.310 -1.36 6.01 Init - 52372 52227 146 1 2 82 86 142 0.954 13.04 6.00 Prom - 64884 64845 40 -6.55 7.14 PlyA - 65043 65038 6 1.05 7.13 Term - 66134 66034 101 0 2 96 42 114 0.923 4.91 7.12 Intr - 80035 79935 101 0 2 108 37 32 0.000 -1.07 7.11 Intr - 102715 102582 134 1 2 109 96 79 0.686 9.32 7.10 Intr - 108736 108667 70 0 1 84 86 21 0.496 -0.23 7.09 Intr - 109743 109497 247 1 1 91 38 126 0.494 3.50 7.08 Intr - 120275 120229 47 1 2 96 94 16 0.859 0.13 7.07 Intr - 122923 122803 121 2 1 46 44 149 0.902 4.93 7.06 Intr - 128131 128058 74 0 2 75 30 94 0.922 0.43 7.05 Intr - 128299 128223 77 1 2 67 20 195 0.943 7.99 7.04 Intr - 134900 134862 39 2 0 127 93 40 0.979 5.90 7.03 Intr - 138836 138753 84 2 0 112 47 54 0.766 2.80 7.02 Intr - 140335 140216 120 1 0 103 63 64 0.842 5.17 7.01 Init - 151909 151796 114 1 0 77 99 253 0.292 23.56 7.00 Prom - 157686 157647 40 -6.35 8.00 Prom + 160693 160732 40 -2.15 8.01 Init + 163606 163732 127 0 1 102 65 214 0.990 20.87 8.02 Intr + 164737 164816 80 2 2 45 96 111 0.949 5.95 8.03 Intr + 164911 164950 40 0 1 93 82 37 0.867 0.48 8.04 Intr + 165192 165244 53 1 2 79 90 78 0.813 4.81 8.05 Intr + 170309 170396 88 1 1 82 77 35 0.661 0.42 8.06 Intr + 174509 174570 62 0 2 102 98 16 0.776 1.53 8.07 Intr + 174657 174780 124 2 1 79 91 73 0.936 5.94 8.08 Intr + 177538 177618 81 2 0 69 82 50 0.687 1.19 8.09 Intr + 180393 180464 72 1 0 88 110 100 0.999 10.66 8.10 Term + 183927 184045 119 1 2 94 28 144 0.980 6.72 8.11 PlyA + 184097 184102 6 1.05 9.00 Prom + 184327 184366 40 -6.25 9.01 Init + 186903 187051 149 2 2 107 29 211 0.764 16.71 9.02 Intr + 214302 214352 51 0 0 101 110 10 0.357 1.60 9.03 Intr + 216110 216182 73 2 1 117 93 12 0.457 3.09 9.04 Intr + 225268 225361 94 0 1 111 110 28 0.443 5.92 9.05 Term + 227461 227534 74 2 2 36 38 101 0.205 -3.01 9.06 PlyA + 228422 228427 6 1.05 10.06 PlyA - 230103 230098 6 1.05 10.05 Term - 234886 234739 148 0 1 110 42 49 0.548 -1.11 10.04 Intr - 237168 237107 62 0 2 54 87 83 0.460 1.41 10.03 Intr - 238339 238212 128 2 2 83 95 145 0.997 14.18 10.02 Intr - 241092 240925 168 1 0 73 80 103 0.694 7.00 10.01 Init - 250656 250524 133 0 1 68 37 107 0.058 3.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_1|108_aa XLRCFPPFMQRKEKAKCPQEAPGEALRCPRDGCLKSDSGAVQCSPGYRKLLRLNPRVCRQ SPQPEGPALSPTCPGPAGGDLGRSRGRAANSSRRHATEEEAEDGAVPD >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_1|327_bp nggctccggtgtttcccgccgttcatgcagcgaaaagagaaagcaaaatgccctcaggag gctccaggtgaggcccttcggtgccccagagacgggtgtcttaagtcagactccggagct gtgcagtgcagccctggctacagaaagctgcttcgtctgaatccccgggtctgcaggcag agcccacagcctgaagggccagcgctgtcacctacctgcccgggacctgcgggcggggac ctgggccgttcccgaggccgcgcggccaacagttcccgccgccacgcgacggaggaggag gcagaagacggggccgtgccagactag >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_2|187_aa MAGSSPASAELLLPQHPTSLNTVRLRQSHRGCYFLRCHFVGATNALPRRARTPNKGPNPS VIKPISHLSLRRKHWLTLQVEVEETERGELETGGEAVEQPVGEEVQVSGRPEQGQGAAER EGGGEEGGPERCPQETEEEAEALVTQPPLAVQEQPPQLQVGKGEQRGVEQGVQDAQRQLH GARHRGA >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_2|564_bp atggctggcagcagtccagcttcagcagagctgctgctacctcagcatccaacctccctc aacacagtccggcttcggcagagccaccggggctgctacttcttgcgatgccactttgtt ggtgccaccaacgccttacctcggagggccagaacaccgaacaaaggcccgaacccctca gtaatcaagccaatcagccacttaagtttaaggcgaaaacactggctcacactccaggta gaggttgaggagacagagcgtggagaactggagacaggaggggaagcagtagagcagcca gtgggggaagaagtgcaggtgagcgggcggccggagcaagggcagggagccgctgagcga gaaggcggcggagaagagggtggtcctgagcgctgcccacaggagacagaggaagaggca gaggctctggtaactcagccgccgctcgcggtacaggagcagccgccacagctgcaggta ggcaaaggcgaacagcgcggcgtagagcagggcgtgcaggacgctcagcgccaactgcac ggagcccggcaccgcggcgcctga >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_3|182_aa MRAADLPAQRLSSAKGQTASSIGSLTTLPPDWEIPPSKGRQTPYATGELRLASDGCPSGT KLPEEGTDSNLCCLAASAGDTQANRVWRGPPADLHRRGLTVRRKTNKQKGMASTSTKKDI HAKTPSEEHWRQRPKACLTRASEGSTKYGKEQPVAATAKTYQIVKTIDPMKKLHRLTGKI TS >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_3|549_bp atgagagcagcggatctcccagcacagcgcttgagctctgctaagggacagactgcctcc tcaattgggtccctgaccactttgcctcctgactgggagatacctcccagcaaggggcga cagacaccttatgcaacaggagagctccggctggcatctgatgggtgcccgtctgggacg aagcttccagaggaaggtacagacagcaatctttgctgtttggcagcctccgctggtgat acccaggcaaacagggtctggagaggacctccagcagacctgcaccggaggggcctgact gttagaaggaaaactaacaaacagaaaggaatggcatcaacatcaacaaaaaaggacatc catgcaaaaaccccatctgaagagcactggcgtcaaagaccaaaggcctgccttacaaga gcttctgaaggaagcactaaatatggaaaggaacaaccggtagcagccactgcaaaaaca taccagattgtaaagaccatcgaccctatgaagaaactgcatcgactaacgggcaaaata accagctag >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_4|316_aa MYNCYKENKIPRNTTYKGCEGPLQGELQTTAQGNKRGCKQMEKHCMLMDRRINVVKMAIL PKAISRLSAIPIKLPLTFFTELEQTILNFIWNQKRAHIAKTILSKKNTAGGIMLPDFKLY YKATVTKTAWYLYQNRYIDQWNRTETSEITTHVYNHLIFDKPDESKQWGKDSLFNKWCWE NWLAICRTLKLDPFLTPYTKINSRWIEDLNVRPKTIKTQEENLGNTIQDIGMGKDFMTKT PKAMATKAKVDKWDLIKLKNFCTAKGTIIRVNRQPTESEEIVAIYPSDKGLISRIYKELI QIYKKKATPSKHGQRI >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_4|951_bp atgtataattgctacaaagagaataaaatacctaggaatacaacctacaagggatgtgaa ggacctcttcaaggagaactacaaaccactgctcaaggaaataagagaggatgcaaacaa atggaaaaacattgcatgctcatggatagaagaatcaatgtcgtgaaaatggccatactg cccaaagcaatatctagattaagtgctatccccatcaagctaccattgactttcttcaca gaattagaacaaactattttaaatttcatatggaaccaaaaaagagcccatatagccaag acaatcctaagcaaaaagaacacagctggaggcatcatgctacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtacttataccaaaacagatatatagaccaa tggaacagaacagagacatcagaaataacgacacatgtctacaaccatctgatctttgac aaacctgatgaaagcaagcaatggggaaaggattccctatttaataaatggtgttgggaa aactggctagccatatgcagaacactgaaactggaccccttccttacaccttatacaaaa attaactcacgatggattgaagacttaaacgtaagacctaaaaccataaaaacccaagaa gaaaacctaggcaataccattcaggacatagggatgggcaaagactttatgaccaaaaca ccaaaagcaatggcaacaaaagccaaagttgacaaatgggatctaattaaactaaagaac ttctgcacagcaaaaggaactatcatcagagtgaacaggcaacctacagaatcggaggaa attgttgcaatctatccatctgacaaagggctaatatccagaatttacaaggaacttata caaatttataagaaaaaagcaaccccatcaaaacatgggcaaaggatatga >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_5|65_aa MNGFLLIKKKLTVIQPHVGPSGSIPEEGTVIIGDDSSVHVIASKHLPVAQDVELEDSDTD DPDPV >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_5|198_bp atgaatggctttctacttattaaaaaaaagttaactgtaatacagcctcacgtaggtcct tcaggaagtattccagaagaaggcactgttatcataggagatgacagctccgtgcatgtt attgcatctaaacaccttccagtggcacaagatgtggagttagaagacagtgatacagat gatcctgaccctgtgtag >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_6|493_aa MGRNQNTKAENSKNQTASSPPKDSRSLPAMEQNWMENEFDELTEVGFKSDGENGTKLENT LQDIIQENFHNLARQANIQIQEIQRTPQRYSSRRGTPRHIIVRFTKVEMKEKMLRAAREK GRVTHEGKPIRLTGDLSAETPQTRKEWGSIFNILKEKNFQPRISYPDKLSFISEGEIKSF TDKQMLRDFVSTRPALQELLKEALNMERNTELQTTIREYYKHLYANKLENPEEMDKFLDT YTLPRLNQEEVESLNRSITGSEIEAIINSLLTKKRPGPDGFTAEFYQSQYHTEWAKTRST PFENRYKTRMLSLTTPIQHSVGNSGQGNQARERNKGYSIGKEEVKLSLFADDMIVYLENP IVSAQNLLKLTSNFSKVSGYKINVQKSQAFLYTNNRQTERQIMSEFPFTIVTKRIKYLRI QLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRSSIMKMPILPKVIYRFIAIPIK LPMTFFTELEKLL >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_6|1482_bp atggggagaaaccagaacacaaaagctgaaaattccaaaaaccagactgcctcttctcct ccaaaggatagcagatccttgccagcaatggaacaaaactggatggagaacgagtttgac gagttgacagaagtaggcttcaaaagtgatggggagaatggaaccaagttagaaaacact cttcaggatattatccaggagaacttccacaacctagcaaggcaggccaacattcaaatc caggaaattcagagaacaccacaaagatactcctcaagaagaggaaccccaagacacata attgtcagattcaccaaggttgaaatgaaggaaaaaatgttaagggcagccagagagaaa ggtcgggttacccacgaagggaagcccatcagactaacaggggatctctcggcagaaacc ccacaaaccagaaaagagtggggttcaatattcaacattcttaaagaaaagaattttcaa cccagaatttcatatccagacaaactaagcttcataagcgaaggagaaataaaatccttt acagacaagcaaatgctgagagattttgtctccaccaggcctgccttacaagagctcctg aaggaagcactaaacatggaaagaaacaccgaattacaaactaccatcagagaatactat aaacacctctatgcaaataaactagaaaatccagaagaaatggataaattcctggacaca tacaccctcccaagactaaaccaggaagaagttgaatctctgaatagatcaataacaggt tctgaaattgaggcaataattaatagcctactaaccaaaaaacgtccaggaccagatgga ttcacagccgaattctaccagagccagtatcatactgaatgggcaaaaactagaagcact ccctttgaaaatcggtacaagacaaggatgctctctctcaccactcctattcaacatagt gttggaaattctggccagggcaatcaggcaagagaaagaaataaagggtattcaattgga aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaacccc atcgtctcagcccaaaatctccttaagctgacaagcaacttcagcaaagtctcaggatac aaaatcaatgtgcaaaaatcacaagcattcctatacaccaataacagacaaacagagagg caaatcatgagtgaattcccattcacaattgttacaaagagaataaaatacctaagaatc caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaa ataaaagaggacacaaacaaatggaagaacattccatgctcatggataggaagaagcagt atcatgaaaatgcccatactgcccaaggtaatttatagattcattgctatccccatcaag ctaccaatgactttcttcacagaattggaaaaactactttaa >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_7|442_aa MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYR LFPRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKY SEEANNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVD LLLNPERYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQETWLEKKWGHNITEF QQRFDGILTEGEGPRRLKNLYFLYLIELRALSKVLPFFERPDFQLFTGNKIQDEENKMLL LEILHEIKSFPLHFDENSFFAGDKKEAHKLKTQGLGTALKILFSEKLIANMPESGPSYEF HLTRQEIVSLFNAFGRTFLNSFMVQVKLQHHFETNHSEFKAKGIQILDVDPSGPEIGINF RALTRHGASLGLERPPVLRQLQ >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_7|1329_bp atgggccgcggctggggattcttgtttggcctcctgggcgccgtgtggctgctcagctcg ggccacggagaggagcagcccccggagacagcggcacagaggtgcttctgccaggttagt ggttacttggatgattgtacctgtgatgttgaaaccattgatagatttaataactacagg cttttcccaagactacaaaaacttcttgaaagtgactactttaggtattacaaggtaaac ctgaagaggccgtgtcctttctggaatgacatcagccagtgtggaagaagggactgtgct gtcaaaccatgtcaatctgatgaagttcctgatggaattaaatctgcgagctacaagtat tctgaagaagccaataatctcattgaagaatgtgaacaagctgaacgacttggagcagtg gatgaatctctgagtgaggaaacacagaaggctgttcttcagtggaccaagcatgatgat tcttcagataacttctgtgaagctgatgacattcagtcccctgaagctgaatatgtagat ttgcttcttaatcctgagcgctacactggttacaagggaccagatgcttggaaaatatgg aatgtcatctacgaagaaaactgttttaagccacagacaattaaaagacctttaaatcct ttggcttctggtcaagagacctggttagaaaagaaatggggacacaacattacagaattt caacagcgatttgatggaattttgactgaaggagaaggtccaagaaggcttaagaacttg tattttctctacttaatagaactaagggctttatccaaagtgttaccattcttcgagcgc ccagattttcaactctttactggaaataaaattcaggatgaggaaaacaaaatgttactt ctggaaatacttcatgaaatcaagtcatttcctttgcattttgatgagaattcatttttt gctggggataaaaaagaagcacacaaactaaagactcagggtttgggcactgctctgaag atcttattttctgagaaattgatagcaaatatgccagaaagtggacctagttatgaattc catctaaccagacaagaaatagtatcattattcaacgcatttggaagaacatttttgaat agtttcatggtccaagttaagctgcagcatcactttgagaccaatcattcagagtttaaa gcaaaaggaattcaaattttagatgtagatccatctgggccagaaattggcatcaacttc cgagccctcacaagacatggagcaagcctaggcttagagcgtcctccagtgctgagacaa ctgcagtag >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_8|281_aa MAIPGIPYERRLLIMADPRDKALQDYRKKLLEHKEIDGRLKELREQLKELTKQYEKSEND LKALQSVGQIVGEVLKQLTEEKFIVKATNGPRYVVGCRRQVIELPLTNPELFQRVGIIPP KGCLLYGPPGTGKTLLARAVASQLDCNFLKVVSSSIVDKYIGESARLIREMFNYARDHQP CIIFMDEIDAIDIDLPNEQARLDILKIHAGPITKHGEIDYEAIVKLSDGFNGADLRNVCT EAGMFAIRADHDFVVQEDFMKAVRKVADSKKLESKLDYKPV >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_8|846_bp atggccattcccggcatcccctatgagagacggcttctcatcatggcggaccctagagat aaggcgcttcaggactaccgcaagaagttgcttgaacacaaggagatcgacggccgtctt aaggagttaagggaacaattaaaagaacttaccaagcagtatgaaaagtctgaaaatgat ctgaaggccctacagagtgttgggcagatcgtgggtgaagtgcttaaacagttaactgaa gaaaaattcattgttaaagctaccaatggaccaagatatgttgtgggttgtcgtcgacag gtgatagaattacctcttacaaacccagagttatttcagcgtgtaggaataatacctcca aaaggctgtttgttatatggaccaccaggtacgggaaaaacactcttggcacgagccgtt gctagccagctggactgcaatttcttaaaggttgtatctagttctattgtagacaagtac attggtgaaagtgctcgtttgatcagagaaatgtttaattatgctagagatcatcaacca tgcatcatttttatggatgaaatagatgctattgatattgatttgccaaatgaacaagca agattagacatactgaaaatccatgcaggtcccattacaaagcatggtgaaatagattat gaagcaattgtgaagctttcggatggctttaatggagcagatctgagaaatgtttgtact gaagcaggtatgttcgcaattcgtgctgatcatgattttgtagtacaggaagacttcatg aaagcagtcagaaaagtggctgattctaagaagctggagtctaaattggactacaaacct gtgtaa >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_9|146_aa MEDVKLEFPSLPQCKEDAEVSRSRGCHAQASPCGSGRGATPVPNRLSRHLAAFVIAYIME TFGMKYRDAFAYVQERRFCINPNAGFVHQLQEYEAIYLAKLTIQMMSPLQIERSLSVHSG TTGSLKRTHEEEDDFGTMQVATAQNG >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_9|441_bp atggaggacgtgaagctggagttcccttcccttccacagtgcaaggaagacgccgaggtg agtcgctcccgtggctgccacgcacaggcctctccctgtggctccggccgaggggcgacc ccagtccccaaccgtcttagccgccaccttgcagcctttgttattgcatacattatggaa acatttggaatgaagtacagagatgcttttgcttatgttcaagaaagaagattttgtatt aatcctaatgctggatttgtccatcaacttcaggaatatgaagccatctacctagcaaaa ttaacaatacagatgatgtcaccactccagatagaaaggtcattatctgttcattctggt accacaggcagtttgaagagaacacatgaagaagaggatgattttggaaccatgcaagtg gcgactgcacagaatggctga >gi568815584r:52543573_52795481|GENSCAN_predicted_peptide_10|212_aa MVPGQGILKLLIQANVPESLLNSITDIYVYPRNSLICANENIVVDLTRKMKPDETPMFDP SLLKEVDWSQNTATFSPAISPTHPGEGLVLRPLCTADLNRESFEHMKKSGDYYVTVVEDV TLGQIVATATLIIEHKFIHSCAKRGRVEDVVVSDECRGKQLGKLLLSTLTLLSKKLNCYK ITLECLPQNVGFYKKFGYTVSEENYMCRRFLK >gi568815584r:52543573_52795481|GENSCAN_predicted_CDS_10|639_bp atggtacccggccaaggtattcttaaacttctcattcaggccaacgttcctgaatcactt ctcaactcaatcactgacatctatgtttatccacgtaatagccttatttgtgccaatgag aatattgttgttgaccttactagaaaaatgaaacctgatgaaactcctatgtttgaccca agtctactcaaagaagtggactggagtcagaatacagctacattttctccagccatttcc ccaacacatcctggagaaggcttggttttgaggcctctttgtactgctgacttaaataga gaatcttttgagcatatgaagaaatctggggattattatgttacagttgtagaagatgtg actctaggacagattgttgctacggcaactctgattatagaacataaattcatccattcc tgtgctaagagaggaagagtagaagatgttgttgttagtgatgaatgcagaggaaagcag cttggcaaattgttattatcaacccttactttgctaagcaagaaactgaactgttacaag attacccttgaatgtctaccacaaaatgttggtttctataaaaagtttggatatactgta tctgaagaaaactacatgtgtcggaggtttctaaagtaa