GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:20:40 Sequence gi568815592f:63476881_63680171 : 203291 bp : 38.82% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3233 3628 396 1 0 71 35 374 0.805 24.49 1.02 PlyA + 4352 4357 6 1.05 2.06 PlyA - 5152 5147 6 1.05 2.05 Term - 14364 14082 283 2 1 25 55 122 0.018 -3.49 2.04 Intr - 21828 21582 247 1 1 75 75 117 0.081 4.70 2.03 Intr - 30806 30610 197 1 2 59 98 147 0.806 10.94 2.02 Intr - 48482 48380 103 0 1 116 -7 103 0.002 1.91 2.01 Init - 52039 51991 49 1 1 96 89 12 0.599 1.29 2.00 Prom - 52295 52256 40 -6.55 3.00 Prom + 58751 58790 40 -5.75 3.01 Sngl + 60942 61118 177 2 0 91 54 179 0.954 9.50 3.02 PlyA + 61485 61490 6 1.05 4.04 PlyA - 62009 62004 6 1.05 4.03 Term - 72436 71828 609 1 0 -17 36 563 0.386 34.11 4.02 Intr - 72618 72444 175 2 1 73 8 189 0.816 8.42 4.01 Init - 73638 73538 101 0 2 40 80 57 0.515 -0.11 4.00 Prom - 75147 75108 40 -3.95 5.03 PlyA - 75383 75378 6 1.05 5.02 Term - 89840 89707 134 0 2 27 43 162 0.796 2.87 5.01 Init - 90118 89977 142 1 1 9 92 151 0.885 7.94 5.00 Prom - 93897 93858 40 -6.05 6.03 PlyA - 94147 94142 6 1.05 6.02 Term - 96309 95805 505 2 1 62 37 360 0.954 21.23 6.01 Init - 96688 96447 242 1 2 66 -40 205 0.464 0.89 6.00 Prom - 98450 98411 40 -6.55 7.00 Prom + 98945 98984 40 -9.05 7.01 Init + 100001 100105 105 1 0 75 94 44 0.963 3.99 7.02 Intr + 101557 101649 93 0 0 36 89 78 0.792 1.94 7.03 Intr + 102018 102148 131 2 2 48 84 165 0.995 10.67 7.04 Intr + 102377 102451 75 2 0 48 98 57 0.563 0.41 7.05 Term + 103177 103294 118 1 1 122 38 124 0.976 7.83 7.06 PlyA + 104549 104554 6 1.05 8.04 PlyA - 104808 104803 6 1.05 8.03 Term - 121542 121330 213 0 0 64 42 160 0.538 5.25 8.02 Intr - 121623 121592 32 1 2 116 86 -23 0.283 -2.77 8.01 Init - 139288 138967 322 1 1 67 53 273 0.011 19.25 8.00 Prom - 144880 144841 40 -5.05 9.00 Prom + 145149 145188 40 -3.85 9.01 Init + 152201 152203 3 1 0 113 81 0 0.049 1.85 9.02 Intr + 158249 158543 295 1 1 40 91 106 0.051 1.76 9.03 Intr + 158953 159270 318 2 0 54 92 237 0.091 15.71 9.04 Intr + 159405 159564 160 1 1 90 40 56 0.029 -0.88 9.05 Intr + 169647 169915 269 0 2 87 91 187 0.832 15.15 9.06 Intr + 203120 203281 162 0 0 89 84 162 0.887 14.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 48482 48277 206 0 2 116 37 130 0.939 7.25 S.002 Init + 199512 199572 61 2 1 67 103 30 0.897 3.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_1|131_aa LSDAADAMGFRDLKSPTGLQVHNGYLADQSCNKGYVPSQADVAVIEAVSSPLSGNLCHAL HWYNYIKSYEKEKASLAGMKKALGKYGPANVEDSRGSGATDSEDDNDIYLFGFDKGEESK ESKRQREEYPA >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_1|396_bp ctctcagatgcagctgatgccatgggtttcagagacctgaaaagccccactggcctccag gtgcacaatggttacctggctgaccagagctgcaacaaggggtatgtgccatcacaagca gacgtggcagtaattgaagcagtctccagtccattgtccggtaatttgtgtcatgctcta cattggtataattacatcaagtcttacgaaaaggaaaaagccagcctagcaggaatgaag aaagctttgggcaagtatggacctgctaatgtggaagatagtagaggaagtggagctaca gatagtgaagatgacaatgacatttatctatttggatttgataagggggaggaaagcaaa gaatcaaaaaggcaaagggaagaataccctgcatag >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_2|292_aa MGFHHVSQAGLELLTSAPPLAESNRKPGGKNDTGGYPQPASASQGQIWKSGKDEESGFEM NRANTTYEVQELLITRMVSYENWLWIFYWILIETVPDCGTPGDYATRSTHQDVIRYKQCN REKPHENIASLSCPCSSIVSTTFDNMTSFGFWVSVMTTANLDCLLLARHCVGEDRRTRLI PDSAFIKLMFQWEPIENGRLKCWKIWGLGDLSAFSQGHLISMVTIEKRKVLVNPTLGKGQ IVGKTLFLNVCMRVVSEELSIGISRLSKEDPPSPLWAGIIQPVEGPNQKKKG >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_2|879_bp atggggtttcaccatgttagtcaggctggtcttgaactcctgacctcagctcccccattg gcagaatctaaccggaagccaggtggcaaaaatgacactggaggttacccacaaccggct tcagcatcacaaggccaaatatggaagagcggaaaggatgaagaatcaggctttgaaatg aacagagccaatacaacttatgaggtccaggaactgctgataacaagaatggtctcatat gaaaactggctttggatattttactggattctaatagaaactgttcctgattgtggaaca ccaggtgactatgcaactagatctactcatcaggatgttatcagatacaaacaatgtaac agggagaaaccacatgaaaacattgcttcactgtcctgtccttgttcaagcattgtgtcc actacctttgataacatgacaagttttgggttttgggtttctgtcatgactacagcaaac cttgactgcctactacttgccagacattgtgtaggtgaagatcggagaacaagactgatc ccagattctgctttcataaagcttatgttccaatgggagccaatagaaaatggaaggtta aagtgctggaagatctggggtcttggtgacctctcagctttcagtcaggggcacctcatc agcatggtcaccatagagaagagaaaggtgctggttaaccctaccctgggaaagggccag atagttggtaaaacattatttctgaatgtgtgtatgagggtagtttcagaagagcttagc attggaatcagtagactgagtaaagaagatccaccttcaccactgtgggcaggcatcatt caacccgttgagggccctaatcaaaaaaaaaaaggttga >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_3|58_aa MRTPQPPRIWRKGHGRSEREVGENPTNGIQAGPDRSLKEQKRGCSVTDKFHLHQSEFT >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_3|177_bp atgaggacacctcaaccgccaagaatctggcgaaagggccacggccgtagtgagagagaa gttggggagaatcctaccaatggaatacaggcaggcccagacaggagtttaaaagaacag aaaagaggatgtagtgtcacagacaaatttcacttgcatcagtcagagtttacctga >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_4|294_aa MKVGVAYSPLSVAGFFLQVESSEERFNFCSFTLRKEGQGKEGGSGPAVVKKQEVKKVVNP LFEKRPKNFDIGQDIQPKRDLTSFVKWPPLYQRQRAILCKWLKVPPEINQFTQALDHQTA ALLLQLAHKYRSETKQEKKQRLLALAKKKAAGKGGIPTKRPPVLRAGVNTITTLLENKKA QPVVIAHDVDPIELVVFLPALCHKMRVPYCIIKGEARLGRLVHRKTCTTVAFIQVNSEDK GALAKLVGAIRTDYNDRYNEIRRHWGGNVLGPKSVACIGKLEKAKAQELATKLG >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_4|885_bp atgaaagttggtgtggcctactcacccctctctgtagcagggttctttcttcaggtagag tcgtctgaagaacgcttcaatttttgtagcttcaccctcaggaaagaaggccaagggaaa gaaggtggctccggccctgctgtcgtgaagaagcaggaggttaagaaagtggtgaatccc ctgtttgagaaacggcctaagaattttgacattggacaggacattcagcccaaaagagac ctaaccagctttgtgaaatggcccccactatatcagcggcagagagccatcctctgtaag tggctgaaagtgcctcctgagattaaccagttcacccaggccctggaccaccaaacagct gctctgctacttcagctggcccacaagtacagatcagagactaagcaagagaagaaacag aggctgttggccctggccaaaaagaaagctgctggcaaagggggcatccccactaagaga ccacctgtccttcgagcaggagttaacaccatcaccaccttgctagagaataagaaagct cagccggtggtgattgcacacgacgtggatcccatcgagctggttgtcttcttgcctgcc ctgtgtcataaaatgcgggtcccttactgcattatcaagggggaggcaagactgggacgt ctagtccacaggaagacctgcaccactgtcgccttcatacaggttaactcagaagacaaa ggcgctttggctaagctagtgggagctatcaggactgactacaatgacagatacaatgag atccgccgtcactggggaggcaatgtcctgggtcccaagtctgtggcttgcattggcaag ctcgaaaaggcaaaggctcaagaacttgcaactaagctgggttaa >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_5|91_aa MNEELLLMKEKREWFLEIESTAGEDAVSIVEMIKDLEYHINLADKAVEIATATLTFSNYL PDQSAAIITEAGFSNSKDYDLLKAQMTVSIF >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_5|276_bp atgaatgaggagttgcttcttatgaaggagaaaagagagtggtttcttgagatagaatct actgctggtgaagatgccgtaagcattgttgaaatgataaaggatttagaatatcacata aacttagctgataaagcagtggaaattgccacagccaccttaaccttcagcaactacctc cctgatcagtcagctgccatcatcactgaggcaggattctccaacagcaaagattatgac ttgctgaaggctcagatgactgttagcattttttag >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_6|248_aa MPQRAAAPRTRLCAPEHARCAPSPRRNHGFAPRLQSGAGAEAILSEARVPAVEHKSVAAP SGCPAPTPELREALEPDRYHPAATLLRPNKEAPGRRGACHARTGRARPKGQGRRRRSQSG FCATHPARCGSRNCPRPPQKPDRAAPVAPGSGGTQKPVPPAANRHPAAPRGPASASAQPA AIGRPQGHNPPAPTRDPPEPQPGATWRRRRAPNKGDSGDGGPVVDSQPERRRCDGYSQEY VYSLKIVA >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_6|747_bp atgccccagagggccgccgccccgcgcacgcggctgtgtgcgcccgagcacgcccgctgc gccccctctccgcgccgcaatcacgggttcgcgccgcgccttcagtcaggcgcgggcgcc gaggccattttgtctgaggctcgggtcccggccgtcgagcacaagagtgtagctgcaccc agtggctgtcccgcgcccaccccggagctccgagaagctctggaaccagatcgttaccac cctgcagccactctcctccggccgaacaaagaggccccgggacgccgcggagcctgccat gcccggacgggtcgcgccagaccaaagggccaagggcggaggcggcggtcacaaagcggc ttttgtgccacccacccggcccgctgcggaagccgaaactgcccccggccgccgcagaag ccggaccgcgccgcccccgtcgctccgggctcgggagggactcaaaagccggtgcctccg gcggccaaccgccaccccgcagccccgaggggcccggcatctgcctccgcacagccggcc gcaatcggccggccacagggccataacccgcccgctcccacccgggacccaccggaaccg caacccggggcaacctggaggagacgccgggctccgaacaaaggcgacagcggggatggg gggccggtcgtggattcccagcccgagaggcgacgatgcgacggctactcacaggaatat gtttactcattgaagattgtggcctaa >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_7|173_aa MARMNRPAPVEVTYKNMRFLITHNPTNATLNKFIEELKKYGVTTIVRVCEATYDTTLVEK EGIHVLDWPFDDGAPPSNQIVDDWLSLVKIKFREEPGCCIAVHCVAGLGRAPVLVALALI EGGMKYEDAVQFIRQKRRGAFNSKQLLYLEKYRPKMRLRFKDSNGHRNNCCIQ >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_7|522_bp atggctcgaatgaaccgcccagctcctgtggaagtcacatacaagaacatgagatttctt attacacacaatccaaccaatgcgaccttaaacaaatttatagaggaacttaagaagtat ggagttaccacaatagtaagagtatgtgaagcaacttatgacactactcttgtggagaaa gaaggtatccatgttcttgattggccttttgatgatggtgcaccaccatccaaccagatt gttgatgactggttaagtcttgtgaaaattaagtttcgtgaagaacctggttgttgtatt gctgttcattgcgttgcaggccttgggagagctccagtacttgttgccctagcattaatt gaaggtggaatgaaatacgaagatgcagtacaattcataagacaaaagcggcgtggagct tttaacagcaagcaacttctgtatttggagaagtatcgtcctaaaatgcggctgcgtttc aaagattccaacggtcatagaaacaactgttgcattcaataa >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_8|188_aa MIKGVTLGFHYKMRSVYAHFPINVVIQENGSLVEIRNFLGKKYIRRVWMRAGVVCSVSQA QKDELILEGNDIELVSNSAALIQQATTVKKKDIREFLDGIYVSEKGTDLHFCNSFHIRCL LELFDLRARLSLLLQVSSLIVANGHHNRAFPTHLTLLQAASKRTGTYKSFSIQSSLVSGK NVVFTRSL >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_8|567_bp atgatcaagggtgttacactaggcttccattacaagatgagatctgtgtatgctcacttc cccatcaacgtcgttatccaggagaatgggtctcttgttgaaatccgaaatttcttgggt aaaaaatacatccgcagggtttggatgagagcaggtgttgtttgttcagtatctcaagcc cagaaagatgaattaatccttgaaggaaatgacattgagcttgtttcaaattcagcggct ttgattcaacaagccacaacagttaaaaagaaggatatcagggaatttttggatggtatc tatgtctctgaaaaaggaactgaccttcacttttgcaactcttttcatattcgttgtctc ttggaactcttcgatcttcgtgctcggctgtctttattgcttcaggtgtcttccctgata gttgcaaatggccaccacaaccgagcgttccctactcacctgacgttgctccaagcagca agcaagcgcacgggtacttacaagtctttctccatacaatcttctcttgtttcaggaaag aatgttgtttttaccagaagcctctaa >gi568815592f:63476881_63680171|GENSCAN_predicted_peptide_9|403_aa MISSMSGLIEDGWILLSVSALNQLLYAVLVEGWEENLALDTCSWQRNGGSTERSWRPLRE TWDHILKTAYWIPRMFLGGSNLSPEVDQPCRVRVTEVVNAPAPPPHADGCGVSRRRTVPT RQATFGLRAAAAATRIRRAARKPRIVSSAAAIWSQEEKRLWQRRRRPARTPSPRHPPGPL LLLFGGGSVHHLPLAASGSARLAELAERSPLGPLAQRPRPTWAAPLAPVDHLDRGLGRES QRRGRQAEGRDHGLSPDHWAERHTEVFMDIVDTFNHLIPTEHLDDALFLGSNLENEVCED FSASQNVLEDSLKNMLSDKDPMLGSASNQFCLPVLDSNDPNFQMPCSTVVGLDDIMDEGV VKESGNDTIDEEELILPNRNLRDKVEENSVRSPRKSPRLMAQX >gi568815592f:63476881_63680171|GENSCAN_predicted_CDS_9|1209_bp atgatttcctcaatgtctggcttaatagaagatggctggattctcttatctgtttctgca ttgaatcagctgctatatgctgttttggttgaagggtgggaagaaaatttggccttagac acatgctcctggcaaaggaatggaggctctactgaaaggtcttggcgacccctcagggaa acctgggaccacattctgaagactgcttactggataccaagaatgtttcttggaggtagc aatttgagtcctgaggtagatcagccatgtagggtgagggtaacggaggtagttaatgct cccgcgccgccgccgcacgccgatggctgcggggtctcgcgccgtcgcaccgtccccacg cggcaagcgaccttcgggctcagggcggcggcggctgcaacgaggattaggagggcggcg cggaagccaagaatagtgtcgtcagcagcagccatttggtcccaggaggaaaagaggctg tggcagcgacgccgacgtcctgcgcgtaccccctctccgcggcacccaccgggccccctc ctcctcctcttcggcggcggcagcgtccaccatcttcctcttgctgccagtggtagcgct cgtctggcggagctggccgagagatcgcctctcgggcctcttgcgcaacgtcctcggcct acctgggccgcgccgctggcgcctgtggaccatttggaccgcgggctggggagggagtcg cagcgacgcggtcgccaggcggagggtcgggaccacggcctctctcccgaccactgggct gaaagacacacagaagtcttcatggatatagttgatacatttaatcatttaattcctact gaacacttagatgatgccctatttctaggatccaacctggagaatgaagtctgtgaggat tttagtgcaagtcaaaatgtcttagaggactcgctgaagaacatgctcagcgataaggat cctatgctaggatctgcaagtaaccagttctgtttgcctgttttggatagcaatgatccc aatttccagatgccttgttcaacagttgttggtcttgacgatattatggatgaaggagtt gttaaagaaagtggcaatgataccattgatgaagaagaactgattttacctaacaggaac ttaagggacaaggtagaagaaaattcagtgagatctccaagaaaatcacctcgtttaatg gcacaagnn