GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:48:31

Sequence gi568815592f:146443872_146654602 : 210731 bp : 37.13% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    359    354    6                               1.05
 1.01 Sngl -   5377   4574  804  1  0   53   49   373 0.947  25.46
 1.00 Prom -   5562   5523   40                              -7.45

 2.00 Prom +   7336   7375   40                              -4.65
 2.01 Init +   9776   9893  118  1  1   72  109   102 0.162  11.11
 2.02 Term +  28288  28397  110  2  2   84   44   125 0.655   5.39
 2.03 PlyA +  28535  28540    6                               1.05

 3.00 Prom +  45462  45501   40                              -5.15
 3.01 Sngl +  57450  58352  903  2  0   86   48   173 0.852   9.07
 3.02 PlyA +  58543  58548    6                               1.05

 4.00 Prom +  62005  62044   40                              -4.25
 4.01 Init +  63082  63142   61  0  1   52   92    27 0.617   0.96
 4.02 Intr +  63193  63312  120  2  0   22  105   120 0.438   6.65
 4.03 Term +  65730  65845  116  1  2   47   54    64 0.085  -3.35
 4.04 PlyA +  66059  66064    6                               1.05

 5.06 PlyA -  66846  66841    6                               1.05
 5.05 Term -  67822  67612  211  0  1   53   47   144 0.503   2.58
 5.04 Intr -  69361  68978  384  0  0   57   39   151 0.314   0.14
 5.03 Intr -  70849  70692  158  1  2   63   72   128 0.106   6.59
 5.02 Intr -  77947  77499  449  2  2   65   86   245 0.060  14.14
 5.01 Init -  81961  81955    7  1  1   85  110     0 0.443   3.02
 5.00 Prom -  88977  88938   40                              -5.45

 6.00 Prom +  95563  95602   40                              -2.65
 6.01 Init + 100001 100250  250  1  1   79  102   480 0.923  44.17
 6.02 Intr + 105593 105870  278  0  2   70  100   220 0.257  17.71
 6.03 Term + 110585 110734  150  1  0   96   40   180 0.996  10.93
 6.04 PlyA + 111065 111070    6                               1.05

 7.04 PlyA - 112008 112003    6                               1.05
 7.03 Term - 114113 113730  384  2  0   10   43   199 0.340   1.50
 7.02 Intr - 121899 121818   82  2  1   92   87    23 0.392   1.32
 7.01 Init - 122774 122737   38  2  2   86   91    28 0.486   2.46
 7.00 Prom - 124108 124069   40                              -0.85

 8.00 Prom + 133397 133436   40                              -3.35
 8.01 Init + 143803 143946  144  0  0  104   -4   127 0.008   3.45
 8.02 Term + 155020 155247  228  0  0   89   49   228 0.019  14.65
 8.03 PlyA + 156735 156740    6                               1.05

 9.00 Prom + 158527 158566   40                              -6.45
 9.01 Init + 158756 158861  106  1  1   99   78   125 0.868  13.07
 9.02 Intr + 177507 177639  133  1  1   53   30   135 0.070   2.98
 9.03 Intr + 191504 191666  163  2  1  103   78   174 0.938  16.96
 9.04 Term + 207190 207321  132  0  0   63   49   112 0.041   1.91
 9.05 PlyA + 208075 208080    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_1|267_aa
MWESLELPRDLLNGFDQKPDSDMDSKVQVEVVSDGDEELVGKWSKGDSCYILAKRLSAFC
SCSRNLWNFELERDDSGYLMEEISKHQSIQEVTWVLLRVFSFIREEEDKSLENVQPDNVI
EKKNPFCKEKFKLVAEICISNESNVNPPDNEENVSRTRQRSSRSPCHHRPGGIGENGFMV
WDQGLRAVCSLGTWCPVFQLPQPQLKGANIKLRQWLQRVQAPSLGSFHVVLSLQVHRSQE
LGFGNLCLDFKRAEVCCRGRVLMGNLC
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_1|804_bp
atgtgggaaagtttagaacttcctagagacttgttgaatgggtttgaccaaaagcctgat
agcgatatggacagtaaggtccaggttgaggtggtctcagatggagatgaggaacttgtt
gggaagtggagcaaaggtgactcttgttatattttagcgaagagactgtcagcattttgc
tcctgctctagaaatttgtggaactttgaacttgagagagatgattcagggtatctgatg
gaagaaatttctaagcaccaaagcattcaagaggtgacttgggtgctgttaagggtattc
agttttataagggaagaagaggataaaagtttggaaaatgtgcagcctgacaatgtgata
gagaagaaaaacccattttgtaaggagaaattcaagctggttgcagaaatttgcataagt
aatgagtcaaatgttaatcccccagacaatgaggaaaatgtctccaggacacgtcagagg
tcatcacgcagcccctgccatcacaggcctggaggcataggggaaaatggtttcatggtc
tgggaccagggtctccgtgctgtgtgcagcctagggacttggtgccctgtgttccagctg
ccccagccacagctgaaaggggccaacataaagctcaggcagtggcttcagagggtgcaa
gccccaagccttggcagcttccatgtggtgttgagcctgcaagtgcacagaagtcaagaa
ttggggtttgggaacctgtgcctagatttcaaaagagcagaagtttgctgcaggggcagg
gtgctcatggggaacctctgctag
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_2|75_aa
MVYGASEAIRQRHSSASKPRRSQSESLGPEFQGLWEWLPARKSNTLTIDDGLYKSDPRKE
TETTPTVKVFDCQLG
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_2|228_bp
atggtctacggggcttctgaggcgatcaggcagcgtcattcttcagccagtaagccaaga
aggagtcaatcagagagccttgggccagagttccaggggctctgggagtggctgccagct
agaaaatccaacactctaactatagatgatggtttgtacaagtcggaccccaggaaagag
acagaaaccacaccaactgtgaaggtatttgactgtcaacttggctag
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_3|300_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKARGITLPD
FKLYYKATVTKTARYWYQNRDIHQWNRTEPSEITPHIYNYLIFDKPDKNKKWGKDSLFNK
WCWENWLAICRKLKLDPFLTPYTKINSRWIKYLNVRPKTIKTLEENLGITIQDIGMSKDF
MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIY
NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRQNEKMLIITGHQRNAKDIVNFKIC
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_3|903_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcatcgccaagtcaatcctaagccaaaagaacaaagccagaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcacggtactggtaccaaaacaga
gatatacatcaatggaacagaacagagccctcagaaataacaccgcatatctacaactat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaatacttaaacgttagacctaaaaccata
aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgagcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca
aaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag
gatatgaacagacacttctcaaaagaagatatttatgcagccaaaagacaaaatgaaaaa
atgctcatcatcactggccatcagagaaatgcaaaagatattgtaaattttaaaatctgt
taa
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_4|98_aa
MSKVLFSQSIKSEFPSSSESIAFQQYNYKYKQVREALCEELSFGDMAIFHATQDDLLMDS
GISPNAIPLPAPTPQQAPACDVPCPVSMCSRCSIPTYE
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_4|297_bp
atgagtaaggtactcttctcgcaaagcattaaatcagaatttccaagctcatcagagagc
atcgcatttcaacagtacaattataagtacaaacaggttcgggaagcactatgtgaagag
ttaagctttggggacatggccattttccatgcaacccaagatgacctcctcatggattct
ggtatttctcctaatgctatccctctgccagcccccactccccaacaggccccagcgtgt
gatgttccctgccctgtgtccatgtgttctcgttgttcaattcccacctatgagtga
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_5|402_aa
MTVLEVLARAIRQKKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTGSQIMSELPFPIASKRIKYLGIQLTRDVKNLFKENHKPL
LNEIKEDTNKWKNIPCSWIRRINIMKMDILPKDCSSLPAMEQNWTENDFDQLTEVGFRRS
VITNFSELKEHVLTQWKEAKNLEKRSRRQKIYKYIQDLNSALHQADLIDIYRTLPPKSTE
YTFFSAPHRTYSKIDHIIGSKTLLSKCKRTEITTNCLSDHSAIKLELKFKKLTQNCTTTW
KLNNLLLNDYWVNNKMKAEIKMFFETNENKDTTIVLEVLARAIRQEKEIKGIQLGKQEVK
LSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKS
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_5|1209_bp
atgacagtgttggaagttctcgccagggcaatcaggcagaagaaagaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtgtatttg
gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaa
acagggagccaaatcatgagtgaactcccattcccaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaagaacctcttcaaggagaaccacaaaccactg
ctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatggataaga
agaatcaatatcatgaaaatggacatactgcccaaggattgcagctccttgccagcaatg
gaacaaaactggacggagaatgactttgatcagttgacggaagtaggcttcagaaggtcg
gtaataacaaacttctctgagctaaaggagcatgttctaacccagtggaaggaagctaaa
aaccttgaaaaaagatcaagaagacaaaagatttacaagtatatccaggatttgaactca
gctctgcaccaagcagacctaatagacatctacagaactctcccccctaaatcaacagaa
tatacattcttctcagcaccacatcgtacttattctaaaattgaccatataattggaagt
aaaacactcctcagcaaatgtaaaagaacggaaatcacaacaaactgtctctcagaccac
agtgcaatcaaattagaactcaagtttaagaaactcactcaaaactgcacaactacatgg
aaactgaacaatctgctcctgaatgactactgggtaaataacaaaatgaaggcagaaata
aagatgttctttgaaaccaatgagaacaaagacacaaccatagtgttggaagttctggcc
agggcaatcaggcaagaaaaagaaataaagggtattcaactaggaaaacaggaggtcaaa
ttgtccctgtttgcagatgacatgattgtgtatttggaaaaccccatcgtctcagcccaa
aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa
aaatcataa
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_6|225_aa
MAGGGAGDPGLGAAAAPAPETREHLFKVLVIGELGVGKTSIIKRYVHQLFSQHYRATIGV
DFALKVLNWDSRTLVRLQLWDIAGQERFGNMTRVYYKEAVGAFVVFDISRSSTFEAVLKW
KSDLDSKVHLPNGSPIPAVLLANKCDQNKDSSQSPSQVDQFCKEHGFAGWFETSAKDNIN
IEEAARFLVEKILVNHQSFPNEENDVDKIKLDQETLRAENKSQCC
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_6|678_bp
atggcgggcggaggagccggggaccccggcctgggggcggccgccgccccagcgcccgag
acccgcgagcacctcttcaaggtgctggtgatcggcgagcttggcgtgggcaagaccagc
atcatcaagcgctacgtccaccagctcttctcccagcactaccgggccaccatcggggtg
gacttcgccctcaaggtcctcaactgggacagcaggactctggtgcgcctgcagctgtgg
gacatcgcggggcaggagcgatttggcaacatgacccgagtatactacaaggaagctgtt
ggtgcttttgtagtctttgatatatcaagaagttccacatttgaggcagtcttaaaatgg
aaaagtgatctggatagtaaagttcatcttccaaatggcagccctatccctgctgtcctc
ttggctaacaaatgtgaccagaacaaggacagtagccagagtccttcccaggtggaccaa
ttctgcaaagaacatggctttgccggatggtttgaaacctctgcaaaggataacataaac
atagaggaagctgcccggttcctagtggagaagattcttgtaaaccaccaaagctttcct
aatgaagaaaacgatgtggacaaaattaagctagatcaagagaccttgagagcagagaac
aaatcccagtgttgctga
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_7|167_aa
MVKGKTHVLHGSRTPQRVLASKKTQTRCSPLTLDFPDSRILEVLARAIKQEKEIKGIQLG
KEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTES
QIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNK
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_7|504_bp
atggtgaaaggcaaaacgcacgtgttacatggcagcaggaccccacagagagtcctcgct
agcaagaagacccaaaccagatgcagccccttgaccctggacttcccagactccagaatt
ttggaagttctggccagggcaattaagcaggagaaggaaataaagggtattcaattagga
aaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaacccc
atagtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatac
aaaatcaatgtacaaaaatcacaagcgttcttatacaccaataacagacaaacagagagc
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatc
caacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaa
ataaaagaggatacaaacaaatag
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_8|123_aa
MANHLMIELGLLLVMLMSGTCSKQELAVTEWTHFHLSDTLPDTDKHSRPQAAEPAPRAPP
PLSGRWTRDAVSWQRRRGAERARRLFAQSSALHRSASAMASKQTKKKEVHRINSAHGSDK
SKE
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_8|372_bp
atggccaaccatctgatgatagaactgggactgctgttggtcatgttgatgagtgggaca
tgtagcaaacaagaacttgctgtaactgaatggacacattttcacctgtctgacacccta
cctgacactgacaaacattcaagaccacaggccgcagagcccgcccccagggccccgccc
ccgctctccgggcgctggacgcgggacgccgtctcctggcaacgcagacgcggagccgag
cgcgcccgcaggctctttgctcagagctcagccctacatagatcggcttctgccatggcc
tccaaacaaaccaaaaagaaagaggtgcatcgtatcaactcggcgcacggatcggataaa
tcgaaagagtaa
>gi568815592f:146443872_146654602|GENSCAN_predicted_peptide_9|177_aa
MVRFGEGCLRQGSPPAGAADQYQSMACKDLGCAAGGSSAFLVHPIAPLERSQGGLPMKIP
KGEEIDRPPLISSNHHGNHSFYPFGSNVQSGSTEQKKGKFPLWPEWSEADINSEKWDAGK
GAKEKDKTGKSPVFMLLSSHRQRMIARDDAMNVREGKFPETKAVLVAAGPRYGDNQP
>gi568815592f:146443872_146654602|GENSCAN_predicted_CDS_9|534_bp
atggtcaggttcggggaaggctgtctacgtcaagggtccccacccgctggggctgcggac
cagtaccagtccatggcctgtaaggacctgggctgtgcagcaggaggtagttcagctttc
ctggtacacccaattgctcccctggaaaggagtcagggtgggcttcccatgaagattcca
aaaggtgaggagattgaccgtccacctctcatttcctccaatcaccatggaaaccacagt
ttctatccttttggcagtaatgtacaatctggttctactgaacaaaagaaggggaaattc
ccactctggccagagtggagtgaagctgacataaattcagaaaagtgggatgcaggcaaa
ggtgcaaaagaaaaggacaaaacaggaaaaagccctgtatttatgttgctttcctcacac
agacagagaatgatagcaagagatgatgccatgaacgtgagggaagggaaattccctgag
acaaaggcagtcttggtagctgctggtcccaggtatggggacaaccagccatga