GENSCAN 1.0 Date run: 13-Jun-118 Time: 19:13:20

Sequence gi568815589r:74966755_75177581 : 210827 bp : 39.65% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   9108   9187   80  1  2  119   50   101 0.733   6.35
 1.02 PlyA +  11458  11463    6                               1.05

 2.15 PlyA -  14066  14061    6                               1.05
 2.14 Term -  16836  16778   59  1  2   91   45    69 0.875  -0.23
 2.13 Intr -  17114  17027   88  2  1  103   68    55 0.891   3.62
 2.12 Intr -  18256  18153  104  2  2   61   94    81 0.811   4.97
 2.11 Intr -  29770  29693   78  2  0   96   97     2 0.452   0.50
 2.10 Intr -  32022  31844  179  2  2   86   92   116 0.754  10.44
 2.09 Intr -  61797  61258  540  2  0   50   90   477 0.100  35.19
 2.08 Intr -  61935  61855   81  2  0   83   97   108 0.939   9.03
 2.07 Intr -  70640  70532  109  0  1    6  116    64 0.789  -0.58
 2.06 Intr -  73334  73212  123  0  0   39   45   149 0.019   5.34
 2.05 Intr -  83006  82886  121  2  1   91   40   119 0.896   6.55
 2.04 Intr -  89244  89127  118  2  1   65   78    63 0.016   2.55
 2.03 Intr -  89430  89379   52  1  1   71    5    79 0.000  -5.05
 2.02 Intr - 100086 100003   84  1  0   93   84   102 0.671   9.17
 2.01 Init - 102290 102242   49  2  1   33  111    34 0.362   1.37
 2.00 Prom - 110306 110267   40                              -6.25

 3.00 Prom + 112867 112906   40                              -5.25
 3.01 Init + 116497 116618  122  0  2   67   60    84 0.619   3.44
 3.02 Term + 117477 117639  163  0  1   89   38   109 0.699   2.43
 3.03 PlyA + 118040 118045    6                               1.05

 4.00 Prom + 118921 118960   40                              -6.65
 4.01 Init + 121077 121241  165  2  0  104  101    74 0.964   9.98
 4.02 Intr + 121285 121401  117  0  0   41   91    87 0.907   4.04
 4.03 Intr + 121801 121986  186  0  0  102  102    40 0.515   5.76
 4.04 Term + 131429 131587  159  1  0  109   45    60 0.341   0.76
 4.05 PlyA + 133164 133169    6                               1.05

 5.00 Prom + 145840 145879   40                              -4.25
 5.01 Init + 151595 151724  130  1  1   93  105    38 0.368   6.36
 5.02 Intr + 152209 152471  263  2  2   63   42   129 0.233   2.08
 5.03 Intr + 160815 160865   51  2  0   79  115    49 0.718   4.89
 5.04 Intr + 163824 163887   64  2  1   94  115    -2 0.516   0.47
 5.05 Intr + 165016 165069   54  2  0   63   92    57 0.425   1.63
 5.06 Intr + 167592 167641   50  1  2   63  110    14 0.473  -1.42
 5.07 Intr + 170784 170862   79  2  1   63   98   117 0.932   8.41
 5.08 Intr + 174080 174178   99  0  0   49   68   104 0.874   3.66
 5.09 Term + 183254 183519  266  0  2   83   42   218 0.024  11.39
 5.10 PlyA + 183814 183819    6                               1.05

 6.00 Prom + 184529 184568   40                              -5.85
 6.01 Init + 195490 195619  130  0  1   40   46   149 0.507   6.26
 6.02 Intr + 200840 200877   38  0  2   84   62    36 0.102  -2.34
 6.03 Intr + 208640 208811  172  1  1   68   50   125 0.140   5.29
 6.04 Term + 208866 209002  137  1  2   37   46   139 0.247   1.80
 6.05 PlyA + 209022 209027    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 179929 179987   59  2  2  130   31   109 0.967   6.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_1|26_aa
XRKKKCYDNRAASPMVLAKRRYGCDC

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_1|81_bp
ngcagaaagaagaaatgttatgacaacagagcagcaagtcccatggtcctagctaaaagg
cggtacggctgtgactgctga

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_2|594_aa
MYLKYRQEMQDITWEVVYLDGTKSEEDLFLQVYEDLIQELAKQKLNPQPSSVRFMAIVNM
SREQIKEPGKARALSGQNEAFCSTLLEPSQGCKPFHQVIHGRCCKAQKRRSLVLAYDHPD
NTGFRTCCSAGLQLLEIADSFGMVRPADDEMTANEETACDSQFPRGKGHNKPHRATRGST
SVGTTYALNEFTFYWETRYFSSVRNLVTEEGVTLSGMISAWDRVQPPAQQSGTRRLLSPT
HAAAHRGLGPALGSVVVTSARCLRSPAANERRPHGPSAYQGTRMRIGSGSPEDLAEPSAD
GGLRVAGAERAASWGHVRGGRRLEEGQAAATTLSVAERRKRARPRAAAAMQRRRRPPPPT
SRLPEGCGGGGGGSEEVEVQFSAGRWGSAAAVSAAAAAATRSTEEEEERLEREHFWKIIN
AFRYYGCSEINKYKLYPWIHQFSNNRRSADQIRPIFFPDVDPHSLPPGSNFSMTAGDFQE
IYSECNTAHNVIDYIDTIWKILKPGGIWINLGPLLYHFENLANELSIELSYEDIKNVVLQ
YGFKVEVEKESVLSTYTVNDLSMMKYYYECVLFVVQIYYAMQICTTQVKMEEMS

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_2|1785_bp
atgtatctaaagtacagacaagaaatgcaggacatcacatgggaagttgtgtacctggat
ggaacaaaatctgaagaggacctctttttgcaagtatatgaagatctaatacaagaacta
gcaaagcaaaagttaaacccacaaccttccagtgtgcgctttatggccatcgtgaacatg
tcacgggagcaaatcaaggagcctggaaaggcaagagctctgtcagggcaaaatgaagct
ttctgttccacacttttggagcctagtcaaggatgcaagccatttcatcaggttatccat
gggaggtgttgcaaggctcagaaaagaaggtcccttgttctcgcctatgatcaccctgat
aatactggatttaggacgtgttgttcagctggtctacagctgctagaaatagcagattca
tttggtatggtgaggccagcagatgatgagatgactgccaatgaagagacagcttgtgac
tcacagttcccaagagggaaggggcacaacaagccacacagggccacacggggaagcacc
agtgtcgggacaacctatgccttaaatgagttcacattctactgggaaaccagatatttt
agcagtgtgagaaacttagtaactgaggaaggagtgactctttcggggatgataagtgcc
tgggaccgcgttcagcctccagctcaacaatccggcactcgccggcttttaagtcccacc
cacgcggccgcgcacaggggccttggccccgcccttggaagcgtggttgtgacgtcagcg
cgctgtctccgaagcccggcagcaaatgagcgcagaccgcacgggccctccgcgtaccaa
ggcacacgcatgcgcattggttctgggagccctgaagacctggcggagcccagcgcggat
ggaggcctgagggtggcgggggcggagcgcgccgcgagctgggggcatgtccgcggcggg
cgccgtctagaggaaggccaagcagccgccaccacgctgtcggtcgcggagagacgcaag
cgagccaggccgagggccgcggcggcgatgcagcgacggcgtcgccctccgccgcccacc
tcccggctgcccgagggctgcgggggaggaggcggtggcagcgaggaggtggaagtgcag
ttttccgccgggcgttggggctcggccgcggcggtttcggcggcagcggcggcggccacg
cgcagcaccgaggaggaggaggagaggcttgagcgtgagcacttctggaagatcattaat
gccttccgctactacggatgttctgaaattaataaatataaactttatccttggatccat
cagtttagcaataaccggagatcagctgatcagattcgacccatctttttccctgatgtt
gacccccacagtcttcctcctggttctaacttttctatgacagcaggagattttcaagag
atttattcagaatgcaacacagctcacaatgtaattgattatattgatacaatatggaaa
atactcaagccaggtggaatttggataaatctaggtcctctgctgtaccactttgaaaat
ctggcaaatgaactttccatagaattgagctatgaggatataaaaaacgttgttctgcag
tatggattcaaggtagaggtggaaaaagaatctgtattgtcaacatatactgtgaatgat
ctctctatgatgaaatactactatgaatgtgtcttgtttgtggtccaaatatactatgcc
atgcagatttgtactacacaagtaaaaatggaggaaatgtcttga

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_3|94_aa
MTVTFQPSPALSTFPPHWLPLLPLSANHWGVLRTVKQFNVRGVARDLGRVARGKRLPAKL
EAPSNAPVSMICVHTLPGEGLCRKKPGGKRAIVV

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_3|285_bp
atgacagtgacctttcagcccagccccgcattatccacattcccgccgcactggcttccc
ctgctaccactgagtgccaaccactggggtgtgctcaggactgtcaaacagttcaacgtc
aggggggttgccagggacctaggccgtgtagccaggggcaagcggctgcctgccaagctg
gaggctcccagcaatgctccagtttctatgatctgtgtccacactctccctggggagggg
ttatgcagaaaaaagcctggtgggaagagggctattgttgtctga

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_4|208_aa
MSTRPPRQATYQLCEGGKSPGKAFRAQPSPGLQHRNPLETPPGPSKAARRGRGEQRQRRQ
PSARLYPVRRYPQLAPREAHSPFPHAPPGLEGCQPRVGAGPRRRTVVSQTKRTGVPGVPG
GGQAVGFSAGSLGFAAPGSEMSKPPPKPVKPGEGGKTPGPFSMIHFPTPSYSVLSSMWIE
TYGGIKHTKQRLFISPDFHEKQQRDGKG

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_4|627_bp
atgagtacacgtcctcctcggcaggccacctatcagctctgtgagggtggcaagagtcca
ggcaaggctttcagggcgcaaccaagtcccgggctccagcacagaaaccctctggaaaca
cctccaggccccagcaaggcggcgcggcgcggcaggggcgagcagcggcagaggcgacag
ccaagcgcgcgcctgtacccagtgcgccgctacccgcagctcgctccccgagaggcccac
agcccctttccccacgctcctcctgggttagagggatgccagccaagggtgggcgccggt
cctaggaggcgcacggttgtaagccagacaaaaagaactggggtgcccggagtgccaggt
ggcgggcaagcggtgggcttttcggcggggtctttaggatttgcagctccaggaagcgag
atgtcgaagccgccacccaaaccagtcaaaccaggtgagggaggtaagactcctgggcca
ttttcaatgatccacttcccaactcccagctactcagtgctcagcagcatgtggatagaa
acatatggaggaattaaacacaccaagcaaaggcttttcataagtccagatttccatgag
aaacaacaaagagatgggaaaggatag

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_5|351_aa
MAWKVANMGEGSGRLLPRVGVDWKGRAFRACGSHLDFALGKIGGTRAFSLWAVSLLPASP
GFAPDPAVPKVRGNSEGHTEQVTLSISSLPSGQGLPSNPKSVFYLLREFSYLTAIDSSCE
SGDTILQWNHAPDELYFEEGDIIYITDMSDTNWWKGTSKGRTGLIPSNYVAEQAESIDNP
LHEAAKRDIVEMLFTQPNIELNQQNKLGDTALHAAAWKGYADIVQLLLAKGARTDLRNIE
KKLAFDMATNAACASLLKKKQGTVPVRDIFTDMMSFDLCSNLRRGMLMPQLFTETLRFEE
VKELVEGGHTGLGNPRAIACCNSQMSSFTFSQKNLQKQDSFFPTEALDIGI

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_5|1056_bp
atggcttggaaagttgcaaacatgggagagggtagtgggagactacttccgagagttggg
gtggactggaagggcagagcatttagggcctgtggatctcatttggattttgctctgggt
aagattggagggaccagagctttctctctgtgggcagtcagcttgctgcccgcttcacct
gggtttgccccagaccctgctgtccccaaagttagaggcaactcagaaggtcacacagaa
caagtgacactgagcatcagttcactgccatcaggccagggcctgccttcaaatcccaaa
tctgtcttttacttactgcgtgaattttcttacctcactgccattgattcctcatgtgaa
agtggggataccattctgcagtggaaccatgctccagatgaattatactttgaggaaggt
gatattatctacattactgacatgagcgataccaattggtggaaaggcacctccaaaggc
aggactggactaattccaagcaactatgtggctgagcaggcagaatccattgacaatcca
ttgcatgaagcagcaaaaagagatatagtggaaatgctatttactcaaccaaatattgaa
ctgaaccagcagaacaagttgggagatacagctttgcatgctgctgcctggaagggttat
gcagatatcgtccagttgcttctggcaaaaggtgctagaacagacttaagaaacattgag
aagaagctggccttcgacatggctaccaatgctgcctgtgcatctctcctgaaaaagaaa
cagggaacagtgcctgtgcgagacattttcacagacatgatgtcatttgatctttgcagt
aaccttaggcgaggaatgttaatgccccaactttttacagagacgctgaggtttgaagaa
gtgaaggaactggttgaaggtggacacacgggtctggggaacccacgcgctattgcctgc
tgcaacagtcagatgtcatcattcacattctctcagaaaaatctgcagaaacaggacagc
ttctttccaacggaagcccttgatataggcatctaa

>gi568815589r:74966755_75177581|GENSCAN_predicted_peptide_6|158_aa
MFPPFDSIIVNNAAMNVAVQVPGQVPVLNSVRYIPTSGIPGPYVSHPHTESTDQLKVEFN
SPTWLLRHASLVECSGSDTLGLKALLKEDFQLLLRSLGTQLPDFRGTLAALWEADSQHQL
AGHGSQRSWKWIPSLKEINLVGALWSGNECSAKPLPDP

>gi568815589r:74966755_75177581|GENSCAN_predicted_CDS_6|477_bp
atgtttccaccttttgacagcattattgtgaataatgctgcaatgaatgttgctgtacaa
gtacctggtcaagtccctgttttaaattctgttcggtatatacctacgagtggaattcct
gggccatatgttagccatcctcacactgagtctacagatcaactcaaggtagagtttaat
tctcctacctggcttttgcgacacgcatcactagtagaatgcagtggaagtgacactctg
ggactgaaggctttgttaaaagaagacttccagcttctactgaggtctcttggaactcag
ttgccagactttcggggaactttagcagccctgtgggaagctgacagccagcatcaattg
gccggtcatgggagtcagcgttcttggaagtggatccccagcctcaaagaaatcaactta
gttggtgctttgtggagcggaaatgaatgctcagccaaacccttaccagatccctga