GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:20:13 Sequence gi568815581r:57905750_58107137 : 201388 bp : 45.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6293 6487 195 0 0 64 77 93 0.453 5.19 1.02 Intr + 7729 7809 81 2 0 28 108 54 0.513 1.01 1.03 Term + 7936 8000 65 2 2 73 37 58 0.600 -2.75 1.04 PlyA + 9125 9130 6 1.05 2.00 Prom + 12670 12709 40 -1.56 2.01 Init + 13831 13883 53 0 2 88 92 36 0.926 4.63 2.02 Intr + 20476 20586 111 1 0 45 86 84 0.303 3.39 2.03 Intr + 28228 28275 48 1 0 93 81 27 0.034 0.30 2.04 Intr + 34744 34859 116 1 2 102 -7 91 0.054 1.09 2.05 Intr + 48096 48298 203 1 2 87 81 42 0.875 2.40 2.06 Intr + 48421 48492 72 0 0 76 75 95 0.944 6.60 2.07 Intr + 51644 51724 81 1 0 99 76 6 0.527 0.33 2.08 Term + 54196 54417 222 0 0 65 44 145 0.906 4.72 2.09 PlyA + 54929 54934 6 1.05 3.07 PlyA - 55850 55845 6 1.05 3.06 Term - 57083 57019 65 0 2 93 47 50 0.619 -0.55 3.05 Intr - 59679 59572 108 1 0 49 94 68 0.448 3.76 3.04 Intr - 73564 73403 162 2 0 46 116 198 0.986 18.35 3.03 Intr - 75037 74854 184 1 1 69 56 74 0.907 1.66 3.02 Intr - 77644 77018 627 1 0 60 56 485 0.900 34.91 3.01 Init - 82362 82330 33 0 0 112 84 43 0.006 5.44 3.00 Prom - 83631 83592 40 -4.66 4.00 Prom + 95993 96032 40 -0.96 4.01 Sngl + 99626 99829 204 1 0 79 40 320 0.978 21.49 4.02 PlyA + 99919 99924 6 -3.74 5.04 PlyA - 99957 99952 6 -5.80 5.03 Term - 100224 99998 227 1 2 26 42 132 0.234 -0.66 5.02 Intr - 100778 100594 185 1 2 86 105 188 0.710 19.73 5.01 Init - 101388 101195 194 0 2 92 77 408 0.975 38.54 5.00 Prom - 132423 132384 40 -0.56 6.08 PlyA - 137690 137685 6 1.05 6.07 Term - 142239 142129 111 0 0 108 50 59 0.749 2.66 6.06 Intr - 147636 147460 177 0 0 22 81 95 0.368 2.32 6.05 Intr - 152015 151931 85 1 1 57 70 28 0.075 -2.28 6.04 Intr - 153809 153711 99 1 0 119 99 14 0.409 4.83 6.03 Intr - 155055 154874 182 0 2 113 81 39 0.420 4.37 6.02 Intr - 162138 162019 120 0 0 128 90 6 0.729 5.49 6.01 Init - 162728 162678 51 2 0 65 52 30 0.283 -2.58 6.00 Prom - 164227 164188 40 -5.26 7.00 Prom + 170775 170814 40 -2.36 7.01 Init + 173248 173369 122 0 2 54 93 40 0.204 0.76 7.02 Intr + 175305 175391 87 0 0 103 52 43 0.202 1.29 7.03 Intr + 177907 177941 35 1 2 110 64 67 0.252 4.37 7.04 Intr + 178136 178567 432 0 0 70 28 247 0.504 10.52 7.05 Intr + 178923 179005 83 1 2 31 99 48 0.855 -0.44 7.06 Intr + 181333 181473 141 0 0 81 75 278 0.991 26.35 7.07 Term + 183393 183530 138 2 0 121 43 78 0.889 4.56 7.08 PlyA + 183833 183838 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 46758 46806 49 2 1 72 95 28 0.834 3.10 S.002 Init - 79510 79505 6 1 0 68 92 5 0.969 -0.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_1|113_aa XAKNYNSMNGRSLQKVLCEDPSQPQPNAPTPGPGDRYTHTTLAPVAGAGACQQAHTVAVT PSCNHEKGPCVFVNSTESEFPEVHPNPIPKIPRISSEEISSFSSELTSPPGTC >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_1|342_bp naggccaagaattacaacagcatgaatggcagaagcctacagaaggtgctctgtgaggat ccctcccaaccgcagcccaacgcccccacccctggccctggagaccgctacacacacaca accttggccccagttgcaggtgctggtgcctgccaacaagcacacacagttgcagtcaca ccctcgtgtaaccatgaaaaaggcccatgtgtctttgtcaattctactgaatcagaattt cctgaggtgcacccaaaccccatccccaagatcccaaggatttctagcgaggaaatatca tccttctcgtctgaactgaccagtcctccagggacctgctga >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_2|301_aa MIKGLSSHTDKPTKPCTYKELKFTITKGASAWPPAAGILLRHWRGPKKAHTKPKRQGKEE CSVKKEYLKPSNLEGPIFQMMWLQMEVHCLTYKTFHEQKINLDGVKPLGSAESGQRVSHP ALLHTLQLPGDPSQVSPSSTSPSQRWHHIQTEVPVALTPNNREDPASHLIQQGPNHMVAQ VLRDHPGKAHSSRYKTLTADQTPPPTPTTASEPPGLQPLTPAAYRGPPTECEGDGSLLLS LWLYEAQIAGAYGIIYCENEADGEPLGSGLEMVSSDRTVGKTGHFPSEGPPAQGKCQGMN G >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_2|906_bp atgatcaaaggcttaagctcccacaccgacaaacccactaagccatgcacatacaaggag ctcaagttcaccatcaccaaaggtgcctctgcatggcctcctgctgctggcattttgctg agacactggagaggaccgaaaaaggcccacaccaagccaaaaagacaaggcaaagaggaa tgcagtgtcaagaaagaatatttaaagccaagtaaccttgaaggccctatattccagatg atgtggctacagatggaggtgcactgcctgacctacaagactttccacgagcaaaaaata aacctagatggtgtcaagcccctgggatctgcagaaagtggacagcgggtttcacaccca gccctgctgcacacgctccagcttcctggagacccgagccaggtttctcccagcagcacc tctccatcccagagatggcaccatattcaaactgaggtaccagtggcactgacccccaac aacagagaggacccagcttcacacttgatccagcaggggcccaaccacatggtggctcag gtgctgcgggatcacccagggaaggctcattcctctcgctacaagacgctgactgcggat cagacacctccacccactcccaccactgcctctgagcccccagggctccagcccctcaca ccagccgcctacagagggccacctactgaatgtgaaggtgatggaagccttttgctaagt ctgtggctgtatgaagctcagatcgctggagcatatggcattatctactgtgagaatgag gctgatggagagccactgggctcaggcttggaaatggtgagctctgatcggaccgtgggg aaaacaggacatttcccttctgaaggtcccccagcccagggcaaatgccagggcatgaat ggttga >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_3|392_aa MEANWTAFLFQAHEASHHQQQAAQNSLLPLLSSAVEPPDQKPLLPIPITQKPQGAPETLK DAIGIKKEKPKTSFVCTYCSKAFRDSYHLRRHESCHTGIKLVSRPKKTPTTVVPLISTIA GDSSRTSLVSTIAGILSTVTTSSSGTNPSSSASTTAMPVTQSVKKPSKPVKKNHACEMCG KAFRDVYHLNRHKLSHSDEKPFECPICNQRFKRKDRMTYHTCTAAFATKDRLRTHMVRHE GKVSCNICGKLLSAAYITSHLKTHGQSQSINCNTCKQGISKTCMSEETSNQKQQQQQQQQ QQQQQQQQQQHVTSWPGKQVETLRLWEEAVKARKKGMAESVRLLYRISKSQLYESMGPGL WELWGLRFAYPVGDVPLYATISCCASQPATIM >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_3|1179_bp atggaggccaactggaccgcgttcctgttccaggcccatgaagcttcccatcaccaacag caggcagcacagaacagcttgctgcccctcctgagctctgccgtggagccccctgatcag aaaccattgcttccaataccaataactcagaaacctcagggtgcaccagaaacattaaag gatgccattgggattaaaaaagaaaaacccaaaacttcatttgtgtgcacttactgcagt aaagctttcagggacagctatcacctgaggcgccacgaatcctgccacacagggatcaag ttggtgtcccggccaaagaaaacccccaccacggtggttccccttatctctaccatcgct ggggacagcagccgaacttcgttggtctcgaccattgcaggcatcttgtcaacagtcact acatcttcctcgggcaccaaccccagtagcagtgccagcaccacagctatgccagtgacc cagtctgtcaagaaacccagtaagcctgtcaagaagaaccatgcttgtgagatgtgtggg aaggccttccgagatgtgtaccatctcaatcgacacaagctctcccattcagatgagaaa ccctttgagtgtcctatttgtaatcagcgcttcaagaggaaggaccggatgacttaccat acgtgcactgctgcctttgccaccaaagacagactgcggacacacatggtgcgccatgaa ggcaaggtatcatgtaacatctgtgggaagctcctgagtgcagcatacatcaccagccac ttaaagactcatgggcagagccaaagtatcaactgtaatacatgtaaacaaggcatcagt aaaacatgcatgagtgaagagaccagtaaccaaaagcagcagcagcagcagcagcagcag cagcagcagcagcaacaacaacaacaacaacatgtgacaagctggccagggaagcaagta gaaacactgagactgtgggaagaagctgttaaagcaaggaagaaagggatggctgaatct gtccgtttattgtatcgcatatccaaatcacagctgtacgagtcaatgggaccaggctta tgggaactgtgggggctgcgttttgcatacccagtgggagatgtgcccctctatgctacc atctcctgctgtgcttctcaacctgccacgataatgtga >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_4|67_aa MGSTKSVTNHLMYESEICYDGENSVVILCFSLGSNCDSCCCFCYGFCYDYGFEIEIFHNL DFWAHQL >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_4|204_bp atgggttctacaaaaagtgtcaccaatcatcttatgtacgagagcgagatctgctatgac ggggagaatagcgtggtgatcctctgcttctccttggggagtaactgcgactcctgctgt tgcttctgctacggcttctgctacgactacggcttcgagatcgagatcttccataacttg gacttctgggcccatcaactttaa >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_5|201_aa MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF RSHEVGYTRILFFDQNWIQWS >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_5|606_bp atgtcgggaggtggtgtgattcgtggccccgcagggaacaacgattgccgcatctacgtg ggtaacttacctccagacatccgaaccaaggacattgaggacgtgttctacaaatacggc gctatccgcgacatcgacctcaagaatcgccgcgggggaccgcccttcgccttcgttgag ttcgaggacccgcgagacgcggaagacgcggtgtatggtcgcgacggctatgattacgat gggtaccgtctgcgggtggagtttcctcgaagcggccgtggaacaggccgaggcggcggc gggggtggaggtggcggagctccccgaggtcgctatggccccccatccaggcggtctgaa aacagagtggttgtctctggactgcctccaagtggaagttggcaggatttaaaggatcac atgcgtgaagcaggtgatgtatgttatgctgatgtttaccgagatggcactggtgtcgtg gagtttgtacggaaagaagatatgacctatgcagttcgaaaactggataacactaagttt agatctcatgaggtaggttatacacgtattcttttctttgaccagaattggatacagtgg tcttaa >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_6|274_aa MLFTVPGTLLPLFACLKEGQTALALLPGKHPGVYPTFSARAILRILAGDEHLKSGVECTV ALFSCQLGSSSLLLLSTPPREKGEESALIAWRTVQQIHVRASLGMSPKLPRLLVRRKRSL SPRCETDPKTPCNFLLKISTKKHSHSHSSSWSSRTSGKEHLPTWIWQPKPSKPPEHHVQY ADGNDPVEMENLMLQMREWIIAGVMLFNKQEKKGSSVREEGLASDRHMDDSFIMRGGKQN EDQMFSGVAPSSSFIRSTNMHHAIMLYCSGQSFC >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_6|825_bp atgctgttcactgtgcctggaacactcttgcccctcttcgcctgcttaaaggaaggacaa acagcactggctctcctcccaggaaagcaccctggcgtttaccccacattttcagcaaga gccatattgaggattcttgctggagatgagcatcttaagagtggggtggagtgcacagtc gctctgttctcctgccaacttggcagctcatcgctgctgttactatcaacacctccaagg gaaaagggagaggagtcagctctgatcgcctggcgcactgtgcagcaaattcatgtcaga gccagccttggaatgtctcccaagctcccaaggctcctggtgaggaggaaaaggagcctt tctcccagatgtgaaactgatcctaagaccccctgtaatttcctgctgaaaatctccacc aagaaacacagccacagccacagttcttcctggagttcaagaacctcaggcaaagaacac ctgcccacatggatctggcagcccaagccctctaagcccccagagcatcatgttcaatat gctgatgggaatgatccagttgagatggaaaatttgatgctgcagatgagagagtggata attgctggagtgatgctttttaacaagcaagagaagaagggatccagtgtgcgagaggag gggctggcctcagacagacacatggacgattcattcatcatgagaggaggaaagcaaaat gaggaccagatgttctctggtgttgctcccagctcttcattcattcgctccacaaatatg caccatgctatcatgctctactgctctgggcagtcattctgctag >gi568815581r:57905750_58107137|GENSCAN_predicted_peptide_7|345_aa MSYKVTQEISKELRWISAVLVTWDLRGETGTRVCIAMGVSRMQYSIMAKIVGSGNNLELN PAADKLDNLGAQCGLASVKVSARAAPPFAPATLVALLEARAAELPLPPPPGVSLRCVEAP DVRGRGRGGGEGSGVTRLLLPPPPPSGASFLQRGRPVTPALFQAGWGRAADRPGPADLAG GAGGRSGGRRMLPRSPGAALRDRSALEDGIVEGYGPLGEPEEGSRSSFLESKVAFNYCNK VAPLDFGNEAVEQCHTMSDRKAVIKNADMSEDMQQDAVDCATQAMEKYNIEKDIAAYIKK EFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLGQVAILLFKSG >gi568815581r:57905750_58107137|GENSCAN_predicted_CDS_7|1038_bp atgtcttacaaggtcacccaagaaatcagcaaagaattgaggtggatttcagcagttcta gtcacatgggacttgagaggggagactggcaccagggtctgcattgcaatgggagtgtca aggatgcagtatagtataatggctaagattgtaggcagtggaaacaatctggagttgaat cctgccgctgacaaactggataatcttggggcccagtgcggactcgcctccgtgaaggtg agcgcccgggctgcgccaccctttgccccggccaccctcgtggcgctgctggaggcccgg gctgcggagctgccgctgcccccgcccccgggagtctctctgcgctgcgtggaggcccct gatgtcaggggccggggaagagggggtggagagggctcgggcgtgacgcggctcctcctg ccgccaccgccaccctctggggcgtccttcctgcagcgaggacgcccagtcactccggcg ctgttccaggccgggtgggggagggcggccgaccggccggggcctgcggacctggccggc ggcgcgggcgggaggtcggggggaaggaggatgcttcctcgttccccaggtgctgccctt cgggaccgcagcgctctggaggatgggatagtggagggatacggcccacttggagagccc gaagagggaagtagaagctctttcctggagagtaaggtggcttttaattactgtaacaaa gttgctcctttggactttgggaatgaggcagtggagcagtgtcacaccatgtctgaccgg aaggcagtgatcaagaacgcagacatgtctgaggacatgcaacaggatgccgttgactgc gccacgcaggccatggagaagtacaatatagagaaggacattgctgcctatatcaagaag gaatttgacaagaaatataaccctacctggcattgtatcgtgggccgaaattttggcagc tacgtcacacacgagacaaagcacttcatctatttttacttgggtcaagttgcaatcctc ctcttcaagtcaggctag