GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:22:59 Sequence gi568815581r:57905409_58107137 : 201729 bp : 45.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6634 6828 195 2 0 64 77 93 0.450 5.19 1.02 Intr + 8070 8150 81 1 0 28 108 54 0.512 1.01 1.03 Term + 8277 8341 65 1 2 73 37 58 0.600 -2.75 1.04 PlyA + 9466 9471 6 1.05 2.00 Prom + 13011 13050 40 -1.56 2.01 Init + 14172 14224 53 2 2 88 92 36 0.926 4.63 2.02 Intr + 20817 20927 111 0 0 45 86 84 0.303 3.39 2.03 Intr + 28569 28616 48 0 0 93 81 27 0.034 0.30 2.04 Intr + 35085 35200 116 0 2 102 -7 91 0.054 1.09 2.05 Intr + 48437 48639 203 0 2 87 81 42 0.875 2.40 2.06 Intr + 48762 48833 72 2 0 76 75 95 0.944 6.60 2.07 Intr + 51985 52065 81 0 0 99 76 6 0.527 0.33 2.08 Term + 54537 54758 222 2 0 65 44 145 0.906 4.72 2.09 PlyA + 55270 55275 6 1.05 3.07 PlyA - 56191 56186 6 1.05 3.06 Term - 57424 57360 65 2 2 93 47 50 0.619 -0.55 3.05 Intr - 60020 59913 108 0 0 49 94 68 0.448 3.76 3.04 Intr - 73905 73744 162 1 0 46 116 198 0.986 18.35 3.03 Intr - 75378 75195 184 0 1 69 56 74 0.907 1.66 3.02 Intr - 77985 77359 627 0 0 60 56 485 0.900 34.91 3.01 Init - 82703 82671 33 2 0 112 84 43 0.006 5.44 3.00 Prom - 83972 83933 40 -4.66 4.00 Prom + 96334 96373 40 -0.96 4.01 Sngl + 99967 100170 204 0 0 79 40 320 0.978 21.49 4.02 PlyA + 100260 100265 6 -3.74 5.04 PlyA - 100298 100293 6 -5.80 5.03 Term - 100565 100339 227 0 2 26 42 132 0.234 -0.66 5.02 Intr - 101119 100935 185 0 2 86 105 188 0.710 19.73 5.01 Init - 101729 101536 194 2 2 92 77 408 0.975 38.54 5.00 Prom - 132764 132725 40 -0.56 6.08 PlyA - 138031 138026 6 1.05 6.07 Term - 142580 142470 111 2 0 108 50 59 0.749 2.66 6.06 Intr - 147977 147801 177 2 0 22 81 95 0.368 2.32 6.05 Intr - 152356 152272 85 0 1 57 70 28 0.075 -2.28 6.04 Intr - 154150 154052 99 0 0 119 99 14 0.409 4.83 6.03 Intr - 155396 155215 182 2 2 113 81 39 0.420 4.37 6.02 Intr - 162479 162360 120 2 0 128 90 6 0.729 5.49 6.01 Init - 163069 163019 51 1 0 65 52 30 0.283 -2.58 6.00 Prom - 164568 164529 40 -5.26 7.00 Prom + 171116 171155 40 -2.36 7.01 Init + 173589 173710 122 2 2 54 93 40 0.204 0.76 7.02 Intr + 175646 175732 87 2 0 103 52 43 0.202 1.29 7.03 Intr + 178248 178282 35 0 2 110 64 67 0.252 4.37 7.04 Intr + 178477 178908 432 2 0 70 28 247 0.504 10.52 7.05 Intr + 179264 179346 83 0 2 31 99 48 0.855 -0.44 7.06 Intr + 181674 181814 141 2 0 81 75 278 0.991 26.35 7.07 Term + 183734 183871 138 1 0 121 43 78 0.889 4.56 7.08 PlyA + 184174 184179 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 47099 47147 49 1 1 72 95 28 0.834 3.10 S.002 Init - 79851 79846 6 0 0 68 92 5 0.969 -0.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_1|113_aa XAKNYNSMNGRSLQKVLCEDPSQPQPNAPTPGPGDRYTHTTLAPVAGAGACQQAHTVAVT PSCNHEKGPCVFVNSTESEFPEVHPNPIPKIPRISSEEISSFSSELTSPPGTC >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_1|342_bp naggccaagaattacaacagcatgaatggcagaagcctacagaaggtgctctgtgaggat ccctcccaaccgcagcccaacgcccccacccctggccctggagaccgctacacacacaca accttggccccagttgcaggtgctggtgcctgccaacaagcacacacagttgcagtcaca ccctcgtgtaaccatgaaaaaggcccatgtgtctttgtcaattctactgaatcagaattt cctgaggtgcacccaaaccccatccccaagatcccaaggatttctagcgaggaaatatca tccttctcgtctgaactgaccagtcctccagggacctgctga >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_2|301_aa MIKGLSSHTDKPTKPCTYKELKFTITKGASAWPPAAGILLRHWRGPKKAHTKPKRQGKEE CSVKKEYLKPSNLEGPIFQMMWLQMEVHCLTYKTFHEQKINLDGVKPLGSAESGQRVSHP ALLHTLQLPGDPSQVSPSSTSPSQRWHHIQTEVPVALTPNNREDPASHLIQQGPNHMVAQ VLRDHPGKAHSSRYKTLTADQTPPPTPTTASEPPGLQPLTPAAYRGPPTECEGDGSLLLS LWLYEAQIAGAYGIIYCENEADGEPLGSGLEMVSSDRTVGKTGHFPSEGPPAQGKCQGMN G >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_2|906_bp atgatcaaaggcttaagctcccacaccgacaaacccactaagccatgcacatacaaggag ctcaagttcaccatcaccaaaggtgcctctgcatggcctcctgctgctggcattttgctg agacactggagaggaccgaaaaaggcccacaccaagccaaaaagacaaggcaaagaggaa tgcagtgtcaagaaagaatatttaaagccaagtaaccttgaaggccctatattccagatg atgtggctacagatggaggtgcactgcctgacctacaagactttccacgagcaaaaaata aacctagatggtgtcaagcccctgggatctgcagaaagtggacagcgggtttcacaccca gccctgctgcacacgctccagcttcctggagacccgagccaggtttctcccagcagcacc tctccatcccagagatggcaccatattcaaactgaggtaccagtggcactgacccccaac aacagagaggacccagcttcacacttgatccagcaggggcccaaccacatggtggctcag gtgctgcgggatcacccagggaaggctcattcctctcgctacaagacgctgactgcggat cagacacctccacccactcccaccactgcctctgagcccccagggctccagcccctcaca ccagccgcctacagagggccacctactgaatgtgaaggtgatggaagccttttgctaagt ctgtggctgtatgaagctcagatcgctggagcatatggcattatctactgtgagaatgag gctgatggagagccactgggctcaggcttggaaatggtgagctctgatcggaccgtgggg aaaacaggacatttcccttctgaaggtcccccagcccagggcaaatgccagggcatgaat ggttga >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_3|392_aa MEANWTAFLFQAHEASHHQQQAAQNSLLPLLSSAVEPPDQKPLLPIPITQKPQGAPETLK DAIGIKKEKPKTSFVCTYCSKAFRDSYHLRRHESCHTGIKLVSRPKKTPTTVVPLISTIA GDSSRTSLVSTIAGILSTVTTSSSGTNPSSSASTTAMPVTQSVKKPSKPVKKNHACEMCG KAFRDVYHLNRHKLSHSDEKPFECPICNQRFKRKDRMTYHTCTAAFATKDRLRTHMVRHE GKVSCNICGKLLSAAYITSHLKTHGQSQSINCNTCKQGISKTCMSEETSNQKQQQQQQQQ QQQQQQQQQQHVTSWPGKQVETLRLWEEAVKARKKGMAESVRLLYRISKSQLYESMGPGL WELWGLRFAYPVGDVPLYATISCCASQPATIM >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_3|1179_bp atggaggccaactggaccgcgttcctgttccaggcccatgaagcttcccatcaccaacag caggcagcacagaacagcttgctgcccctcctgagctctgccgtggagccccctgatcag aaaccattgcttccaataccaataactcagaaacctcagggtgcaccagaaacattaaag gatgccattgggattaaaaaagaaaaacccaaaacttcatttgtgtgcacttactgcagt aaagctttcagggacagctatcacctgaggcgccacgaatcctgccacacagggatcaag ttggtgtcccggccaaagaaaacccccaccacggtggttccccttatctctaccatcgct ggggacagcagccgaacttcgttggtctcgaccattgcaggcatcttgtcaacagtcact acatcttcctcgggcaccaaccccagtagcagtgccagcaccacagctatgccagtgacc cagtctgtcaagaaacccagtaagcctgtcaagaagaaccatgcttgtgagatgtgtggg aaggccttccgagatgtgtaccatctcaatcgacacaagctctcccattcagatgagaaa ccctttgagtgtcctatttgtaatcagcgcttcaagaggaaggaccggatgacttaccat acgtgcactgctgcctttgccaccaaagacagactgcggacacacatggtgcgccatgaa ggcaaggtatcatgtaacatctgtgggaagctcctgagtgcagcatacatcaccagccac ttaaagactcatgggcagagccaaagtatcaactgtaatacatgtaaacaaggcatcagt aaaacatgcatgagtgaagagaccagtaaccaaaagcagcagcagcagcagcagcagcag cagcagcagcagcaacaacaacaacaacaacatgtgacaagctggccagggaagcaagta gaaacactgagactgtgggaagaagctgttaaagcaaggaagaaagggatggctgaatct gtccgtttattgtatcgcatatccaaatcacagctgtacgagtcaatgggaccaggctta tgggaactgtgggggctgcgttttgcatacccagtgggagatgtgcccctctatgctacc atctcctgctgtgcttctcaacctgccacgataatgtga >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_4|67_aa MGSTKSVTNHLMYESEICYDGENSVVILCFSLGSNCDSCCCFCYGFCYDYGFEIEIFHNL DFWAHQL >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_4|204_bp atgggttctacaaaaagtgtcaccaatcatcttatgtacgagagcgagatctgctatgac ggggagaatagcgtggtgatcctctgcttctccttggggagtaactgcgactcctgctgt tgcttctgctacggcttctgctacgactacggcttcgagatcgagatcttccataacttg gacttctgggcccatcaactttaa >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_5|201_aa MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF RSHEVGYTRILFFDQNWIQWS >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_5|606_bp atgtcgggaggtggtgtgattcgtggccccgcagggaacaacgattgccgcatctacgtg ggtaacttacctccagacatccgaaccaaggacattgaggacgtgttctacaaatacggc gctatccgcgacatcgacctcaagaatcgccgcgggggaccgcccttcgccttcgttgag ttcgaggacccgcgagacgcggaagacgcggtgtatggtcgcgacggctatgattacgat gggtaccgtctgcgggtggagtttcctcgaagcggccgtggaacaggccgaggcggcggc gggggtggaggtggcggagctccccgaggtcgctatggccccccatccaggcggtctgaa aacagagtggttgtctctggactgcctccaagtggaagttggcaggatttaaaggatcac atgcgtgaagcaggtgatgtatgttatgctgatgtttaccgagatggcactggtgtcgtg gagtttgtacggaaagaagatatgacctatgcagttcgaaaactggataacactaagttt agatctcatgaggtaggttatacacgtattcttttctttgaccagaattggatacagtgg tcttaa >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_6|274_aa MLFTVPGTLLPLFACLKEGQTALALLPGKHPGVYPTFSARAILRILAGDEHLKSGVECTV ALFSCQLGSSSLLLLSTPPREKGEESALIAWRTVQQIHVRASLGMSPKLPRLLVRRKRSL SPRCETDPKTPCNFLLKISTKKHSHSHSSSWSSRTSGKEHLPTWIWQPKPSKPPEHHVQY ADGNDPVEMENLMLQMREWIIAGVMLFNKQEKKGSSVREEGLASDRHMDDSFIMRGGKQN EDQMFSGVAPSSSFIRSTNMHHAIMLYCSGQSFC >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_6|825_bp atgctgttcactgtgcctggaacactcttgcccctcttcgcctgcttaaaggaaggacaa acagcactggctctcctcccaggaaagcaccctggcgtttaccccacattttcagcaaga gccatattgaggattcttgctggagatgagcatcttaagagtggggtggagtgcacagtc gctctgttctcctgccaacttggcagctcatcgctgctgttactatcaacacctccaagg gaaaagggagaggagtcagctctgatcgcctggcgcactgtgcagcaaattcatgtcaga gccagccttggaatgtctcccaagctcccaaggctcctggtgaggaggaaaaggagcctt tctcccagatgtgaaactgatcctaagaccccctgtaatttcctgctgaaaatctccacc aagaaacacagccacagccacagttcttcctggagttcaagaacctcaggcaaagaacac ctgcccacatggatctggcagcccaagccctctaagcccccagagcatcatgttcaatat gctgatgggaatgatccagttgagatggaaaatttgatgctgcagatgagagagtggata attgctggagtgatgctttttaacaagcaagagaagaagggatccagtgtgcgagaggag gggctggcctcagacagacacatggacgattcattcatcatgagaggaggaaagcaaaat gaggaccagatgttctctggtgttgctcccagctcttcattcattcgctccacaaatatg caccatgctatcatgctctactgctctgggcagtcattctgctag >gi568815581r:57905409_58107137|GENSCAN_predicted_peptide_7|345_aa MSYKVTQEISKELRWISAVLVTWDLRGETGTRVCIAMGVSRMQYSIMAKIVGSGNNLELN PAADKLDNLGAQCGLASVKVSARAAPPFAPATLVALLEARAAELPLPPPPGVSLRCVEAP DVRGRGRGGGEGSGVTRLLLPPPPPSGASFLQRGRPVTPALFQAGWGRAADRPGPADLAG GAGGRSGGRRMLPRSPGAALRDRSALEDGIVEGYGPLGEPEEGSRSSFLESKVAFNYCNK VAPLDFGNEAVEQCHTMSDRKAVIKNADMSEDMQQDAVDCATQAMEKYNIEKDIAAYIKK EFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLGQVAILLFKSG >gi568815581r:57905409_58107137|GENSCAN_predicted_CDS_7|1038_bp atgtcttacaaggtcacccaagaaatcagcaaagaattgaggtggatttcagcagttcta gtcacatgggacttgagaggggagactggcaccagggtctgcattgcaatgggagtgtca aggatgcagtatagtataatggctaagattgtaggcagtggaaacaatctggagttgaat cctgccgctgacaaactggataatcttggggcccagtgcggactcgcctccgtgaaggtg agcgcccgggctgcgccaccctttgccccggccaccctcgtggcgctgctggaggcccgg gctgcggagctgccgctgcccccgcccccgggagtctctctgcgctgcgtggaggcccct gatgtcaggggccggggaagagggggtggagagggctcgggcgtgacgcggctcctcctg ccgccaccgccaccctctggggcgtccttcctgcagcgaggacgcccagtcactccggcg ctgttccaggccgggtgggggagggcggccgaccggccggggcctgcggacctggccggc ggcgcgggcgggaggtcggggggaaggaggatgcttcctcgttccccaggtgctgccctt cgggaccgcagcgctctggaggatgggatagtggagggatacggcccacttggagagccc gaagagggaagtagaagctctttcctggagagtaaggtggcttttaattactgtaacaaa gttgctcctttggactttgggaatgaggcagtggagcagtgtcacaccatgtctgaccgg aaggcagtgatcaagaacgcagacatgtctgaggacatgcaacaggatgccgttgactgc gccacgcaggccatggagaagtacaatatagagaaggacattgctgcctatatcaagaag gaatttgacaagaaatataaccctacctggcattgtatcgtgggccgaaattttggcagc tacgtcacacacgagacaaagcacttcatctatttttacttgggtcaagttgcaatcctc ctcttcaagtcaggctag