GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:51:40 Sequence gi568815592f:95486440_95706402 : 219963 bp : 35.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4453 4570 118 0 1 94 61 52 0.057 2.22 1.02 Intr + 10390 10665 276 2 0 79 57 151 0.149 7.67 1.03 Term + 15026 15306 281 0 2 37 47 144 0.302 -0.18 1.04 PlyA + 15338 15343 6 1.05 2.00 Prom + 29512 29551 40 -3.15 2.01 Init + 32334 32439 106 2 1 112 22 69 0.654 3.13 2.02 Intr + 32529 32731 203 1 2 58 85 55 0.418 0.28 2.03 Term + 32913 33092 180 2 0 52 48 175 0.560 6.53 2.04 PlyA + 33754 33759 6 1.05 3.02 PlyA - 33881 33876 6 1.05 3.01 Sngl - 38878 38483 396 1 0 58 43 190 0.499 7.50 3.00 Prom - 69150 69111 40 -3.05 4.04 PlyA - 69268 69263 6 1.05 4.03 Term - 69913 69639 275 2 2 5 49 175 0.165 -0.05 4.02 Intr - 77496 77444 53 2 2 117 93 35 0.130 4.53 4.01 Init - 91698 91475 224 0 2 55 53 234 0.183 12.72 4.00 Prom - 96059 96020 40 -6.25 5.00 Prom + 96914 96953 40 -7.45 5.01 Init + 100082 100544 463 1 1 52 19 165 0.172 1.91 5.02 Intr + 110298 110407 110 1 2 94 63 89 0.924 6.08 5.03 Intr + 118388 118464 77 1 2 101 89 20 0.971 0.79 5.04 Term + 119309 119966 658 2 1 91 38 492 0.980 36.93 5.05 PlyA + 120324 120329 6 1.05 6.00 Prom + 124328 124367 40 -4.15 6.01 Init + 129873 129952 80 2 2 90 101 34 0.850 5.48 6.02 Term + 131255 131318 64 2 1 80 52 69 0.790 -1.12 6.03 PlyA + 131780 131785 6 1.05 7.00 Prom + 141008 141047 40 -4.25 7.01 Init + 141804 141880 77 2 2 95 89 36 0.627 5.01 7.02 Term + 150535 151018 484 1 1 81 54 294 0.536 18.53 7.03 PlyA + 151151 151156 6 1.05 8.00 Prom + 153004 153043 40 -5.65 8.01 Init + 156811 157139 329 0 2 63 98 159 0.112 11.24 8.02 Term + 165302 165335 34 2 1 77 41 76 0.084 -2.12 8.03 PlyA + 165954 165959 6 1.05 9.00 Prom + 166140 166179 40 -4.05 9.01 Init + 179662 179782 121 0 1 47 86 107 0.465 6.80 9.02 Intr + 179838 180708 871 1 1 82 53 207 0.109 6.06 9.03 Term + 214208 214334 127 2 1 32 29 279 0.932 13.17 9.04 PlyA + 214725 214730 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 183441 183929 489 2 0 59 43 277 0.914 14.17 S.002 Sngl - 191947 191741 207 1 0 77 37 191 0.952 7.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_1|224_aa EVHLYESHSHGFSLIWLPVESNQRRNIAVWRKKERGRAETSTLAMLQEPFSPPLHCGIPS LGWPRPEPAPTACREVWRERHGQEPGLHPVLMDQRKFQVGTGSAGLTLRVASWRHQPRAM RGLAPGPAAAEESRQWYHCDTKHNQQLDSTSEHSQCPSPIKEPDIKVCLSMVTTSWPIQN PRPDQITKVYHCQKILAKTGICGHYLKCADTNTNTQELRSTRDI >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_1|675_bp gaggttcatctgtatgaatcacacagtcatggtttctcactcatctggcttccagttgag tccaatcagcggagaaacatagcagtttggaggaagaaagaaagaggaagagcagagaca tccactctggccatgcttcaggagcccttcagcccgccgctgcactgtgggatcccctct ctgggctggccaaggccagagcctgctcccactgcttgcagggaggtgtggagggagagg cacgggcaggaaccggggctgcacccagtgctcatggaccagcgcaagttccaggtgggc acgggatcagcaggcctcacactcagagtggccagctggcgccaccagccccgggcaatg aggggcttagcacccgggccagcagctgcggaggaatccagacagtggtatcattgtgac acaaagcacaaccagcagctcgactcaacttcagagcacagtcagtgtcccagcccaatc aaagaacctgacattaaggtctgcctatccatggttactaccagctggcctattcagaat cccaggccagaccaaataacaaaggtctatcactgccaaaaaatacttgcaaagactgga atatgtggccattatctgaaatgtgctgacaccaacacaaatacccaagaattaagaagc accagggacatatga >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_2|162_aa MAVPESLETLVTAEPPKRVTACHSLGLGSPEFWASTPSILPPLGWPSPATASCHVGQLPS LGKEQETYSVTVSLARGIPRSGPPGGSPPFTPAVQECVTICSLAGHPEECVYESRDFYGL RIEEVHADWSMGGHGWAWKKHHPIGRKASVKFSLWVLDSTRN >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_2|489_bp atggcggtgcctgaaagcttggagacgctagtaaccgcagagccccccaagagagttaca gcttgtcacagcctaggcttagggagccccgagttctgggcctccactccttcaattctg ccacccctgggctggcccagccctgccactgcttcctgtcatgtggggcagctgcccagc ctaggcaaagagcaggagacctatagtgttacagtatctctggctaggggaatcccaagg tctggacccccaggagggtcaccaccctttactcccgcagtccaggagtgtgtcaccatc tgcagcttggcaggtcatcccgaagagtgtgtgtacgaatccagggatttttatggactc agaatagaggaagtgcatgctgattggtccatgggcggtcatgggtgggcctggaaaaag caccatccgattggccgaaaggcatcagtgaagttctcactctgggtcctggactccaca aggaactga >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_3|131_aa MLDQNRVIGHCCVWNWWVLGLTSRIKPQTLMVSATVLKDRVYRVCSFRCSDVPRVSSFCW VLGLTSIMKPRTLMVSVTVLKDGVSRVCSFQCSDVSGDSSFLWVGGLADFRSESPLLQGV LQLLKVVQTQS >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_3|396_bp atgcttgatcaaaacagagtcataggtcactgctgtgtctggaattggtgggttcttggt ctgacttcaagaattaagccacagaccctcatggtgagtgctacagttcttaaagatcgt gtgtacagagtttgttccttccgatgttccgacgtgcccagagtttcttccttctgttgg gttcttggtctgacttcaataatgaagccacggaccctcatggtgagtgttacagttctt aaagatggtgtgtccagagtttgttccttccaatgttcagacgtgtccggagattcttcc ttcctgtgggttggtggtctcgctgacttcaggagtgaaagtccacttttgcagggagta ttacagctcttaaaggtggtgcagacccagagttag >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_4|183_aa MPRGDSYLSVAPRAQQASILLLSRSAQGPEAVWGHKEKVLKKASIRPEISTLSGRQGKSE ATGGTVSKRRRRRGQIKSKFYNTTYRTLMDLAAPKLLKVISNFSEVSGYKINVQKLLAFL YTNNSQAENQIMNKLPFTAATKRIKYLRIQLTREVKDLFKENYKPLLKEIRDDTNAKTFH ARR >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_4|552_bp atgcccagaggcgacagctacctttccgttgcacccagagcccagcaggcctcgatcctt ctactctcccggtcggctcaaggcccagaggcagtctgggggcacaaggagaaggtgctg aaaaaggcttccatccgacccgaaatttccaccttgtcggggagacaaggaaaatccgag gcgacaggaggtaccgtttcaaagaggaggcggcgacgaggccagataaagtccaaattc tataacacaacatatagaacccttatggatttggctgccccaaagcttcttaaggtgata agcaacttcagcgaagtctcaggatacaaaatcaatgtccaaaaattgctagcatttcta tacaccaacaacagtcaagcagagaaccaaatcatgaataaactcccattcacagctgcc acaaaaagaataaaatacctacgaatacagctaaccagggaagtgaaggaccttttcaag gaaaactacaaaccactgctcaaagaaatcagagatgacacaaatgcaaaaacattccat gctcgtagatag >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_5|435_aa MLRPNTATFGAPFGLDLLPELHQRTIHLGKNFDFQKSDRINSETNTKNLKSVEITMKPSK ASELNLDELPPLNNYLHVFYYSWYGNPQFDGKYIHWNHPVLEHWDPRIAKNYPQGRHNPP DDIGSSFYPELGSYSSRDPSVIETHMRQMRSASIGVLALSWYPPDVNDENGEPTDNLVPT ILDKAHKYNLKVTFHIEPYSNRDDQNMYKNVKYIIDKYGNHPAFYRYKTKTGNALPMFYV YDSYITKPEKWANLLTTSGSRSIRNSPYDGLFIALLVEEKHKYDILQSGFDGIYTYFATN GFTYGSSHQNWASLKLFCDKYNLIFIPSVGPGYIDTSIRPWNTQNTRNRINGKYYEIGLS AALQTRPSLISITSFNEWHEGTQIEKAVPKRTSNTVYLDYRPHKPGLYLELTRKWSEKYS KERATYALDRQLPVS >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_5|1308_bp atgctgagaccaaatacagctacttttggagctccttttggacttgaccttcttccagaa cttcatcaacgaactattcatttggggaaaaattttgatttccaaaagagtgacagaatc aacagtgaaacaaataccaagaatttaaaaagtgttgaaatcactatgaaaccttccaaa gcctctgaacttaacttggatgaactaccacctctgaacaattatctacatgtattttat tacagttggtatggaaatccacaatttgatggtaaatatatacattggaatcatccagtg ttagagcattgggaccctagaatagccaagaattatccacaagggagacacaaccctcca gatgacattggctccagcttttatcctgaattgggaagttacagttctcgggatccttct gtcatagaaactcacatgagacaaatgcgctcagcttcaattggtgtactagccctctct tggtacccacctgatgtaaatgatgaaaatggagaacctactgataacttggtacccact attttggataaagctcataaatataacctaaaggttacttttcacatagaaccatatagc aatcgagatgatcaaaacatgtacaaaaatgtcaagtatattatagacaaatatggaaat catccggccttttacaggtacaagacgaagactggcaatgctcttcctatgttttatgtc tatgattcctatattaccaagcctgaaaaatgggccaatctgttaaccacctcagggtct cggagtattcgcaattctccttatgatggactgtttattgcccttctggtagaagaaaaa cataagtatgatattcttcaaagtggttttgatggaatttacacatattttgccacaaat ggctttacttatggctcatcacatcagaattgggctagcctaaaattattttgtgataaa tacaacttaatatttatcccaagtgtgggcccaggatacatagataccagcatccgtcca tggaacacgcaaaacactcggaaccgaatcaatgggaagtattatgaaattggtctgagt gccgcacttcagacacgccccagcttaatttctatcacctcttttaatgagtggcatgaa ggaactcagattgaaaaagctgttcccaaaagaaccagtaatacagtgtacctagattac cgtcctcataaaccaggtctttacctagaactgactcgcaagtggtctgaaaaatacagt aaggaaagagcaacttatgcattagatcgccagctgcctgtttcttaa >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_6|47_aa MAQVVPGAARPATPEGTSHKPWQMYRIIDNFASYLTEKIQAIEDDTE >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_6|144_bp atggctcaagtggtcccaggtgcagctcgacctgccactccagagggtacaagtcataaa ccatggcagatgtacagaataattgataattttgcttcttacttaactgagaaaatacaa gcaattgaagatgacaccgaatga >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_7|186_aa MDGLGGHYPKQTNVVTEKPNTTCSHFSVERATQTMPPRSLRWGLQVLTGSLSAYSSVPDH TAFLPRAMRWLPGQRALAGCSQGQALRTTRLFSMFTLQHTAGESGDAVPGFRVAEALGLG VGPAQPCESGGGVVACLRDAGHRALLPLLLQQPLLPPTTLPPSCSWCDGSDRSRQTTAAI NILFHR >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_7|561_bp atggatggacttggaggccattatcctaagcaaactaatgtagtaacagaaaaaccaaat accacatgttctcacttctctgtggagcgtgcaacccaaaccatgcctcctcgtagccta aggtgggggctccaggtcctcactgggtccctctctgcctactcctccgtgcctgaccac actgctttcctgccaagagcaatgcgctggctaccagggcagcgggctctggcaggctgc tcccagggtcaggctctgaggactacccgcctcttctccatgtttaccctgcagcataca gcgggggagagcggtgatgcggtaccagggttcagagtggcagaggctctgggcctggga gtgggtcctgctcaaccatgcgagagtgggggtggtgtagtcgcttgcctcagggatgca gggcacagggcactgttaccactgctgctccagcagccactcctgccgccaacaaccttg cctcccagctgcagctggtgtgatggcagcgaccgctctagacagaccaccgctgccatc aatatcttgtttcacagatga >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_8|120_aa MVETSQHGSPVTSYKYIRLTIGQYPLEWSFQRKEQGAICAISQPSLVIPPGMGINETPRI WSKHPTNCSSAMKEWPDSKRRKNNRKHQQQHQPKGPTKLHLKVSNLKDQSLWGITEPTDM >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_8|363_bp atggttgagacctcacaacatggatctccagtcacctcctacaaatacattcggctgaca ataggtcagtaccccctggaatggagcttccagaggaaggagcagggtgccatctgtgcc atttcacagccttcattggtgatacctccaggtatgggaataaatgagacacctaggatc tggagtaaacacccaacaaactgcagcagtgctatgaaagagtggcctgattctaaaaga agaaaaaacaacagaaaacaccaacagcaacatcaaccaaaaggacccacaaaactccat ttaaaggtcagcaacctcaaagatcaaagcttgtggggcatcacagaacctaccgacatg tga >gi568815592f:95486440_95706402|GENSCAN_predicted_peptide_9|372_aa MVPVPLHTAQGNKRGYKQMEENSKLMGRKNQYRENGHTAQELEKTTLKFIWNQKRARIAK SILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFD KPEKNKQWGKDSLFNKWCWDNWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLE ENLGITIQDIGMGKYFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNKQPTKWEK IFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSS LAIREMQIKTTMRYHLTPVRMAIIKKSGNNRKEGEEEEERRGRRKKRKNKEEEEEEEEEE GGGGGGGETQLY >gi568815592f:95486440_95706402|GENSCAN_predicted_CDS_9|1119_bp atggtaccagttcctcttcatactgctcaaggaaataaaagaggatacaaacaaatggaa gaaaattccaagctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaa gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccacatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggat aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacattagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcatgggcaagtacttcatgtccaaaaca ccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaagcaacctacaaaatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatca ctggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttaga atggcgatcattaaaaagtcaggaaacaacagaaaagaaggagaagaagaagaagaaaga agaggaagaagaaagaagaggaagaacaaagaagaggaagaggaagaagaagaagaagaa ggaggaggaggaggaggaggagaaacacaactctattag