GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:49:53 Sequence gi568815594r:169629581_169857877 : 228297 bp : 39.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6349 6508 160 0 1 52 111 160 0.960 14.73 1.02 Intr + 30016 30060 45 2 0 34 97 81 0.131 1.06 1.03 Intr + 50470 50627 158 2 2 123 114 66 0.948 11.51 1.04 Intr + 54958 55066 109 0 1 91 55 61 0.904 2.04 1.05 Intr + 59463 59650 188 1 2 72 107 122 0.999 10.99 1.06 Intr + 60950 61072 123 1 0 105 99 67 0.990 9.36 1.07 Intr + 62534 62740 207 1 0 97 107 41 0.916 5.25 1.08 Intr + 66032 66112 81 1 0 79 92 51 0.765 3.52 1.09 Intr + 67609 68154 546 0 0 129 115 295 0.760 28.45 1.10 Intr + 74418 74604 187 2 1 69 58 94 0.391 2.84 1.11 Intr + 77288 77686 399 0 0 92 48 397 0.960 29.65 1.12 Intr + 83499 83715 217 1 1 97 106 188 0.908 18.14 1.13 Term + 84530 84671 142 2 1 89 44 39 0.363 -3.98 1.14 PlyA + 85819 85824 6 1.05 2.10 PlyA - 86625 86620 6 1.05 2.09 Term - 100129 99998 132 1 0 91 48 124 0.780 5.81 2.08 Intr - 102296 102124 173 0 2 59 84 104 0.851 5.84 2.07 Intr - 108167 108080 88 2 1 67 50 130 0.829 5.72 2.06 Intr - 118319 118073 247 1 1 97 105 172 0.985 16.34 2.05 Intr - 119262 119164 99 2 0 89 71 45 0.694 1.21 2.04 Intr - 121145 120956 190 0 1 55 103 138 0.979 9.72 2.03 Intr - 124255 124096 160 1 1 30 105 194 0.986 13.94 2.02 Intr - 128081 127813 269 0 2 82 70 -12 0.023 -7.27 2.01 Init - 128436 128250 187 0 1 60 84 257 0.024 20.29 2.00 Prom - 143868 143829 40 -6.15 3.05 PlyA - 143984 143979 6 1.05 3.04 Term - 145360 145161 200 2 2 45 33 147 0.304 1.48 3.03 Intr - 145813 145659 155 0 2 52 87 67 0.537 1.79 3.02 Intr - 149016 148365 652 1 1 67 39 323 0.176 15.34 3.01 Init - 154206 153540 667 0 1 85 -19 350 0.300 19.42 3.00 Prom - 158771 158732 40 -9.65 4.03 PlyA - 161355 161350 6 1.05 4.02 Term - 162238 161640 599 2 2 -43 48 581 0.774 34.70 4.01 Init - 165571 165538 34 1 1 58 98 53 0.622 3.39 4.00 Prom - 188601 188562 40 -7.25 5.00 Prom + 191043 191082 40 -5.35 5.01 Init + 191588 191774 187 1 1 75 6 416 0.446 31.37 5.02 Intr + 191894 191980 87 0 0 -119 6 300 0.512 0.12 5.03 Intr + 192010 192229 220 2 1 30 -10 474 0.041 28.24 5.04 Intr + 193841 193934 94 2 1 59 45 106 0.035 2.45 5.05 Intr + 196796 196906 111 1 0 62 39 81 0.500 0.26 5.06 Term + 197457 197849 393 2 0 54 43 296 0.943 15.75 5.07 PlyA + 197898 197903 6 1.05 6.00 Prom + 200419 200458 40 -6.15 6.01 Sngl + 203313 203783 471 2 0 24 48 404 0.917 25.87 6.02 PlyA + 206249 206254 6 1.05 7.02 PlyA - 206408 206403 6 1.05 7.01 Term - 211341 211196 146 1 2 89 44 131 0.597 5.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 192010 192245 236 2 2 30 48 502 0.938 35.90 S.002 Term - 219759 219619 141 0 0 115 49 95 0.832 5.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_1|853_aa MESEQLFHRGYYRNSYNSITSASSDEELLDGAGVIMDFQTSEDDNLLDGDTAVASSFKKG NKKPLEEIGTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRERHR RIILILILIHRRHFPQQTWNQPFLKELSDSIEGKYNIGALAGLIDIAADWMTDLKEGICL SALWYNHEQCCWGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWALSFA FLAVSLVKVFAPYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASGLSLGKE GPLVHVACCCGNIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVLFSLEEVSY YFPLKTLWRSFFAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFPFILLGVFGGL WGAFFIRANIAWCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRLNTSELIKELFTD CGPLESSSLCDYRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALIFKIIMTVFTFGIKV PSGLFIPSMAIGAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGADCITPGLYAMVGAAAC LGGVTRMTVSLVVIVFELTGGLEYIVPLMAAVMTSKWVGDAFGREGIYEAHIRLNGYPFL DAKEEFTHTTLAADVMRPRRNDPPLAVLTQDNMTVDDIENMINETSYNGFPVIMSKESQR LVGFALRRDLTIAIESARKKQEGIVGSSRVCFAQHTPSLPAESPRPLKLRSILDMSPFTV TDHTPMEIVVDIFRKLGLRQCLVTHNGTLHFHYIFFLVELKNVVLSEGKYQKTEGILGLC INWMANPEQRKAG >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_1|2562_bp atggagtctgagcagctgttccatagaggctactatagaaacagctacaacagtataaca agtgcaagtagtgatgaggaacttttagatggagcaggtgttattatggactttcaaaca tctgaagatgacaatttattagatggtgacactgcagttgctagttccttcaagaaagga aacaagaagccgctggaggagattggaactcattatacaatgacaaatggaggcagcatt aacagttctacacatttactggatcttttggatgaaccaattccaggtgttggtacatat gatgatttccatactattgattgggtgcgagaaaaatgtaaagacagagaaaggcataga cggataatcctaatcctaatcctgattcatcgtagacatttcccgcagcaaacctggaat cagccatttctcaaggagctctctgattccattgaaggaaaatataatataggggcactg gccggattaatagacattgctgccgattggatgactgacctaaaggagggcatttgcctt agtgcgttgtggtacaaccacgaacagtgctgttggggatctaatgaaacaacatttgaa gagagggataaatgtccacagtggaaaacatgggcagaattaatcataggtcaagcagag ggtcctggttcttatatcatgaactacataatgtacatcttctgggccttgagttttgcc tttcttgcagtttccctggtaaaggtatttgctccatatgcctgtggctctggaattcca gagattaaaactattttaagtggattcatcatcagaggttacttgggaaaatggacttta atgattaaaaccatcacattagtcctggctgtggcatcaggtttgagtttaggaaaagaa ggtcccctggtacatgttgcctgttgctgcggaaatatcttttcctacctctttccaaag tatagcacaaacgaagctaaaaaaagggaggtgctatcagctgcctcagctgcaggggtt tctgtagcttttggtgcaccaattggaggagttctttttagcctggaagaggttagctat tattttcctctcaaaactttatggagatcattttttgctgctttagtggctgcatttgtt ttgaggtccatcaatccatttggtaacagccgtctggtccttttttatgtggagtatcat acaccatggtacctttttgaactgtttccttttattcttctaggggtatttggagggctt tggggagcctttttcattagggcaaatattgcctggtgtcgtcgacgcaagtccacgaaa tttggaaagtatcccgttctggaagtcattattgttgcagccattactgctgtgatagcc ttccctaatccatacactaggctaaacaccagtgaactgatcaaagagctttttacagac tgtggtcccctggaatcctcttctctttgtgactacagaaatgacatgaatgccagtaaa attgtcgatgacattcctgatcgtccagcaggcattggagtatattcagctatatggcag ttatgcctggcactcatatttaaaatcataatgacagtattcacttttggcatcaaggtt ccatcaggcttgttcatccccagcatggccattggagcgatcgcaggaaggattgtgggg attgcggtggagcagcttgcctactatcaccacgactggtttatctttaaggagtggtgt gaggtcggggctgattgcattacacctggcctttatgccatggttggtgctgctgcatgc ttaggtggtgtgacaagaatgactgtctccctggtggttattgtttttgagcttactgga ggcttggaatatattgttccccttatggctgcagtcatgaccagtaaatgggttggagat gcctttggcagggaaggcatttatgaagcacacatccgattaaatggataccctttcttg gatgcaaaagaagaattcactcataccaccctggctgctgacgttatgagacctcgaagg aatgatcctcccttagctgtcctgacacaggacaatatgacagtggatgatatagaaaac atgattaatgaaaccagctacaatggatttcctgtcataatgtcaaaagaatctcagaga ttagtgggatttgccctcagaagagacctgacaattgcaatagaaagtgccaggaaaaaa caagaaggtatcgttggcagttctcgggtgtgttttgcacagcacaccccatctcttcca gcagaaagtcctcggccattgaagcttcgaagcattcttgacatgagcccttttacagtg acagaccacaccccaatggagatcgtggtggatattttccgaaagctgggactgaggcag tgccttgtaactcacaatgggacccttcatttccactatatattctttcttgttgaactt aagaatgttgttttatccgaaggcaaataccaaaaaacagagggtattcttggattatgc ataaactggatggctaatcctgaacagcgtaaagctggttga >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_2|514_aa MGGAFPLNWRARAALKFGLEAFEPGSGWRRSEAAADRGILLALQLQNGRRWREAQARRRG AAGGWTRKVAPLHLLPSSSWSISWGNGLSQPMVHSPSQIDWEPITYLTFCGALRIWGSTC LLLLLVEPSAWGGGAQANMHSTVTFLFGIITLCEKTTDVKKSKFCEADVSSDLRKEVENH YKLSLPEDFYHFWKFCEELDPEKPSDSLSASLGLQLVGPYDILAGKHKTKKKSTGLNFNL HWRFYYDPPEFQTIIIGDNKTQYHMGYFRDSPDEFPVYVGINEAKKNCIIVPNGDNVFAA VNPREAKNQVLGSQLQLLQNHQMPLQLCSLMEIVEAQPPCGISSASEISKYCSCLEEDRS HVEPELKSSMGNAVNFHGEGVGIEVVTKTFHGAGLVVPVDKNDVGYRELPETDADLKRIC KTIVEAASDEERLKAFAPIQEMMTFVQFANDECDYGMGLELGMDLFCYGSHYFHKVAGQL LPLAYNLLKRNLFAEIIEEHLANRSQENIDQLAA >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_2|1545_bp atgggcggagcctttcccctaaattggcgagcccgggcggcgctgaaattcggcttggag gcttttgaacccggaagcggttggcggcgctcggaagcggccgcggatcggggaattctg ctggcgctgcagctgcagaatggtcggcggtggcgggaagcgcaggcccggcggagaggg gccgcagggggttggaccaggaaagtagctcctctgcatcttctgcccagctcctcttgg agcatttcctggggaaacggattaagccaaccaatggttcattcgcccagccagattgat tgggaaccgattacatacctgacattctgtggggcgctgcggatttgggggtcaacctgc ttgctcctgctgcttgttgagccctcggcctggggaggaggggcacaagcaaatatgcat tctactgtgacttttttgtttgggattataacgctgtgtgaaaaaacaactgatgtgaag aaaagtaaattctgtgaagctgatgtctccagtgaccttcgaaaagaagtagaaaatcat tataagctttctttacctgaagatttctatcacttctggaagttctgtgaagaacttgat cctgaaaagccatctgattcactttctgcaagccttggacttcaattagttggtccttat gatatccttgctggaaaacataaaacgaagaaaaaatcaacaggcctgaattttaacctt cactggaggttttactatgatcctcctgagttccagaccattattattggagataataaa actcagtaccacatggggtatttcagggattctcctgatgaatttcctgtatatgttggt ataaatgaagcaaagaaaaattgtataattgttccaaatggagataatgtatttgctgca gtcaacccccgtgaagctaagaaccaggtgctgggatctcagctacagctgctgcagaac catcagatgcctctacagctgtgctcattgatggagatcgtggaagcacagcctccatgt ggtatctcttctgcctctgagatttcaaagtattgcagctgcttggaagaagataggtcc catgtggagcctgagctaaagagtagtatgggaaatgcagtgaactttcatggggaaggg gttggaatagaggttgtgacaaagacctttcatggtgcaggcttggttgttccagtagat aaaaatgatgttgggtaccgagagctccctgaaacagatgctgacctcaagagaatttgc aagacaatagttgaggctgcaagtgatgaggagagactaaaagcttttgctcccattcag gaaatgatgacttttgtgcagtttgctaatgatgaatgtgattatggcatggggcttgaa ttgggaatggatctcttttgctatggctcacattattttcataaagttgctggccagctt ttacctcttgcatataatctgttgaagaggaatctgtttgcagaaattattgaggagcat ctggcaaacagaagtcaagagaacatagaccaacttgctgcatga >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_3|557_aa MGSNIISSPRGYYKQFRRGVYTHCDMGSNIILSPPGYYEQYRRGLYVPCDIKSNIILSTL GYYGQHRRRVYVPCDIKSNIILSTPGYYGQYRRRVYVPCDIKSNIILSTPGYYGQYRRGV YIPCEIKSNIILSTPGYYGQYRRGVYIPCEIKSNIILSTPGYYGQYRSGVYIPCNMENNT ILSLPGYYGQYRRLFMGRNIILSSPGYYGQYHRKLYTPCDMGMLEVLARAIRQEKEIKGI QLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQESQAFLYTNNRQ TERQIMSELPFTIASKRIKYLEIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVG RINIVKMAVLPKVIYKFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAG GIMLPDFKLYYKATVTKTAWPLPWGRTNGLWGPLHGLDPISDAAAAPRSRRIHTTVQETG SHFRKGREERTGGAWEPRSPLRTLRSPFWAARKPHLDRGPSRRRPGPQQDRMGAPQGRNY RKMERDFFTCPSSPKWA >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_3|1674_bp atggggagtaacatcatctcctccccccgtggatattacaaacaatttcgcaggggtgtg tacacccactgcgatatggggagtaacatcatcctctctcctcctggatattacgaacaa tatcgcagggggttgtatgttccttgtgatataaagagtaacatcatcctctccaccctt ggttattacggacaacatcgcaggagggtgtatgttccttgtgatataaagagtaacatc atcctctccacccctggttattacggacaatatcgcaggagggtgtatgttccttgtgat ataaagagtaacatcatcctctccacccctggttattacggacaatatcgcaggggggtg tatattccttgtgaaataaagagtaacatcatcctctccacccctggttattacggacaa tatcgcaggggggtgtatattccttgtgaaataaagagtaacatcatcctctccacccct ggttattacggacagtatcgcagtggggtgtacattccctgcaatatggaaaataacacc atcctctccctccctggatattatggacaatatcgcaggttgtttatggggaggaacatc atcctctcttcccctggatattacggacaatatcacaggaagttgtacaccccctgcgat atggggatgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaagaatcacaagcattcttatacaccaataacagacaa acagagagacaaatcatgagtgaactcccattcacaatcgcttcaaagagaataaaatac ttagaaatccaacttacaagggacgtgaaggacctcttcaaggagaactacaaaccactg ctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga agaatcaatatcgtgaaaatggctgtactgcccaaggtaatttacaaattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcatgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg cccctgccctggggacggacaaatggactctggggacctctccatgggctggatccaatc tccgatgctgctgcagcgccgagatctcgacgcattcacaccacggtgcaagagacgggc agtcacttcaggaaagggagagaggagaggacagggggcgcgtgggaaccgcggagtccg ctgcgcacacttcgaagtcccttctgggcggctcggaagcctcatctcgatcgcgggcct tccaggagacgccccggtccacagcaggataggatgggggctccccaagggaggaactac aggaagatggaacgggatttcttcacctgcccttcatcgcctaagtgggcctag >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_4|210_aa MYAKGLVVREDREREEAAATREKTRSPHRRAAAARQPPAHPFARPPAAFTMQPAFAKWYD RRDCVFTESCVEDNKDVNVNFEKSKLTFSCLGGSDNFKHLNEIGLFYSIDPNDSKHKRTD RSILCCLRKGESGQSWPRLTKERAKLNWLSMDFNHWKDWEDGSDEDRSNFDRFSEMMNNM GGDEDVDLPEVDEAGDDSQDSDDEKMPDLE >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_4|633_bp atgtatgctaagggccttgttgtccgtgaggacagggagcgagaagaggcagcggcgacc agggagaaaacgcggagtccccaccggagagcagccgccgccagacagccgcccgcccac ccgttcgcccgtccccctgccgcattcacaatgcagcctgcttttgcaaagtggtacgat cgaagggactgtgtcttcactgaatcttgtgttgaagacaataaggatgttaatgtaaat tttgaaaaatccaaacttacattcagttgtcttggaggaagtgataattttaagcattta aatgaaattggtcttttttactctattgatccaaatgattccaagcataaaagaacggac agatcaattttatgttgtttacgaaaaggagaatctggccagtcatggccaaggttaaca aaagaaagggcaaagcttaactggcttagtatggacttcaatcattggaaagactgggaa gatggttcagatgaagacaggtctaattttgatcgtttctccgagatgatgaacaacatg ggtggggatgaggatgtagatttaccagaagtagatgaagcaggtgatgattcacaagac agtgatgatgaaaaaatgccagatctggagtaa >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_5|363_aa MKKKMKMKKMKMKKRWQKKIKMVPLAASSHGRRNEEEDENEDDEDEEDEDEEEEEMAEED EDDEEEDENEEDDEDEEDDEEEEQMAEEDEDEEMKKMKMKMKKKKKRWQKKMKMVSLAAS SHGRRDEEEDEDEEDEDEEDEEEDNEDEEEMAEEEEDGASCCILTLVVEMKMSLVMINDY PQLKVFRYQFIYIDLLWFCPCCPYTKEKDKEPEGFTYTISCSCPKERGQSVFSLEENQPT QMRKNQCKNSGNSKNQSVFLPPNDLTSSTVMVLNKAEMAEMTEIELRMWIRMKIIKIQEK TETQSKESKEQEKMIQDVKDKMAILRKKKTDLIELKNLLQEFHNTVTSINRISGQTEEKN LRA >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_5|1092_bp atgaagaagaagatgaagatgaagaagatgaagatgaagaagagatggcagaagaagatt aagatggtgcctcttgctgcatcttcacatggcagaagaaatgaagaagaagatgaaaat gaagatgatgaagatgaagaggatgaagatgaagaagaagaagagatggcagaagaagat gaagatgatgaagaagaagatgaaaatgaagaagatgatgaagatgaagaagatgatgaa gaagaagaacagatggcagaagaagatgaagatgaagagatgaagaagatgaagatgaag atgaagaagaagaagaagagatggcagaagaagatgaagatggtgtctcttgctgcatct tcacatggcagaagagatgaagaagaagatgaagatgaagaagatgaagatgaggaagat gaagaagaagataatgaagatgaagaagagatggcagaagaagaggaagatggtgcctct tgctgcatcctcacacttgtcgtggaaatgaagatgtcactggtcatgatcaatgattat cctcagttaaaagtgttcaggtatcagtttatctatattgatcttttgtggttctgccct tgctgcccttatactaaggaaaaggacaaagagcctgagggctttacttataccatcagc tgcagctgccctaaggaaagagggcagtctgtcttctctctagaggaaaaccagcccaca cagatgagaaagaaccagtgcaagaactctggcaactcaaaaaaccagagcgtcttttta cctccaaatgacctcactagttccacagtaatggttcttaacaaggctgaaatggctgaa atgacagaaatagaattgagaatgtggatcagaatgaagatcatcaagattcaggagaaa actgaaacccaatccaaggaatctaaggaacaagagaaaatgatacaggatgtaaaagac aaaatggccattttaagaaagaagaaaactgatctgatagagctgaaaaacttacttcaa gaatttcataatacagtcacaagtattaacagaataagtggtcaaactgaggaaaagaat ctcagagcttaa >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_6|156_aa MVKAKFEEGSKEFVADEEEENSDNDVELLLNLDLCLVINQHWRQHSLQCVNASTCIQMLK MKIQMLTMEKNMMWKRKNKDMGTSLHFTPVRKDLSHLTAEGQTCWRLEGTLSQSVSCQYH MVGVRTEDSIGDDEAGMEVDTTPTVAGQFEDADVDH >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_6|471_bp atggtgaaagccaaatttgaagaaggatcaaaagaatttgttgctgacgaagaagaggaa aatagtgataatgatgtggaactattactgaatttagatttgtgcctagtgataaatcaa cattggaggcaacattcactgcaatgtgtgaatgccagtacttgcatccagatgctgaag atgaagattcagatgcttacgatggagaagaatatgatgtggaagcgcaagaacaaggac atggggacatccctacattttacacctgtgaggaaggatttatctcatctaacggcagag ggccaaacatgttggagattagaaggaacgctttctcagtctgtaagctgccagtatcat atggttggggtcaggacagaagactcaataggagatgatgaagctgggatggaggtggat accacaccaacagttgctggacagtttgaggatgcagatgttgatcactga >gi568815594r:169629581_169857877|GENSCAN_predicted_peptide_7|48_aa XKHNHSYENSPLPVDTVTHLGKNLMRFAPSTGSPELKAPSKPQTSISM >gi568815594r:169629581_169857877|GENSCAN_predicted_CDS_7|147_bp nagaaacacaaccactcttatgaaaacagccccttgcctgtggacacagtgacccacctt ggtaaaaatctaatgcgttttgcaccaagtacaggatcaccagagctaaaagcaccctcc aagccacaaacctcaatctccatgtga