GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:58:55 Sequence gi568815597f:180130530_180374576 : 244047 bp : 44.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 24379 24643 265 0 1 69 100 553 0.869 49.77 1.02 Intr + 27847 27993 147 2 0 51 64 68 0.119 0.91 1.03 Intr + 35962 36062 101 2 2 107 61 74 0.131 6.43 1.04 Intr + 44792 44837 46 1 1 85 94 52 0.884 3.48 1.05 Intr + 45402 45504 103 1 1 114 19 81 0.267 3.03 1.06 Intr + 48265 48355 91 1 1 77 83 94 0.272 7.80 1.07 Intr + 51645 51790 146 2 2 118 67 175 0.751 17.48 1.08 Intr + 53387 53521 135 2 0 113 99 71 0.997 10.38 1.09 Intr + 55524 55653 130 0 1 86 105 205 0.968 22.70 1.10 Intr + 59023 59145 123 0 0 116 105 66 0.997 11.88 1.11 Intr + 59904 60051 148 2 1 93 98 117 0.999 13.01 1.12 Intr + 63684 63863 180 1 0 55 110 241 0.999 22.84 1.13 Term + 65733 66508 776 1 2 113 46 733 0.999 65.15 1.14 PlyA + 67482 67487 6 1.05 2.00 Prom + 96643 96682 40 -6.36 2.01 Init + 100001 100076 76 1 1 75 74 54 0.619 4.13 2.02 Intr + 102751 103099 349 2 1 126 81 267 0.978 24.22 2.03 Intr + 103877 103939 63 2 0 60 102 46 0.663 1.03 2.04 Intr + 104385 104593 209 0 2 37 64 108 0.768 2.02 2.05 Intr + 117756 117927 172 1 1 117 108 221 0.932 26.00 2.06 Intr + 120639 120714 76 0 1 69 82 20 0.543 -0.98 2.07 Intr + 123870 123913 44 2 2 73 96 49 0.003 1.24 2.08 Intr + 129880 129909 30 1 0 101 93 13 0.007 0.35 2.09 Intr + 135863 136065 203 2 2 94 115 406 0.995 42.83 2.10 Intr + 140851 141005 155 2 2 64 94 187 0.996 16.69 2.11 Intr + 141306 141477 172 2 1 90 83 197 0.941 18.92 2.12 Term + 143656 144050 395 2 2 123 37 207 0.909 14.30 2.13 PlyA + 144501 144506 6 1.05 3.00 Prom + 170235 170274 40 -3.36 3.01 Sngl + 172493 173386 894 1 0 66 42 377 0.879 27.04 3.02 PlyA + 173434 173439 6 1.05 4.00 Prom + 174602 174641 40 -4.46 4.01 Init + 174693 174781 89 2 2 62 91 81 0.753 5.91 4.02 Term + 195742 195826 85 1 1 105 48 48 0.109 -0.37 4.03 PlyA + 198158 198163 6 1.05 5.00 Prom + 198727 198766 40 -3.26 5.01 Sngl + 204025 205041 1017 0 0 73 43 450 0.660 36.04 5.02 PlyA + 205269 205274 6 1.05 6.00 Prom + 205438 205477 40 -4.96 6.01 Sngl + 206382 207353 972 2 0 70 48 373 0.815 28.25 6.02 PlyA + 207385 207390 6 1.05 7.03 PlyA - 211262 211257 6 1.05 7.02 Term - 227270 227154 117 2 0 41 53 46 0.169 -5.06 7.01 Init - 233198 233142 57 2 0 69 31 187 0.946 10.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 38636 38604 33 2 0 134 47 23 0.808 0.39 S.002 Term - 157988 157834 155 0 2 96 37 135 0.841 7.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_1|796_aa MRRCNSGSGPPPSLLLLLLWLLAVPGANAAPRSALYSPSDPLTLLQADTVRGAVLGSRSA WAVEFFASWCGHCIAFAPTWKALAEDVKAQSLVQSRSPLHVHGGQWPEPHRIPNDSVSVE PSAGAVDPLDSDRGCQPAWRPALYLAALDCAEETNSAVCRDFNIPGFPTVRFFKAFTKNG SGAVFPVAGADVQTLRERLIDALESHHDTWPPACPPLEPAKLEEIDGFFARNNEEYLALI FEKGGSYLGREVALDLSQHKGVAVRRVLNTEANVVRKFGVTDFPSCYLLFRNGSVSRVPV LMESRSFYTAYLQRLSGLTREAAQTTVAPTTANKIAPTVWKLADRSKIYMADLESALHYI LRIEVGRFPVLEGQRLVALKKFVAVLAKYFPGRPLVQNFLHSVNEWLKRQKRNKIPYSFF KTALDDRKEGAVLAKKVNWIGCQGSEPHFRGFPCSLWVLFHFLTVQAARQNVDHSQEAAK AKEVLPAIRGYVHYFFGCRDCASHFEQMAAASMHRVGSPNAAVLWLWSSHNRVNARLAGA PSEDPQFPKVQWPPRELCSACHNERLDVPVWDVEATLNFLKAHFSPSNIILDFPAAGSAA RRDVQNVAAAPELAMGALELESRNSTLDPGKPEMMKSPTNTTPHVPAEGPEASRPPKLHP GLRAAPGQEPPEHMAELQRNEQEQPLGQWHLSKRDTGAALLAESRAEKNRLWGPLEVRRV GRSSKQLVDIPEGQLEARAGRGRGQWLQVLGGGFSYLDISLCVGLYSLSFMGLLAMYTYF QAKIRALKGHAGHPAA >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_1|2391_bp atgaggaggtgcaacagcggctccgggccgccgccgtcgctgctgctgctgctgctgtgg ctgctcgcggttcccggcgctaacgcggccccgcggtcggcgctctattcgccttccgac ccgctgacgctgctgcaggcggacacggtgcgcggcgcggtgctgggctcccgcagcgcc tgggccgtggagttcttcgcctcctggtgcggccactgcatcgccttcgccccgacgtgg aaggcgctggccgaagacgtcaaagcacagagcctggtgcagagtaggagcccactgcac gttcatggtggccagtggcctgagcctcacaggattccaaatgacagcgtgagtgtagaa ccttctgctggtgccgtggatcctcttgactcagaccgggggtgtcagccagcctggagg ccggccctgtatctcgccgccctggactgtgctgaggagaccaacagtgcagtctgcaga gacttcaacatccctggcttcccgactgtgaggttcttcaaggcctttaccaagaacggc tcgggagcagtatttccagtggctggtgctgacgtgcagacactgcgggagaggctcatt gacgccctggagtcccatcatgacacgtggcccccagcctgtcccccactggagcctgcc aagctggaggagattgatggattctttgcgagaaataacgaagagtacctggctctgatc tttgaaaagggaggctcctacctgggtagagaggtggctctggacctgtcccagcacaaa ggcgtggcggtgcgcagggtgctgaacacagaggccaatgtggtgagaaagtttggtgtc accgacttcccctcttgctacctgctgttccggaatggctctgtctcccgagtccccgtg ctcatggaatccaggtccttctataccgcttacctgcagagactctctgggctcaccagg gaggctgcccagaccacagttgcaccaaccactgctaacaagatagctcccactgtttgg aaattggcagatcgctccaagatctacatggctgacctggaatctgcactgcactacatc ctgcggatagaagtgggcaggttcccggtcctggaagggcagcgcctggtggccctgaaa aagtttgtggcagtgctggccaagtatttccctggccggcccttagtccagaacttcctg cactccgtgaatgaatggctcaagaggcagaagagaaataaaattccctacagtttcttt aaaactgccctggacgacaggaaagagggtgccgttcttgccaagaaggtgaactggatt ggctgccaggggagtgagccgcatttccggggctttccctgctccctgtgggtcctcttc cacttcttgactgtgcaggcagctcggcaaaatgtagaccactcacaggaagcagccaag gccaaggaggtcctcccagccatccgaggctacgtgcactacttcttcggctgccgagac tgcgctagccacttcgagcagatggctgctgcctccatgcaccgggtggggagtcccaac gccgctgtcctctggctctggtctagccacaacagggtcaatgctcgccttgcaggtgcc cccagcgaggacccccagttccccaaggtgcagtggccaccccgtgaactttgttctgcc tgccacaatgaacgcctggatgtgcccgtgtgggacgtggaagccaccctcaacttcctc aaggcccacttctccccaagcaacatcatcctggacttccctgcagctgggtcagctgcc cggagggatgtgcagaatgtggcagccgccccagagctggcgatgggagccctggagctg gaaagccggaattcaactctggaccctgggaagcctgagatgatgaagtcccccacaaac accaccccacatgtgccggctgagggacctgaggcaagtcgacccccgaagctgcaccct ggcctcagagctgcaccaggccaggagcctcctgagcacatggcagagcttcagaggaat gagcaggagcagccgcttgggcagtggcacttgagcaagcgagacacaggggctgcattg ctggctgagtccagggctgagaagaaccgcctctggggccctttggaggtcaggcgcgtg ggccgcagctccaagcagctggtcgacatccctgagggccagctggaggcccgagctgga cggggccgaggccagtggctgcaggtgctgggagggggcttctcttacctggacatcagc ctctgtgtggggctctattccctgtccttcatgggcctgctggccatgtacacctacttc caggccaagataagggccctgaagggccatgctggccaccctgcagcctga >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_2|647_aa MMQSATVPAEGAVKGLPEMLGVPMQRSRADLAVRAVALLVFYRALVARPPVRASSLRQPY SCPYGPHRQLVPATSLRAVGTSRSESSQRLTRGRVGYGYGESLPSRLQNSPRGPQQSLHA SGPCWPAMPSEETRKRARHIGRIPFQPLARAVYHKEGSLEKPGCRPGWPGEQTMNSLNER LLRTHGRLTILGTRPGIPGVLWVLGDSPVEVDVDRAAGPSRVAGFALTASRSEIPQCAGC NQHILDKFILKVLDRHWHSSCLKCADCQMQLADRCFSRAGSVYCKEDFFKACQRHWRIRA SCRRSLPVLARCFPQPASLQAVGGLSAKGRVQTLKPVLHWRFGTKCTACQQGIPPTQVVR KAQDFVYHLHCFACIICNRQLATGDEFYLMEDGRLVCKEDYETAKQNDDSEAGAKRPRTT ITAKQLETLKNAYKNSPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKKDAGRHR WGQFYKSVKRSRGSSKQEKESSAEDCGVSDSELSFREDQILSELGHTNRIYGNVGDVTGG QLMNGSFSMDGTGQSYQDLRDGSPYGIPQSPSSISSLPSHAPLLNGLDYTVDSNLGIIAH AGQGVSQTLRAMAGGPTSDISTGSSVGYPDFPTSPGSWLDEMDHPPF >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_2|1944_bp atgatgcagagtgcgactgtccccgcggaaggggctgtcaaggggctcccggagatgcta ggtgtgccgatgcaacgctcccgtgcggacttagcagttcgcgccgtagccctcctggtt ttttaccgcgccttggtcgcgcggcccccggtgcgggcctcgtcgctccgacagccctac agctgtccctacggcccgcaccgacagctcgtccccgccacctccctccgggctgtaggg acgtcgagatccgaatcctcacagcgcctgacccggggccgtgttgggtacgggtacggc gagagccttccgagccgcctccaaaactctccccgcggacctcagcagtccctccacgcg tcggggccctgctggccggccatgccctctgaggagacgcggaaaagagcgaggcatata ggaaggatcccgttccagcccctggccagagctgtgtaccacaaagaagggagtttggag aaacccggatgccggcctgggtggcccggggaacagacgatgaattctttaaatgagcgt ttgctgcgcacccatgggcggctcacgatcctgggcactcgcccgggaatcccgggcgtg ctctgggttttgggggactcgccggtggaggtagatgtggacagggcggccggcccttcg cgggtagccggctttgcgcttactgcgtcccgcagcgagattccccagtgcgctggctgc aaccagcacatcctggacaagttcatcctgaaggtcctggacagacactggcacagctcc tgcctcaagtgtgcagactgccagatgcagctggcggacaggtgcttctccagggctggg agcgtctactgcaaggaggacttcttcaaagcctgccagaggcactggcgcatcagagcc tcctgccggcgctcccttccagtcctagcccgctgctttcctcagcctgccagtctgcag gccgtaggtgggctgtccgccaagggcagggtccagaccctcaagcctgtcttgcactgg cgcttcggcacaaaatgcacggcctgccagcagggtatccccccaacccaggtggtccgc aaggcccaggactttgtctaccacctgcactgctttgcttgcatcatctgcaaccggcag ctggccacgggggacgaattctacctcatggaggacgggcggctggtgtgcaaggaagac tacgagacagccaagcagaacgatgactcagaggctggagctaagcggccccggaccacc atcacagccaagcagctggagacattaaagaatgcatacaagaactcccccaagcctgcc cggcacgtgagggagcagctgtcctcagagacaggcctggacatgagggtcgtacaggtt tggtttcagaacagaagggccaaagagaaacgcctgaagaaggatgcagggcggcaccgc tgggggcagttctataagagcgtcaagaggagccggggcagcagcaagcaggagaaggag agctctgcagaggactgtggggttagtgacagtgagctgagcttccgagaggatcaaatt ctctcagaacttggccacaccaataggatttatggcaacgtgggggacgttacaggcgga cagttaatgaatgggagcttctccatggacgggacaggacaatcctatcaggacttgagg gatgggagcccctatggaatcccccagtctccatcctccatatcgtccctgccatcccac gctcctttgctcaatgggctggattacacggtggacagtaatttgggcatcattgcgcat gcagggcagggagtaagccagacgctgagagccatggctgggggacccacctctgacatc tccacaggaagcagtgtaggctatcccgactttccaactagcccaggctcttggctcgat gaaatggatcatcctcctttttaa >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_3|297_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKLNESRSWFFEKINKIDRLLARLIKKKR EKNQIDAIKNDKGDITTNPAEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLKRSIIGSEIEAKINSSPTKKSPGPDGFTAEFHQRYKEELVPFLLKLFQSIEKDGIL PNSFYEASIILLPKPGRDTTKKENFRPISLMNTDAKIINKILANRIQQHIKKLIHHD >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_3|894_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagca gaactgaaggagatagagacacaaaaaacccttcaaaaactcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaagaagaaaaga gagaagaatcaaatagatgcaataaaaaatgataaaggggacatcaccaccaatcccgca gaaatacaaactaccatcagagaatattataaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaa gttgaatctctgaagagatcaataataggatctgaaattgaggcaaaaattaatagctca ccaaccaaaaaaagtccaggaccagatggattcacagccgaattccaccagaggtacaag gaggagctggtaccattccttctgaaactattccaatcaatagaaaaagatggaatcctc cctaactcattttatgaggccagcatcatcctgttaccaaagcctggcagagacacaaca aaaaaagagaattttagaccaatatccctgatgaatactgatgcaaaaatcatcaataaa atactggcaaaccgaatccagcagcacatcaagaagcttatccaccatgattaa >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_4|57_aa MATKAKIDKWDLIKLKSFCTAKETTIRVNRQRSLSHSQLPPPPPAHGELCQATADIH >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_4|174_bp atggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcaca gcaaaagaaactaccatcagagtgaacaggcagaggagcctctcccactcccagttgcca ccaccaccaccagcccatggagagctctgccaggccactgccgatattcactga >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_5|338_aa MGKKQSRKTGNSKNQSASPPPKERSSSPAMEESWTENDFDELREEGFRRSNYFELKEEVQ THGKEVKNLEKKLDEWLTRITNAEKSLKDLMELKTMARELRDECTSLSSRFNQLEERVSV IGDQMNEMKREEKFREKRIKRNEQSLQEIWDYVKRPNLLLIGVPESDGEKGTKLENTLQD IIQENFPKLARQANIQIQEIQRMPQRYSSRRTTPRHIIVRFTKVEMKEKMLRAAREKGQV THKGKPVILTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDK QMLRDFVTTRPALQQLLKEALNMERNNRYQPLQKHAKL >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_5|1017_bp atgggaaaaaaacagagcagaaaaactggaaactctaaaaatcagagcgcttctcctcct ccaaaggaacgcagctcctcaccagcaatggaagaaagctggacggagaatgactttgat gagttgagagaagaaggcttcagacgatcaaactactttgagctaaaggaggaagtgcaa acccatggcaaagaagttaaaaaccttgaaaaaaaattagacgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatggcacgagaacta cgtgacgaatgcacaagcctcagtagccgattcaatcaactggaagaaagggtatcagtg ataggagatcaaatgaatgaaatgaagcgagaagagaagtttagagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatatgggactacgtgaaaagaccaaatctacttctg attggtgtacctgaaagtgatggggagaagggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaaactagcaaggcaggccaacattcaaattcaggaaata cagagaatgccacaaagatactcctcgagaagaacaaccccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcggccagagagaaaggtcaggta acccacaaagggaagcccgtcatactaacagctgatctctcggcagaaactctacaagcc aggagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatcctttacagacaag caaatgctgagagattttgtcaccaccaggcctgctttacaacagctcctgaaggaagca ctaaacatggaaaggaacaacaggtaccagccactgcaaaaacatgccaaattgtaa >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_6|323_aa MDKFLDTYTLPRLNQEEVESLKRPITGSEIEAKINSSPTKKSPGPDIHSRIPPEVQGGAG TIPSETIPINRKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILKN RIQQHIKKLIHCDQVGFIPGMQGWFNIQKSINVIQHINRTNDKNHMIISIDAEKAFDKIQ QRFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKMGTRQGCPLSPLLFN IVLEVLARAIRQEKEIKGIQLGKEEIKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVS GYKINVQKSQAFLYTNNRQTAKS >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_6|972_bp atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaagagaccaataacaggctctgaaattgaggcaaaaattaatagctcaccaaccaaa aaaagtccaggaccagacattcacagccgaattccaccagaggtacaaggaggagctggt accattccttctgaaactattccaattaatagaaaagagggaatcctccctaactcattt tatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagagaat tttagaccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatactgaaaaac cgaatccagcagcacatcaaaaagcttatccactgtgatcaagtgggcttcatccctggg atgcaaggctggttcaacatacaaaaatcaataaacgtaatccagcatataaacagaacc aatgacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaa caacgcttcatgctaaaaactctcaataaattaggtattgatgggatgtatctcaaaata ataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaatgggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggaaaaggaaataaagggtattcaa ttaggaaaagaggaaatcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca gggtacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaaca gccaaatcatga >gi568815597f:180130530_180374576|GENSCAN_predicted_peptide_7|57_aa MLPLLLLLLLLLLLTLIGKGSSSASSCTLLLLLSYTEKRKTLRIIFLKVFQLLGNLY >gi568815597f:180130530_180374576|GENSCAN_predicted_CDS_7|174_bp atgctcccgctcctgctcctgctcctgctcctgctcctgcttactctgattggcaaggga agcagttctgcttcttcatgcacactgttgctccttctttcatatactgagaagaggaag actttgaggatcatcttcctgaaggtcttccagcttctagggaacctctactga