GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:31:23 Sequence gi568815596r:39636738_39879239 : 242502 bp : 39.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 51 46 6 1.05 1.02 Term - 8580 8439 142 2 1 47 47 157 0.806 3.92 1.01 Init - 13370 13240 131 2 2 74 63 132 0.951 8.97 1.00 Prom - 15430 15391 40 -3.65 2.00 Prom + 18507 18546 40 -6.35 2.01 Init + 29238 29637 400 2 1 98 105 427 0.999 40.27 2.02 Term + 51587 51744 158 0 2 54 54 127 0.022 3.01 2.03 PlyA + 52717 52722 6 1.05 3.00 Prom + 55883 55922 40 -3.85 3.01 Init + 58851 58929 79 2 1 68 111 13 0.447 2.89 3.02 Intr + 67344 67457 114 1 0 98 88 90 0.648 9.50 3.03 Intr + 70312 70449 138 2 0 -4 96 134 0.217 4.51 3.04 Intr + 80273 80508 236 0 2 84 72 98 0.464 4.28 3.05 Intr + 83773 83943 171 0 0 103 34 112 0.195 6.52 3.06 Intr + 90473 90562 90 1 0 95 30 98 0.513 3.97 3.07 Term + 96607 96693 87 0 0 70 48 158 0.556 6.68 3.08 PlyA + 99509 99514 6 1.05 4.08 PlyA - 99705 99700 6 1.05 4.07 Term - 100322 99998 325 1 1 60 43 188 0.801 4.75 4.06 Intr - 107741 107633 109 0 1 62 115 73 0.691 5.82 4.05 Intr - 118672 118558 115 1 1 94 82 124 0.994 11.50 4.04 Intr - 119223 119152 72 0 0 91 100 89 0.997 9.08 4.03 Intr - 124681 124594 88 0 1 94 116 62 0.773 8.65 4.02 Intr - 132356 132208 149 2 2 61 2 100 0.072 -3.19 4.01 Init - 133365 132973 393 0 0 28 85 327 0.807 23.18 4.00 Prom - 133893 133854 40 -4.65 5.00 Prom + 139358 139397 40 -6.25 5.01 Init + 142200 142314 115 2 1 51 35 168 0.313 6.99 5.02 Term + 150904 151031 128 2 2 42 42 110 0.321 -0.74 5.03 PlyA + 151146 151151 6 1.05 6.04 PlyA - 151278 151273 6 1.05 6.03 Term - 153619 153479 141 1 0 62 38 110 0.446 0.35 6.02 Intr - 155655 155600 56 1 2 85 92 25 0.410 0.38 6.01 Init - 165886 165700 187 1 1 51 92 99 0.423 5.87 6.00 Prom - 168750 168711 40 -3.35 7.02 PlyA - 168867 168862 6 1.05 7.01 Sngl - 173163 172078 1086 0 0 34 42 332 0.926 20.00 7.00 Prom - 174662 174623 40 -6.55 8.04 PlyA - 174829 174824 6 1.05 8.03 Term - 175867 175466 402 1 0 -36 49 228 0.758 0.67 8.02 Intr - 176375 176082 294 2 0 37 71 153 0.237 4.78 8.01 Init - 180301 180179 123 1 0 80 49 109 0.507 6.42 8.00 Prom - 188739 188700 40 -3.35 9.02 PlyA - 189499 189494 6 1.05 9.01 Sngl - 192943 192566 378 1 0 36 49 230 0.493 9.81 9.00 Prom - 196048 196009 40 -6.55 10.00 Prom + 196399 196438 40 -7.25 10.01 Init + 202291 202364 74 0 2 99 45 5 0.100 -2.10 10.02 Intr + 202686 202792 107 0 2 84 91 95 0.226 8.34 10.03 Term + 208783 208907 125 2 2 32 47 174 0.321 5.27 10.04 PlyA + 210541 210546 6 1.05 11.00 Prom + 213949 213988 40 -5.85 11.01 Init + 214453 214518 66 0 0 55 92 37 0.452 1.92 11.02 Intr + 218145 218411 267 2 0 38 98 128 0.836 5.61 11.03 Term + 222196 222462 267 0 0 79 55 120 0.813 2.31 11.04 PlyA + 223771 223776 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 132356 132164 193 2 1 61 96 98 0.841 6.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_1|90_aa MEEEGQEPRNARENFHQVKKARKWKWIPPKRLQKEYGLANTLAQSFRRYPEEGIVIIGGD SSMLVTAPEDLPRGQDVEVEDSDIDDPDTM >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_1|273_bp atggaggaagagggccaagaaccaaggaatgcacgtgaaaacttccaccaggtgaaaaag gcaaggaaatggaaatggattccccctaagagactccagaaggaatatggcctcgctaac actttagcccagtccttcaggaggtatccagaagaaggcattgttatcattggaggtgac agctccatgcttgttactgcccctgaagaccttccaaggggacaagatgtggaggtggaa gacagcgatattgatgatcctgacactatgtag >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_2|185_aa MEPRALVTALSLGLSLCSLGLLVTAIFTDHWYETDPRRHKESCERSRAGADPPDQKNRLM PLSHLPLRDSPPLGRRLLPGGPGRADPESWRSLLGLGGLDAECGRPLFATYSGLWRKCYF LGIDRDIDTLILKDLGRDLEMPVELLEELDTWEIVIKAAKDRGPCPDCHFLLLPLMALGF GFFLP >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_2|558_bp atggagccgcgggcgctcgtcacggcgctcagcctcggcctcagcctgtgctccctgggg ctgctcgtcacggccatcttcaccgaccactggtacgagaccgacccccggcgccacaag gagagctgcgagcgcagccgcgcgggcgccgaccccccggaccagaagaaccgcctgatg ccgctgtcgcacctgccgctgcgggactcgcccccgctggggcgccggctgctcccgggc ggcccggggcgcgccgaccccgagtcctggcgctcgctcctggggctcggcgggctggac gccgagtgcggccggcccctcttcgccacctactcgggcctctggaggaagtgctacttc ctgggcatcgaccgggacatcgacaccctcatcctgaaagacttgggtcgtgatttagag atgcctgtggagctgctggaggaactagacacatgggaaatagtgataaaggctgctaaa gacagagggccctgccctgactgtcacttcctgctgctgccactaatggctttgggattt ggcttcttcctaccatga >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_3|304_aa MVTEMDMMLPGDNLGREGSSPWEPVTGIAQRCTAIKYHFSQPIRLRNIPFNLTKTIQQDE WHLLHLRRITAGFLGMAVAVLLCGCIVATVSFFWEESLTQHVAGLLFLMTGIFCTISLCT YAASISYDLNRLPKLIYSLPADVEHGYSWSIFCAWCSLGFIVAAGGLCIAYPFISRTKIA QLKSGRDSTLLHNTPGHLPQTCCIASNPFLTSYAPHMNCLATHNQIIDSFKEESVPCPGA QPTQCLIAVTAAIQAGPVYQGAFPEPDMEQVLTNCLHINALCHQTGAAEREGCQWNELNE QGAE >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_3|915_bp atggtcacagagatggacatgatgttgccaggggacaatttggggagggagggaagtagc ccctgggaaccagtcacaggtattgcgcagcgatgcacggccatcaagtaccacttttct cagcccatccgcttgcgaaacattccttttaatttaaccaagaccatacagcaagatgag tggcacctgcttcatttaagaagaatcactgctggcttcctcggcatggccgtagccgtc cttctctgcggctgcattgtggccacagtcagtttcttctgggaggagagcttgacccag cacgtggctggactcctgttcctcatgacagggatattttgcaccatttccctctgtact tatgccgccagtatctcgtatgatttgaaccggctcccaaagctaatttatagcctgcct gctgatgtggaacatggttacagctggtccatcttttgcgcctggtgcagtttaggcttt attgtggcagctggaggtctctgcatcgcttatccgtttattagccggaccaagattgca cagctaaagtctggcagagactccacgcttctacacaacactcccggacaccttcctcaa acatgctgcattgcttcaaatcctttcctgaccagttacgctccgcacatgaattgcctg gcaacccataaccagatcatagactctttcaaggaggaatctgtgccttgtcccggtgct cagcccacccagtgcctgatagctgtcacagcagccattcaggcagggccagtttaccaa ggagcattcccagaacctgacatggagcaagtactcactaactgcctgcatattaatgct ttgtgtcaccagacaggagcagctgaacgtgaaggatgccaatggaatgagttgaacgaa caaggtgcggaataa >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_4|416_aa MQRLINEDPGSWLNAISIWKNLLELDAKKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQ KIEENRDCQLEKQIKEETLEQRDFTTKSEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCRC SGTIGKAFTAQVLYMGLSPGPRLQMHMRSQQCREERTPQTVAITRPHQLALTKFSDVKGH WVSLASRAYIKTAGLRSTIAWAMASLADIKAGAFVLDPMCGLGTILLEAAKEWPDVYYVG ADVSDSQLLGTWDNLKAAGLEDKIELLKISVIELPLPSESVDIIISDIPFGKKFKLGKDI KSILQEMERVLHVGGTIVLLLSEDHHRRLTDCKESNIPFNSKDSHTDEPGIKKCLNPEEK TGAFKTASTSFEASNHKFLDRMSPFGSLVPVECYKVSLGKTDAFICKYKKSHSSGL >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_4|1251_bp atgcaaagacttataaatgaagatccaggaagttggttgaatgccatttcaatttggaaa aatcttcttgaacttgatgcaaaaaaggaaaaactttctcagagagatgataaccaacta aaaagaaaagtgggagaaaatgaaatcattgcaaagaaattaaaaatagaacaaatgcaa aagatagaagagaatagggactgccagctggaaaaacaaataaaagaagaaactctggag caaagagattttaccactaaaagcgaaaagtttcaagaagaagaatttcagaatgacata gagaaagcaattgatactcataatcagaatgacttgactttcagagtatcttgtcgctgc agtggaactattggaaaggccttcactgcacaggtcctgtacatgggattgtcaccagga cctagactgcaaatgcacatgaggtctcagcaatgtagagaagagcgaacaccacaaact gttgccatcaccagacctcatcagttggcccttacaaagttcagtgatgttaaaggccat tgggtttccctagccagcagagcttacatcaagacagctggactgcgatctacaatagcg tgggcaatggcatctctggctgacattaaggctggtgcatttgttttagatccaatgtgt ggacttggaacaatacttttggaagctgctaaagaatggccagatgtgtattatgtaggt gctgatgtcagcgactcacagttactaggtacttgggacaatctgaaagctgcaggcctt gaggataaaattgaattacttaaaatctctgttatagaattgccattgccttcagaaagt gttgatattattatttctgacattccatttgggaaaaagtttaagttaggaaaagacatc aaaagcattctacaagaaatggaaagagtgcttcatgttggcggaaccattgtattgttg cttagtgaagatcaccacaggcgccttacagattgtaaagagagcaacatccctttcaat tccaaggacagtcacacagatgaacctggaattaaaaagtgcttgaatcctgaagaaaaa actggtgcattcaagacagcgtcaacttcattcgaagccagtaaccacaaattcttagac agaatgtcaccatttggctccttggtaccagtggaatgctacaaagttagccttggaaag acagatgcgttcatatgtaaatataagaagtcgcactcttctggactgtag >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_5|80_aa MVASLGRSGSACQSTSGVGDRRVEQRGAPATLSRVHDWGILEEEIIIEDDCSVCVIAPEY LPVGRDKEVKDSDFDDPDTV >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_5|243_bp atggtggcttctctcggccggagcggaagcgcctgccagtcaacctcgggggtcggcgac cgtcgcgtggaacagagaggggcgccagcgacgctttcccgcgtccacgactggggtatt ctagaagaagagattatcatagaagatgactgctctgtgtgtgttattgcccctgaatac ctcccagtgggtagagataaggaggtgaaagacagtgactttgatgatcctgacactgtg tag >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_6|127_aa MHALLGQLGQLDPMRKWKKVVRLDVPSKFHAEMQSPVLEVGPDGRCLGHGGGSLMAWCCP RDKEKQKVFQKIQQSFTDQNQHVIFVTYSLFIPPSAFEIPNKNLLVLRLKGHHGTCRHVM SPPEAQL >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_6|384_bp atgcatgctctcttggggcaacttggtcaactggatcccatgagaaaatggaagaaggta gtacgtttggatgtcccttcgaaatttcatgctgagatgcaatctccagtgttggaggtg ggacctgatgggaggtgtttgggtcatgggggtggatccctaatggcttggtgctgtccc cgagataaagaaaagcaaaaagttttccagaaaatccagcaaagtttcactgaccagaac cagcatgtgatctttgtgacttactccctgttcatacccccctccgcttttgaaatccct aataaaaacttgctggttttgcggctcaaggggcatcacggaacctgccgacatgtgatg tcacctccagaggcccagctgtaa >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_7|361_aa MLKTLNKLGIDGKHLTIIRAIYDKPTTNIVLIGQKLEALPLKTGTRQGCPLSSFLIKTVL EVLTRAIRQEKEIKGIQTGREEVKLSLFADDLIVYLEKPFVSTPKLLKLIRNFSKVSGYK INVQKSQAFLYTNNRQTESQIMSELPSTIATKRIKYLGIQLTRDMKDHFKENYKPLLKEI KEDTNKRKNIPCLWIERINIVKMAILPKVIYRFNAIPIKLPLTFITELEKTTLNFIWNQK IARIAKTILSKKNKAGSIMLPDFKLYYKATVTKAAWFWYQNRQIDQWNRTEASEVTPHIY NHLIFDKPDKNKKWGKDSLFNKWCWENWIAICRKLKLDPFLIPYTKINSRWIKDLNVRPK P >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_7|1086_bp atgctaaaaacactcaataaattaggtattgatggaaagcatctcacaataataagagct atttatgacaaacccacaaccaatattgtactgattgggcaaaagctggaagcactccct ttgaaaactggcacaagacaaggatgccctctctcatcattcctaatcaaaacagtatta gaagttctgaccagggcaatcaggcaagagaaagaaataaagggtattcaaaccggaaga gaggaagtcaaattgtccctgtttgcagatgacttgattgtatatttagaaaaacccttt gtctcaaccccaaaactccttaagctgataagaaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcctatacaccaataatagacaaacagagagccaa atcatgagtgaactcccatccacaattgctacaaagagaataaaatacctaggaatacaa ctaacaagggatatgaaggaccacttcaaggagaactacaaaccactgctcaaggaaata aaagaggacacaaacaaacggaaaaatattccatgcttatggatagaaagaatcaatatc gtgaaaatggccatactgcccaaagtaatttatagattcaatgctattcccatcaagcta ccattgaccttcatcacagaattagaaaaaactactttaaatttcatatggaaccaaaaa atagcccgtatagccaagacaatcctaagcaaaaagaacaaagccggaagcatcatgcta cctgacttcaaactatactacaaggctacagtaaccaaagcagcatggttctggtaccaa aacagacaaatagaccaatggaacagaacagaggcctcagaagtaacaccacacatctac aaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctattt aataaatggtgttgggaaaactggatagccatatgcagaaaactgaaactggaccccttc cttataccttatacaaaaattaactcaagatggattaaagacttaaacgtaagaccaaaa ccataa >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_8|272_aa MHTYTSTDPAANAPTKHFGQHPPLEYCCQWTWNISVPLEQQGLTDTSYRRAPAGICWVPL REKFPEEGASSSFWCSAASAGATQANSVWSGPPVNSSTPAEERPIRRKTNKHKAIASTST KRTPMQKTPSKGDQHQRPKEAKNLDKRIQELTTRITSIEKNINGLTELKNTAQKLQDPYT SINSQTNLAKERTSETEVQLNEIKYGHRIREKRMKMNEQSLQEVWDFGKRPNLHLIGVSE SEGENGTKLENTLQDNIQDNFPNLARQANTQI >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_8|819_bp atgcacacatacacaagcacagaccctgctgccaacgccccgacaaagcactttggccag catcccccactggagtattgttgccagtggacctggaacatctcagtccctctggaacag caggggttgacagacacctcatacaggagagctccagctggcatctgttgggtgcccctc agggaaaagtttccagaggaaggagcaagcagctccttctggtgttctgcagcctctgct ggtgctacccaggcaaacagcgtctggagtggacctccagtaaactccagcacacctgca gaagagcggcctattagaagaaaaactaacaaacacaaagcaatagcatcaacatcaaca aaaaggacacccatgcaaaaaacaccatccaaaggtgaccaacatcaaagaccaaaggaa gctaagaaccttgataaaaggatacaggaactgacaactagaataaccagtatagagaag aacataaatggcctgacggagctgaaaaacacagcacaaaaacttcaggacccatacaca agtatcaatagccaaaccaatctagccaaagaaaggacatctgagactgaagttcaactt aatgaaataaagtatggacacaggattagagaaaaaagaatgaaaatgaatgaacaaagc ctccaagaagtatgggactttgggaaaagaccaaatctacatttgattggtgtatctgaa agtgaaggggagaatggaaccaagttggaaaacacacttcaggataatatccaggacaac ttccccaacctagcaagacaggccaacacacaaatttag >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_9|125_aa MKRNKQSLQEIWDNVKRPNLRFNGVPESHEENGTKLENTLQDIIQENFPNLAREPNTKIQ DTQRTLQRYSSRRPTPRHIIIRFTKVERKEKMLRAAREKGQVTHNRKPIRLTADLSAENL QARRE >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_9|378_bp atgaaaaggaacaaacaaagcctccaagaaatatgggacaatgtgaaaagaccaaaccta cgtttcaatggtgtacctgaaagtcatgaagagaatggaaccaagttggaaaacactctt caggatattatccaggaaaacttccccaacctagcaagagagcccaacactaaaattcag gatacacagagaacactacaaagatactcctcaagaagaccaaccccaagacacataatc atcagattcaccaaggttgaaaggaaggaaaaaatgttaagagcagccagagaaaagggt caggttacccacaataggaaacccatcagactaacagctgatctctctgcagaaaaccta caagccagaagagagtag >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_10|101_aa MGHVARLHFPPAETTLDFTVRGMQRSGRGGDSGTGDCKAPLSDFRELGSQRNTGPWLQCS GKNRDAEKVGFREVWAQEEESLMVRTLEEPEEPATPQHQPL >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_10|306_bp atggggcatgtggcacgtttgcacttccctcctgcagaaacaacactggatttcactgtt aggggcatgcaaaggagtggcaggggtggagacagtggcactggtgactgcaaagcgcct ctcagtgacttccgggagttgggttctcagaggaacactgggccatggctgcagtgttca ggaaaaaaccgagatgcagaaaaagtgggcttcagggaggtttgggctcaggaagaagag tctttaatggtgaggactttagaagaaccggaagaaccagctacacctcagcaccagccc ctatga >gi568815596r:39636738_39879239|GENSCAN_predicted_peptide_11|199_aa MTGTNKDGLKYLCKPFDVASKAWLHCQKKLQRVPAQKLIIHVSCNVLEMGNRFAKILQQA GSSDKCLNNAQLINNRIQKGEGNTTNQAELFSVKGEKKKKDQGKIGEGKYKILGENSQTM FFLCSHTTTQSPTQKTSVTKGVEFFLHIPGGRHQLGVFQFNSGTIYAEDSVRSHRSEAQS PRLPLFSQTQVTSSNIWNF >gi568815596r:39636738_39879239|GENSCAN_predicted_CDS_11|600_bp atgactggcactaataaagatggtttaaaatatctttgcaagccttttgatgttgcttca aaagcatggttacattgtcagaagaagctgcagcgtgtacctgctcagaaactcatcatc cacgtctcctgcaatgtgttggagatgggaaataggtttgcaaaaattctacaacaagct ggttcaagtgataagtgcctcaacaatgcgcaattgattaataatcgtatccagaaaggg gaaggaaacacaacaaaccaggcagaactattttctgttaagggggaaaaaaaaaaaaaa gatcaggggaagattggggaaggcaaatataagattcttggagaaaactctcaaaccatg tttttcctctgttctcacacaaccacccaatcaccaactcagaagacttctgtgaccaaa ggtgtggagtttttccttcacataccaggtggcagacaccagctgggtgtcttccaattc aattctggcactatctatgctgaagacagtgtcagatcccataggtcagaggctcagtcc ccaagactgcccctcttctcccagacacaagtcacaagttcaaacatctggaacttctga