GENSCAN 1.0    Date run:  3-Nov-116    Time: 07:44:19

Sequence gi568815594r:24903454_25130639 : 227186 bp : 43.87% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   1396   1391    6                               1.05
 1.04 Term -   3396   3340   57  0  0  118   49    40 0.583   0.79
 1.03 Intr -   8686   8567  120  1  0   72  111     4 0.500   1.79
 1.02 Intr -   9585   9373  213  0  0   13   93   249 0.722  16.81
 1.01 Init -  15851  15654  198  2  0  103   94     5 0.175   1.54
 1.00 Prom -  18482  18443   40                              -2.96

 2.00 Prom +  24516  24555   40                              -2.26
 2.01 Sngl +  35985  37001 1017  2  0   77   43   433 0.899  34.74
 2.02 PlyA +  37229  37234    6                               1.05

 3.00 Prom +  37398  37437   40                              -6.36
 3.01 Sngl +  39173  39781  609  1  0   58   38   283 0.857  16.50
 3.02 PlyA +  39878  39883    6                               1.05

 4.00 Prom +  40625  40664   40                              -2.46
 4.01 Sngl +  57918  58253  336  2  0   86   55   181 0.609  10.63
 4.02 PlyA +  59428  59433    6                               1.05

 5.00 Prom +  62972  63011   40                              -5.76
 5.01 Init +  63482  64243  762  1  0   36  -18   342 0.183  13.10
 5.02 Intr +  64859  65044  186  1  0  -23   44   216 0.155   5.99
 5.03 Intr +  66124  66271  148  0  1  117   80   114 0.953  13.31
 5.04 Intr +  66766  66966  201  2  0   20   50   158 0.644   4.46
 5.05 Intr +  67613  67730  118  0  1   73   48    59 0.170  -0.08
 5.06 Term +  70713  70773   61  0  1   76   53    47 0.055  -2.72
 5.07 PlyA +  71721  71726    6                               1.05

 6.13 PlyA -  71741  71736    6                               1.05
 6.12 Term -  72249  72215   35  1  2  101   42    37 0.249  -1.85
 6.11 Intr -  72915  72840   76  0  1   92   11    86 0.270   0.39
 6.10 Intr -  75530  75351  180  2  0   50   81   112 0.709   6.76
 6.09 Intr -  76236  76153   84  0  0   92   66    54 0.798   3.62
 6.08 Intr -  85272  85099  174  0  0  108   81    19 0.228   3.34
 6.07 Intr - 100815 100118  698  1  2   32  -19   937 0.004  68.61
 6.06 Intr - 109046 108882  165  0  0   54   89   216 0.329  18.23
 6.05 Intr - 114705 114536  170  2  2   60   66   149 0.829   9.49
 6.04 Intr - 121438 121367   72  0  0   90   95    35 0.597   2.92
 6.03 Intr - 123486 123415   72  2  0   74   95    64 0.645   4.22
 6.02 Intr - 125125 125054   72  0  0   85   68    50 0.488   1.22
 6.01 Init - 127240 127044  197  1  2   93   76   355 0.985  31.20
 6.00 Prom - 163654 163615   40                              -5.36

 7.04 PlyA - 164863 164858    6                               1.05
 7.03 Term - 168290 168183  108  2  0   52   43   100 0.894   0.41
 7.02 Intr - 168720 168584  137  1  2   87   98    76 0.988   8.79
 7.01 Init - 171758 171734   25  2  1   92  106    15 0.614   2.44
 7.00 Prom - 191575 191536   40                              -6.36

 8.00 Prom + 197249 197288   40                              -0.86
 8.01 Init + 204104 204158   55  1  1   89  113    28 0.827   6.91
 8.02 Intr + 204498 204620  123  1  0   75   78    49 0.829   3.16
 8.03 Intr + 207640 207824  185  2  2   74   53    79 0.321   2.51
 8.04 Intr + 208390 208468   79  0  1   78   83    28 0.354   0.42
 8.05 Term + 211245 211315   71  1  2  105   49    45 0.342   0.40
 8.06 PlyA + 212008 212013    6                               1.05

 9.03 PlyA - 212278 212273    6                               1.05
 9.02 Term - 220772 220478  295  1  1  101   44   219 0.972  13.58
 9.01 Intr - 222331 222241   91  2  1   78  111    21 0.563   2.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100815  99998  818  1  2   32   49   905 0.913  74.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_1|195_aa
MAIYSKSLEDRGLCPGCHPHSLVQHIREAQERLIELMAQYVKGDPAKPLLRHLKIVFPVS
WKQMGLADVESTALAAARGGGAAARRRRALSGPRGARAAAATRRREEEEAAGVQEAGQRM
EEEAMNGDRTESDWQGLDPMLFALQFPLRVCRPAGRSCRPLQVRGNADFSLSPGGNQVSS
GPLAVCTTSMLLLDE

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_1|588_bp
atggctatctactccaaaagccttgaggacagaggtctgtgtcctggttgtcatcctcac
agcctagttcagcatataagagaagctcaagaaaggcttattgagttaatggcccagtat
gttaaaggagaccctgctaagcctctcttacggcaccttaagattgtttttcctgtatcg
tggaaacaaatggggctagctgacgtggagtcaacagcgctggcggccgcgagaggcggc
ggcgcagcggctcggaggcgccgggccctctcggggccgcgcggggcccgggcggcggcg
gcgacgcgacgtcgcgaggaggaggaggcggcgggggtccaggaggccggccagcgcatg
gaggaggaggccatgaacggcgaccggactgagagcgactggcaggggctggatcctatg
ctgtttgctcttcagttccccctaagagtttgccggcctgccgggagaagctgccggcct
ctccaggtgagagggaatgcggatttctccctctcacctggaggcaaccaggtgtcctct
ggccctctggctgtctgcaccacatccatgctacttttggatgaatga

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_2|338_aa
MRKKQSRKTGNSKNQSASPPPKERSSSPATEQSWMENDFDELREKGFRRSNYSELKEEVQ
TNGKEVKNVEKKIDEWITRITNAQKSLKDLMELKTTARELRDERTSLSNRCDQVEERVSA
MEDEMNETMREEKFREKRIKRNKQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNPARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMREKMLRAARKKGWV
THKRKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEALNMERNNWYQPLQKHAKL

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_2|1017_bp
atgcggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct
ccaaaggaacgcagctcctcaccagcaacggaacaaagctggatggagaatgactttgac
gagttgagagagaaaggcttcagaagatcaaactactctgagctaaaggaggaagttcaa
accaatggcaaagaagttaaaaacgttgaaaaaaaaatagacgaatggataactagaata
actaatgcacagaagtccttaaaggacctgatggaactgaaaaccacagcacgagaacta
cgcgacgaacgcacaagcctcagtaaccgatgcgatcaagtggaagaaagggtatcagcg
atggaagatgaaatgaatgaaacgatgcgtgaagagaagtttagagaaaaaagaataaaa
agaaacaaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctc
attggcgtacctgaaagtgacggggagaatggaaccaagttggaaaacactttgcaggat
attatccaggagaacttccccaatccagcaaggcaggccaacattcaaattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgagggaaaaaatgttaagggcagccagaaagaaaggttgggtt
acccacaaaaggaagcccatcagactaacagctgatctcttggcagaaactctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca
ctaaacatggaaaggaacaactggtaccagccactgcaaaaacatgccaaattgtaa

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_3|202_aa
MIVYLENPIISAQNLLKLISNFSKGSGYKINVQKSKAFLYTNNRQTESQIMSELPFTIAS
KRIKCLGIQLTRDVRELFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY
RFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITIPDFKLYYKATV
TKTAWYWYQEIKTDGTEQSPQK

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_3|609_bp
atgattgtatatctagaaaaccccatcatctcagcccaaaatctccttaagctgataagc
aacttcagcaaaggctcaggatacaaaatcaatgtgcaaaaatcaaaagcattcttatac
accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca
aagagaataaaatgcctaggaatccaacttacaagggatgtgagggaactcttcaaggag
aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca
tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat
agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact
actttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaa
aagaacaaagctggaggcatcacgatacctgacttcaaactatactacaaggctacggta
accaaaacagcatggtactggtaccaagagataaagaccgatggaacagaacagagccct
cagaaataa

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_4|111_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLNFIWTQKRAHIAKSILSQNNKAGGIMLPD
FKLHYKATVTKTTWYWYQNRDTDQWNRTEPSEIMLHIYNYLIFDKPDKNKK

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_4|336_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaacttcatatggacccaaaaaagagcc
cacattgccaagtcaatcctaagccaaaataacaaagctggaggcatcatgctacctgac
ttcaagctgcactacaaggctacagtaaccaaaacaacatggtactggtaccaaaacaga
gatacagaccaatggaacagaacagagccctcagaaataatgctgcatatctacaactat
ctgatctttgacaaacctgacaaaaacaagaaatga

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_5|491_aa
MLVDTGADCSLIYGNPDKFLGKPAFIDSYGGRSVKVKPVSQHPDIGCLAACLYTVYVSPI
PKYILEVDILHGLALQAMAREFGLSACDGHMHHQPQVLPQPRWVTSTHQYHLLGGYTEIT
ETIKKLEEVQKMCGTHSTYNSLVWAVRKPDGTWQMTVDYQELNKVTPPSHAAVPSIMDLI
ANLMTELGQYHYVVDLANAFFSTDISPESQEQFTFMWDGQRWTSTVLLQGYVHSPTICHD
LVATDLVTWQCPEGACESVTGWAAVIVQTTYPIAGMDAFTGNKFPDWDIADIHFGKVGYL
LGAAEYAEYKSLSNRVGWSSKLDLVLLTLNERPQKGSQTPVEALLHQGTAPTQLQIHSKD
DLLRPAHTYTEMTGVSSCWICTTFPATAVDGWPRHIHSASADNCTWLETWTPVADAWNAM
QQILDKGHLKTQGTQCCTFIPDNWQNITAALQRVSWEIKVVKTLTDDPLQSWAEGDPQVP
AAFPTNQPSVV

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_5|1476_bp
atgctggtggatactggtgcagattgcagcctcatttatgggaacccagataagtttctg
ggaaaacctgctttcattgacagttatggaggacggtcagtgaaagtgaaacctgtatct
cagcaccctgacatcggctgtttggctgcctgtttatatactgtgtatgtctctcccata
ccaaaatacattctggaggtggatattttgcatggcctggcattacaagccatggccagg
gaattcggactgagtgcatgtgacggacatatgcatcaccagcctcaggtcctgccacaa
ccacgatgggttacttccacccatcaataccatttgctgggtgggtatacagagataact
gagacaattaaaaagctggaggaggtgcagaaaatgtgtggcacccacagcacctacaat
tctctggtgtgggcagttagaaagcctgatggaacttggcagatgacggtggactatcag
gaactgaataaagtaacacccccttcgcatgcagctgtaccatcaatcatggatttgata
gccaatttgatgacagaactgggacagtaccactatgtagtggacttggccaacgcattt
ttctccacagacatcagtccagagagccaggaacagttcaccttcatgtgggatgggcaa
cgatggacttctacagtgttgctgcagggctatgtgcatagccccaccatatgtcatgat
ctagttgccacagacttagtcacctggcaatgtccagaaggggcttgtgagagtgtgaca
ggatgggctgcagtcattgtgcagacgacttacccaatagcaggaatggatgcattcacg
ggtaacaaattcccagactgggacattgcagacatccactttggcaaagtggggtaccta
cttggagcagcagagtacgctgagtacaagtcccttagcaacagagttggctggagttcc
aagctggacctggtgctcctaaccttgaacgaacggccacagaaaggcagccagacccca
gtggaggctttgttacaccagggcactgcccccactcagttgcagatacattccaaggat
gacctccttcgaccagcccacacctacactgagatgaccggtgtttccagctgttggatc
tgcaccacctttccagcaacagctgtggatggctggcctcggcacatacattcagcatct
gcagacaactgcacatggctggagacttggacacctgtggctgatgcctggaacgcaatg
cagcaaattttggacaaaggacacctcaagacccaaggaacacaatgttgtacctttatc
cctgacaattggcagaatataacagcagccctgcaaagggtctcatgggagattaaggtg
gtcaagacccttactgacgaccccctacagagttgggcggaaggtgacccgcaggtgccg
gctgctttccccaccaatcagccatctgtggtttga

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_6|664_aa
MALRRGGCGALGLLLLLLGAACLIPRSAQVRRLARCPATCSCTKESIICVGSSWVPRIVP
GDISSLSLVNGTFSEIKDRMFSHLPSLQLLLLNSNSFTIIRDDAFAGLFHLEYLFIEGNK
IETISRNAFRGLRDLTHLDLRGNKFECDCKAKWLYLWLKMTNSTVSDVLCIGPPEYQEKK
LNDVTSFDYECTTTDFVVHQTLPYQSVSVDTFNSKNDVYVAIAQPSMENCMVLEWDHIEM
NFRSYDNITGQSIVGCKAILIDDQVFVVVAQLFGGSHIYKYDESWTKFVKFQDIEVSRIS
KPNDIELFQIDDETFFVIADSSKAGLSTVYKWNSKGFYSYQSLHEWFRDTDAEFVDIDGK
SHLILSSRSQVPIILQWNKSSKKFVPHGDIPNMEDVLAVKSFRMQNTLYLSLTRFIGDSR
VMRWNSKQFVEIQALPSRGAMTLQPFSFKDNHYLALGSDYTFSQIYQWDKEKQLFKKFKE
IYEYCRSSLGASKCRYSPEKWLNAPVSGPTSQGAEIHTFSWLVMLMLSQVGTWKQPAGAK
EGNDDMTYPSALFPDFETTGNLLNCRNQGLLFGVTQGKNKVHRFQKGAMSVGQPATDVIL
LRNPKDTPRSEKPADFPSGYGAFYQRKPPFTLTQAWFQGHTVPKQGGYLQMASRVDTGLS
DSAQ

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_6|1995_bp
atggcgctgcggagaggcggctgcggagcgctcgggctgctgctgctgctgctgggcgcc
gcgtgcctgataccgcggagcgcgcaggtgaggcggctggcgcgctgccccgccacttgc
agctgtaccaaggagtctatcatctgcgtgggctcttcctgggtgcccaggatcgtgccg
ggcgacatcagctccctgagcctggtaaatgggacgttttcagaaatcaaggaccgaatg
ttttcccatctgccttctctgcagctgctattgctgaattctaactcattcacgatcatc
cgggatgatgcttttgctggactttttcatcttgaatacctgttcattgaagggaacaaa
atagaaaccatttcaagaaatgcctttcgtggcctccgtgacctgactcacctagatttg
aggggtaataaatttgaatgtgactgcaaagccaagtggctatacctgtggttgaagatg
acaaattccaccgtttctgatgtgctgtgtattggtccaccagagtatcaggaaaagaag
ctaaatgacgtgaccagctttgactatgaatgcacaactacagattttgttgttcatcag
actttaccctaccagtcggtttcagtggatacgttcaactccaagaacgatgtgtacgtg
gccatcgcgcagcccagcatggagaactgcatggtgctggagtgggaccacattgaaatg
aatttccggagctatgacaacattacaggtcagtccatcgtgggctgtaaggccattctc
atcgatgatcaggtctttgtggtggtagcccagctcttcggtggctctcacatttacaaa
tacgacgagagttggaccaaatttgtcaaattccaagacatagaggtctctcgcatttcc
aagcccaatgacatcgagctgtttcagatcgacgacgagacgttctttgtcatcgcagac
agctcaaaggctggtctgtccacagtttataaatggaacagcaaaggattctattcttac
cagtcactgcacgagtggttcagggacacggatgcggagtttgttgatatcgatggaaaa
tcgcatctcatcctgtccagccgctcccaggtccccatcatcctccagtggaataaaagc
tctaagaagtttgtcccccatggtgacatccccaacatggaggacgtactggctgtgaag
agcttccgaatgcaaaataccctctacctttcccttacccgcttcatcggggactcccgg
gtcatgaggtggaacagtaagcagtttgtggagatccaagctcttccatcccggggggcc
atgaccctgcagcccttttcttttaaagataatcactacctggccctggggagtgactat
acattctctcagatataccagtgggataaagagaagcagctattcaaaaagtttaaggag
atttacgaatactgcaggtccagtctaggggcttctaagtgcaggtacagccctgagaag
tggctgaatgcccctgtttcagggcctacttcacaaggtgctgagatacacactttcagt
tggctagtcatgcttatgctttcccaggttggaacctggaaacagccagcaggtgcaaag
gaaggaaatgatgacatgacttacccatcagccctgtttcctgattttgaaaccacaggg
aatctgctcaattgtaggaaccagggtctccttttcggggtaactcagggtaagaacaaa
gtgcaccgcttccagaagggagccatgagtgttggacaaccagccacagatgtcatccta
ttgagaaatcctaaagacacgcccaggtcagagaagccagcagacttcccttctggatat
ggggcattttaccaaaggaaaccacccttcaccctgacacaggcctggttccagggccac
acagtgcccaagcaaggaggatacctgcagatggcctcaagggtggacacgggacttagt
gatagtgcacagtga

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_7|89_aa
MAGWVGAETGDSLDRILNVSVPQFPHLHNGDSERAIGRLNWEMQVKTVHCTWYTSLSFWS
PEVTQGAWSLTKEMKLEEGLEEPEEKMAP

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_7|270_bp
atggctggatgggttggagctgaaactggggacagcttggatcgaatacttaacgtctct
gtgcctcagtttcctcatctccacaatggggatagtgaaagggctattgggagactcaac
tgggagatgcaagtaaagactgtgcactgcacctggtacacaagtctgtccttctggagc
cctgaagtgacccaaggtgcctggagcctcaccaaggagatgaagttggaggaaggcctg
gaagagccagaagagaaaatggcaccctaa

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_8|170_aa
MPGAESLSPAVVFSGLIQGFVLEPAELTIEWPEAAADIFLLSLKDPGAYPSPTPSAISRG
HKLSPCGHRQCESAVLWAGVKVPADEWETSGRRGRAEALLRLLGPKSPASSLAQGAKSSE
KMRKLRCREVDLPKDLTYREAPVTLWGGRSESPGKDGTAHQLDVVRGTGN

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_8|513_bp
atgcccggggccgagagcctttcccctgctgttgtgttctcaggtctcatccaaggattt
gtcctggaaccagctgagcttactattgaatggcctgaggcagccgcagacatatttctg
ctgtcactcaaggatcctggagcgtatccaagcccaactcccagtgccatatctagagga
cacaagctgtcaccttgtggccaccgtcagtgcgagtcagcagtcctctgggcaggtgtg
aaggttccagctgatgaatgggaaacttcaggccggagagggagggcagaagccctactc
aggctcttgggtcccaagagccctgcatcctctttggctcaaggtgcaaaatcttctgag
aagatgaggaaactgaggtgtagagaggtagacttgcccaaggacctcacctacagggaa
gccccagtgaccctgtggggaggaaggtctgagtcaccaggcaaggacggaacagcacat
caactggatgtggtgagaggcactggaaactga

>gi568815594r:24903454_25130639|GENSCAN_predicted_peptide_9|128_aa
XMTLKTLDEHRDKAVTQLGSMLFTRQVSGARVVPLGSMQTVSGYTFRGFMSHTNNYPCAY
LNAASAIGMKMQDVDLFIKRLDRCLKAVRKERSKESDDNYDKTEDVDIEEMALKLDNVLL
DTYQDASS

>gi568815594r:24903454_25130639|GENSCAN_predicted_CDS_9|387_bp
nctatgacacttaaaacactagatgaacaccgtgacaaagctgtcactcagcttggctcg
atgctttttaccagacaggtttctggagccagggttgtgcctcttgggtccatgcaaact
gtgagtggctatactttcagaggctttatgtcacatacaaataattacccttgtgcttac
ctcaatgctgcatcagccatcggaatgaagatgcaggatgtggacctgttcataaagaga
cttgacaggtgtttaaaggcagtaagaaaagaacgaagtaaagagagtgatgacaattat
gacaaaactgaagatgtggatattgaagaaatggctttaaaactagataatgtacttctt
gacacataccaggatgcttcttcatga