GENSCAN 1.0	Date run: 2-Nov-116	Time: 20:55:59

Sequence gi568815595f:127952835_128169659 : 216825 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10355  10552  198  0  0  107   80   157 0.991  16.12
 1.02 Term +  21526  21680  155  2  2  105   48   119 0.978   7.68
 1.03 PlyA +  22141  22146    6                               1.05

 2.00 Prom +  26526  26565   40                              -4.86
 2.01 Init +  27219  27274   56  2  2   64   75    52 0.614   2.26
 2.02 Intr +  29250  29275   26  0  2  126   16     0 0.237  -5.73
 2.03 Term +  31263  31444  182  1  2   99   45   218 0.994  16.37
 2.04 PlyA +  32455  32460    6                               1.05

 3.12 PlyA -  33657  33652    6                              -0.45
 3.11 Term -  35561  35374  188  0  2   59   53    74 0.797  -1.35
 3.10 Intr -  36103  36023   81  2  0   79  100    44 0.922   4.31
 3.09 Intr -  37192  37100   93  2  0   44  111    60 0.902   3.84
 3.08 Intr -  39019  38849  171  2  0   87   58    66 0.897   3.41
 3.07 Intr -  40155  40079   77  2  2   38  105    67 0.963   2.56
 3.06 Intr -  41946  41749  198  2  0   81  105    15 0.475   0.97
 3.05 Intr -  45380  45279  102  1  0  126   32    81 0.707   5.59
 3.04 Intr -  51695  51622   74  2  2   35   96    55 0.216  -0.80
 3.03 Intr -  70194  70078  117  0  0   87   62    49 0.256   2.86
 3.02 Intr -  80968  80792  177  1  0   79   42   123 0.217   6.92
 3.01 Init -  83136  83107   30  0  0   62   80    34 0.478  -0.21
 3.00 Prom -  92692  92653   40                              -2.66

 4.00 Prom +  96241  96280   40                              -4.06
 4.01 Init +  98509  98512    4  0  1   62   72     0 0.216  -4.04
 4.02 Intr +  98872  99076  205  2  1   95  111    78 0.351   9.06
 4.03 Intr +  99927 100068  142  0  1   83  115    46 0.397   7.16
 4.04 Intr + 102682 102747   66  0  0   98  115    23 0.938   5.10
 4.05 Intr + 102839 102917   79  1  1   99   75     2 0.965  -0.88
 4.06 Intr + 103875 104006  132  1  0   79   47   105 0.955   6.12
 4.07 Intr + 107268 107377  110  1  2   58   91   113 0.984   8.60
 4.08 Intr + 107674 107827  154  0  1   57   94   121 0.989   9.25
 4.09 Intr + 112043 112203  161  0  2   99   89   249 0.999  25.81
 4.10 Intr + 114120 114317  198  2  0   60  123   345 0.995  34.75
 4.11 Intr + 114587 114824  238  1  1   95   35    87 0.388   1.29
 4.12 Intr + 115111 115225  115  2  1   83  117   157 0.714  17.61
 4.13 Term + 116642 116828  187  2  1   97   52   257 0.902  19.96
 4.14 PlyA + 118825 118830    6                               1.05

 5.18 PlyA - 118887 118882    6                               1.05
 5.17 Term - 124355 124090  266  0  2   71   38   197 0.680   8.67
 5.16 Intr - 126295 126198   98  0  2   49   94    70 0.375   3.35
 5.15 Intr - 128575 128420  156  0  0  115   44   264 0.446  23.83
 5.14 Intr - 129740 129649   92  2  2   67   83   124 0.996   8.59
 5.13 Intr - 134974 134872  103  0  1   63   99   123 0.856  11.08
 5.12 Intr - 144664 144466  199  2  1   90   91   368 0.998  35.71
 5.11 Intr - 146111 146048   64  2  1   80   91    60 0.942   3.79
 5.10 Intr - 147910 147761  150  1  0   96   84   112 0.993  11.96
 5.09 Intr - 148814 148725   90  2  0   91   90    71 0.995   7.79
 5.08 Intr - 152090 151939  152  0  2   97   87    97 0.718  10.38
 5.07 Intr - 160186 160054  133  1  1   93  111   105 0.990  13.52
 5.06 Intr - 161919 161722  198  0  0   97   42    65 0.763   2.35
 5.05 Intr - 170907 170576  332  1  2   52   68   272 0.252  16.95
 5.04 Intr - 171200 171057  144  0  0   35   52   115 0.238   2.85
 5.03 Intr - 180012 179802  211  0  1   85   61    34 0.019  -1.11
 5.02 Intr - 192174 192044  131  1  2   74   64    53 0.322   1.91
 5.01 Init - 197294 197141  154  2  1   49   59    99 0.389   3.25
 5.00 Prom - 197748 197709   40                              -6.66

 6.00 Prom + 198444 198483   40                              -4.16
 6.01 Init + 200674 200989  316  0  1   92   89   616 0.166  59.60
 6.02 Intr + 202800 202906  107  1  2  101   89   -20 0.102  -0.67
 6.03 Term + 205647 205751  105  2  0   44   49   110 0.445   1.11
 6.04 PlyA + 216245 216250    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 200674 201162  489  0  0   92   53   643 0.824  57.26

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_1|117_aa
XGIGCVGQDKGQVRKCLDVVEIYNPDGDFWREGPPMPSPLLSLRTNSTNAGAVDGKLYVC
GGFHGAGASMIRKEGSTAISFILYRHSSLRALLSLDARAADCEQVSVWAIMKGMSFN

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_1|354_bp
ngtggcattggctgtgtaggtcaagacaagggccaggttcgaaaatgccttgacgtggtg
gagatctacaacccagatggggacttttggcgagagggccctcccatgccaagtcccctc
ctctcactccgcaccaattccaccaatgcaggggcagtggatgggaaactctatgtctgc
gggggattccatggagcaggggcctccatgatacggaaagagggctccacagccatcagt
ttcatcctgtaccgtcactccagcctgagggcattgctctccctagatgccagagcagca
gattgtgaacaggtttctgtatgggccatcatgaagggaatgtccttcaactga

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_2|87_aa
MEKFELHQKSSKKRYSEEMLASIDSIKDRHEVISKEILELDPWENQWNVVAINVLMHDSY
DVCLVARMNPRDLIPPPSDLVEEGNEH

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_2|264_bp
atggagaaatttgaactgcaccagaaatcttctaaaaagaggtactcagaagaaatgctg
gcctctatagacagcattaaagatcgccatgaggttatctccaaagaaatattggaactg
gacccatgggaaaaccagtggaatgttgtagccatcaacgtcctcatgcatgacagctat
gatgtctgcctagtagccaggatgaatccccgagacctcatccccccgccttcagatttg
gtggaagaaggcaatgagcactaa

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_3|435_aa
MPPGKGFLRWEFSSGGQNQITNMNSGTISEPIQGLNKLAIKSCITGLKIEDQVENQRKRK
DIRKLMTFKCWDYRHEHLAPVSDLTCKDFKAAIVNMVKELKDTMKQRKDPLSPHNNTRRQ
LYVTDEEVEAQRGSSVAGEQKLTCTDVSVASITSIDSVECKHSKPMGELVSLGLPKTVLV
GSTVSWSMSSFSLVYYLILLEHILKEPLKNAWMVDVAVLKHGCTFIASSVIERAFTSSGS
SEISHKEPPVGYCTLVVRVGLVPSATDTAQPTTAQLQSLNASFSAWSVALGILHPAPHIL
PITSYVLHHPQDILGLSPSASKWLEATEHNGSLPGLSLVLLSTHRKRRWLQFQGAGRLAK
QKIAHLFLNITFKANTDILYPRLHPTHTVVLIPLLRFAQAPPNLAPNMRLAGLPGCSTLL
SGSGLWMMFLEDFLA

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_3|1308_bp
atgcccccgggcaaaggctttttgcgctgggagttcagctcaggtgggcaaaaccagata
actaacatgaacagtggaacaatcagtgaaccaatacaagggttaaataagctagcaatt
aaaagctgtatcactggtctaaagatagaagatcaagtagaaaatcagcgcaagaggaaa
gatatacgaaaactaatgaccttcaagtgctgggattacaggcatgagcacctggcccca
gtgtcagatttaacttgcaaagatttcaaagcagccattgtaaatatggtcaaagaacta
aaggacaccatgaagcaaaggaaggaccctttgagtcctcacaacaacactagaaggcag
ctctatgtcacagatgaggaagttgaggcacagagaggatcctcagtggctggtgagcag
aagctgacatgcacggatgtgtcggttgcctctattacgagtattgactcagttgagtgc
aaacacagcaagcccatgggtgagttggtcagcctgggacttcccaagactgtcctagta
ggatccactgtttcctggtccatgtcttccttttccttggtttactaccttattctgctt
gagcacatcctcaaggagcctcttaagaatgcatggatggtagatgtggcagttttgaaa
catggctgcacatttattgcctcttctgtcattgagagggctttcacttcatctggaagt
tctgaaatttcccataaggagcctcctgtgggctactgcacacttgttgtgagagtgggc
ctcgtcccaagtgcaactgacacagctcagcccacgacagcacagttgcagagcctgaac
gccagcttctcagcctggagcgtggccttgggcatcctccaccctgctcctcacatcctg
cctataacatcctacgttttgcaccatcctcaagatatcttgggattaagcccaagtgcc
tccaagtggcttgaagccactgagcacaacgggagcctcccaggattgtctctggtgctc
ctgagcacccacagaaaacggcggtggctgcagttccagggggcaggccgacttgctaag
cagaaaatagcccatctcttcctcaacatcacttttaaagcaaacacagacattctctat
cctcggctgcacccaacccacactgttgttctgatcccactgctcaggttcgcccaggct
ccccccaacttggcccccaacatgcgtcttgctggactgccaggctgctccaccctgctt
agtggctctgggctgtggatgatgttcttagaggacttccttgcctga

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_4|596_aa
MESIKARHKGTLPGGLPEFFCEADYFPSCPGPSPAKSLQKLIPFVLRWKGTHTHDKPPGL
LNDSDSEGARRRPWVARKVTSLCPNSSCFVSPIKVKFLEVIKPFCVILPEIQKPERKIQF
KEKVLWTAITLFIFLVCCQIPLFGIMSSDSADPFYWMRVILASNRGTLMELGISPIVTSG
LIMQLLAGAKIIEVGDTPKDRALFNGAQKLFGMIITIGQSIVYVMTGMYGDPSEMGAGIC
LLITIQLFVAGLIVLLLDELLQKGYGLGSGISLFIATNICETIVWKAFSPTTVNTGRGME
FEGAIIALFHLLATRTDKVRALREAFYRQNLPNLMNLIATIFVFAVVIYFQGFRVDLPIK
SARYRGQYNTYPIKLFYTSNIPIILQSALVSNLYVISQMLSARFSGNLLVSLLGTWSDTS
SGGPARAYPVGGLCYYLSPPESFGSVLEDPVHAVVYIVFMLGSCAFFSKTWIEVSGSSAK
DVSRSKTFWKGDESVTASNSNFSPVGTLQVAKQLKEQQMVMRGHRETSMVHELNRYIPTA
AAFGGLCIGALSVLADFLGAIGSGTGILLAVTIIYQYFEIFVKEQSEVGSMGALLF

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_4|1791_bp
atggaatctatcaaggctcggcacaaaggtactctcccaggaggccttcctgaattcttc
tgtgaggcagactacttcccgagctgccccggcccttctccggccaagtctctccagaaa
ctcatcccattcgttctcagatggaaggggacccacacccacgacaagcctccgggtttg
cttaatgactcagacagcgagggtgctcgaaggcgtccctgggtagcgcggaaggttact
tctctgtgtcccaactcttcctgttttgtttctcccatcaaagtcaaatttctggaagtc
atcaagcccttctgtgtcatcctgccggaaattcagaagccagagaggaagattcagttt
aaggagaaagtgctgtggaccgctatcaccctctttatcttcttagtgtgctgccagatt
cccctgtttgggatcatgtcttcagattcagctgaccctttctattggatgagagtgatt
ctagcctctaacagaggcacattgatggagctagggatctctcctattgtcacgtctggc
cttataatgcaactcttggctggcgccaagataattgaagttggtgacaccccaaaagac
cgagctctcttcaacggagcccaaaagttatttggcatgatcattactatcggccagtct
atcgtgtatgtgatgaccgggatgtatggggacccttctgaaatgggtgctggaatttgc
ctgctaatcaccattcagctctttgttgctggcttaattgtcctacttttggatgaactc
ctgcaaaaaggatatggccttggctctggtatttctctcttcattgcaactaacatctgt
gaaaccatcgtatggaaggcattcagccccactactgtcaacactggccgaggaatggaa
tttgaaggtgctatcatcgcacttttccatctgctggccacacgcacagacaaggtccga
gcccttcgggaggcgttctaccgccagaatcttcccaacctcatgaatctcatcgccacc
atctttgtctttgcagtggtcatctatttccagggcttccgagtggacctgccaatcaag
tcggcccgctaccgtggccagtacaacacctatcccatcaagctcttctatacgtccaac
atccccatcatcctgcagtctgccctggtgtccaacctttatgtcatctcccaaatgctc
tcagctcgcttcagtggcaacttgctggtcagcctgctgggcacctggtcggacacgtct
tctgggggcccagcacgtgcttatccagttggtggcctttgctattacctgtcccctcca
gaatcttttggctccgtgttagaagacccggtccatgcagttgtatacatagtgttcatg
ctgggctcctgtgcattcttctccaaaacgtggattgaggtctcaggttcctctgccaaa
gatgtaagtagaagcaaaactttctggaagggtgatgaaagtgtgactgcctccaattcc
aacttctcccctgtgggcaccctgcaggttgcaaagcagctgaaggagcagcagatggtg
atgagaggccaccgagagacctccatggtccatgaactcaaccggtacatccccacagcc
gcggcctttggtgggctgtgcatcggggccctctcggtcctggctgacttcctaggcgcc
attgggtctggaaccgggatcctgctcgcagtcacaatcatctaccagtactttgagatc
ttcgttaaggagcaaagcgaggttggcagcatgggggccctgctcttctga

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_5|890_aa
MLIKGEAFELTKMQEKGQAAQNRGKLMKERAAQRKQQAQRQQQPKPCGTWRDRLPPHNDK
VAVSSSTKSPNQVSLALMKSCAPAPREKSSALPSQKGFLSVAITTGNVLVHLKPAWHWVS
PKARGKYCLATTDVYSRPMHSTQQMMNPAKSGFSSFKTLHSLLSQALDQIRGVVPLCSRP
QPRQEAPATGEANPETQTEVGDRSSSSSRDGRSPPGVCKMKIEEVKSTTKTQRIASHSHV
KGLGLDESGLAKQAASGLVGQENAREVWPVDQGVGGCKQGCCGERAAEVGGSGRDARARG
LPPLFPQGSENGPGARGGAPPWSWRFFSFALRRIPVFKVSALSSQVQMPNEAGNGDEGSG
WVHCGVSALSPRCSALLKVTMRLQAKQNQATALALAIAQELGSKVPFCPMVGSEVYSTEI
KKTEVLMENFRRAIGLRIKETKEVYEGEVTELTPCETENPMGGYGKTISHVIIGLKTAKG
TKQLKLDPSIFESLQKERVEAGDVIYIEANSGAVKRQGRCDTYATEFDLEAEEYVPLPKG
DVHKKKEIIQDVTLHDLDVANARPQGGQDILSMMGQLMKPKKTEITDKLRGEINKVVNKY
IDQGIAELVPGVLFVDEVHMLDIECFTYLHRALESSIAPIVIFASNRGNCVIRGTEDITS
PHGIPLDLLDRVMIIRTMLYTPQEMKQIIKIRAQTEGINISEEALNHLGEIGTKTTLRYS
VQLLTPANLLAKINGKDSIEKEHVEEISELFYDAKSSAKILADQQDKYMKSKMEVALTAD
LTAAGGDRVFFGCPRYQYPDQPGTVINGQAVRAGRPSARVSATDAAVRPARPALTPGQGP
GAINGRRAAGGGWAAAIRWADGRLPIARSVTRRHRHRHAASTQISAVSAA

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_5|2673_bp
atgcttattaagggagaggcatttgagctgacaaagatgcaagagaagggccaggctgca
cagaaccgagggaaattgatgaaagaacgtgctgcacagagaaaacagcaggctcagaga
cagcagcaacctaagccatgtggcacgtggcgagacaggctcccccctcacaatgacaaa
gtggctgtcagcagctccacaaagtccccaaaccaagtctccttggccctgatgaagtca
tgtgcccctgcaccaagagagaaatccagtgctctgcctagccagaaagggtttctttct
gtggccatcacaactgggaatgtgctggttcacctgaagccagcatggcactgggtctca
cccaaggctcgtggtaaatactgtctggctaccactgatgtttattcaaggcccatgcac
tctactcagcagatgatgaatcctgccaagtctgggttttcctccttcaagacgctgcat
tctcttttgtcccaggccctagatcaaatccggggcgtggtcccactgtgctcccgaccc
cagcctcggcaggaagcgccggctacgggggaagccaacccggagacacagacggaagtg
ggtgaccggagctctagcagcagccgcgatgggcgcagccctcccggcgtctgcaaaatg
aagattgaggaggtgaagagcactacgaagacgcagcgcatcgcctcccacagccacgtg
aaagggctggggctggacgagagcggcttggccaagcaggcggcctcagggcttgtgggc
caggagaacgcgcgagaggtgtggccagtggaccagggagttgggggctgcaagcagggc
tgctgcggcgagagagctgctgaagtcggtggctcggggcgggatgcgcgcgccaggggt
ctcccgccattatttcctcagggaagtgaaaatgggccaggggctcggggaggggcgccg
ccctggagctggagatttttttcctttgctcttagaagaataccagttttcaaggtgtca
gccttgagctcacaagtgcagatgcctaatgaggcaggcaacggggatgagggaagtggc
tgggtgcactgtggcgtgtctgccttgagccccaggtgctcggccttgctgaaggtcacc
atgagactgcaggccaaacagaaccaggctacagctctggctctggctattgctcaggag
ctgggtagtaaggtccccttctgcccaatggtggggagtgaagtttactcaactgagatc
aagaagacagaggtgctgatggagaacttccgcagggccattgggctgcgaataaaggag
accaaggaagtttatgaaggtgaagtcacagagctaactccgtgtgagacagagaatccc
atgggaggatatggcaaaaccattagccatgtgatcataggactcaaaacagccaaagga
accaaacagttgaaactggaccccagcatttttgaaagtttgcagaaagagcgagtagaa
gctggagatgtgatttacattgaagccaacagtggggccgtgaagaggcagggcaggtgt
gatacctatgccacagaattcgaccttgaagctgaagagtatgtccccttgccaaaaggg
gatgtgcacaaaaagaaagaaatcatccaagatgtgaccttgcatgacttggatgtggct
aatgcgcggccccaggggggacaagatatcctgtccatgatgggccagctaatgaagcca
aagaagacagaaatcacagacaaacttcgaggggagattaataaggtggtgaacaagtac
atcgaccagggcattgctgagctggtcccgggtgtgctgtttgttgatgaggtccacatg
ctggacattgagtgcttcacctacctgcaccgcgccctggagtcttctatcgctcccatc
gtcatctttgcatccaaccgaggcaactgtgtcatcagaggcactgaggacatcacatcc
cctcacggcatccctcttgaccttctggaccgagtgatgataatccggaccatgctgtat
actccacaggaaatgaaacagatcattaaaatccgtgcccagacggaaggaatcaacatc
agtgaggaggcactgaaccacctgggggagattggcaccaagaccacactgaggtactca
gtgcagctgctgaccccggccaacttgcttgctaaaatcaacgggaaggacagcattgag
aaagagcatgtcgaagagatcagtgaacttttctatgatgccaagtcctccgccaaaatc
ctggctgaccagcaggataagtacatgaagagcaagatggaagtggcgctgacagcagat
ctcacagccgcgggtggggacagagtcttcttcggctgcccaaggtaccaatatccagat
cagccaggcacggtgattaatggccaggccgtgcgggctgggcggccatcagcgcgcgtc
tcggcgacggatgcggctgtcaggccggcccggcccgcgctgacccccgggcaggggccc
ggcgcgatcaatgggcggcgcgcagcgggcggcggctgggccgccgcgataagatgggcc
gatgggcgcctgccgattgcccgctccgtcacccgccgtcaccgtcaccgtcacgccgcc
agcacgcagatcagcgccgtcagcgccgcgtga

>gi568815595f:127952835_128169659|GENSCAN_predicted_peptide_6|175_aa
MAGRRVNVNVGVLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSVPL
PARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGAIFCMPHMGIYIGQM
IIGLVADGLIGHWAFWVPAIMIATVTPTFINHHPDQSATITIEARPSTSKKIMTC

>gi568815595f:127952835_128169659|GENSCAN_predicted_CDS_6|528_bp
atggcagggcggcgggtgaacgtgaacgtgggcgtgctgggccacatcgacagcggcaag
acggcgctggcgcgggcgctaagcaccacagcctccaccgccgcctttgacaagcagccg
cagagccgcgagcgcggcatcacgctcgatctgggcttctcgtgcttctcggtgccgctg
cccgcgcgcctgcggtcgtctttgcccgagttccaggcagcgcccgaggccgagcccgag
cccggcgagccactgcttcaggtcacgctggtcgactgccccgggcacgcctccctcatc
cggaccatcatcggcgcaatattctgtatgccacacatgggaatatatattgggcagatg
ataattggcttggtggcagatggcctcattggtcactgggcattctgggtcccagctata
atgattgccacagtcaccccaaccttcatcaaccaccaccctgatcagtcggcaaccatc
accattgaggcaagaccctccaccagcaaaaagattatgacttgctga