GENSCAN 1.0    Date run: 3-Nov-116    Time: 04:15:11

Sequence gi568815581r:49198416_49417725 : 219310 bp : 46.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    489    484    6                               1.05
 1.02 Term -   8467   8342  126  1  0   91   45   138 0.798   8.08
 1.01 Init -   9007   8924   84  1  0   96  105   156 0.998  18.92
 1.00 Prom -   9395   9356   40                             -11.82

 2.00 Prom +  10653  10692   40                              -7.96
 2.01 Init +  12310  12426  117  0  0   90   96   202 0.938  21.40
 2.02 Intr +  12692  12800  109  1  1   89   68     5 0.417  -1.54
 2.03 Intr +  15458  15566  109  0  1   70   95    37 0.504   1.94
 2.04 Intr +  18058  18283  226  1  1   80  110   300 0.531  29.39
 2.05 Intr +  19324  19500  177  0  0   74   75   282 0.965  25.62
 2.06 Intr +  21125  21210   86  1  2  109   60    51 0.963   3.02
 2.07 Intr +  21443  21538   96  2  0   98   80   117 0.999  11.02
 2.08 Intr +  21826  21911   86  1  2   65   90    82 0.806   5.56
 2.09 Intr +  23676  23810  135  1  0   72   77    90 0.960   6.84
 2.10 Term +  24137  24300  164  0  2  100   48   243 0.999  19.60
 2.11 PlyA +  24581  24586    6                               1.05

 3.02 PlyA -  24972  24967    6                              -3.94
 3.01 Sngl -  26580  25831  750  0  0   96   47  1468 0.431 139.68
 3.00 Prom -  29664  29625   40                              -3.56

 4.11 PlyA -  29915  29910    6                              -0.45
 4.10 Term -  32503  32284  220  0  1   48   55   128 0.106   2.01
 4.09 Intr -  59494  59348  147  0  0  109   95   -10 0.002   1.15
 4.08 Intr -  92158  91992  167  1  2  125   66    34 0.308   3.66
 4.07 Intr - 100509 100217  293  1  2   90   69   251 0.679  20.25
 4.06 Intr - 103365 103232  134  2  2   72   70   -47 0.199  -7.81
 4.05 Intr - 107274 107220   55  1  1   84   98   100 0.720   8.64
 4.04 Intr - 113041 112897  145  1  1   46   99   202 0.991  16.96
 4.03 Intr - 113627 113512  116  0  2   68   94    12 0.959  -0.03
 4.02 Intr - 114430 114283  148  1  1   72  111   122 0.984  12.71
 4.01 Init - 119247 118411  837  0  0   73   96   704 0.921  64.40
 4.00 Prom - 121979 121940   40                              -5.76

 5.03 PlyA - 123198 123193    6                               1.05
 5.02 Term - 124622 124468  155  0  2   51   53    87 0.783  -0.42
 5.01 Init - 125019 124923   97  0  1   58   57   114 0.836   5.77
 5.00 Prom - 128715 128676   40                              -3.46

 6.03 PlyA - 128758 128753    6                               1.05
 6.02 Term - 162928 162919   10  0  1  100   49     6 0.374  -4.43
 6.01 Init - 163546 163494   53  1  2   84  100   119 0.774  11.33
 6.00 Prom - 191879 191840   40                              -1.96

 7.03 PlyA - 192320 192315    6                               1.05
 7.02 Term - 196787 196702   86  0  2   74   49    63 0.304  -1.18
 7.01 Init - 202355 202064  292  2  1   49   91   141 0.409   7.81
 7.00 Prom - 204676 204637   40                              -7.66

 8.07 PlyA - 205665 205660    6                               1.05
 8.06 Term - 206789 206577  213  2  0   93   55   434 0.916  37.83
 8.05 Intr - 208439 208344   96  2  0   99  105    95 0.999  12.51
 8.04 Intr - 210743 210627  117  2  0   88   91   120 0.998  12.96
 8.03 Intr - 211058 210916  143  0  2   55   89   288 0.997  25.67
 8.02 Intr - 213424 213263  162  2  0   85  113   116 0.991  13.75
 8.01 Init - 214849 214762   88  1  1  104   77    50 0.975   6.34

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_1|69_aa
MAQDLSEKDLLKMEVEQLKKEVKNTRIPISKAGKEIKEYVEAQAGNDPFLKGIPEDKNPF
KEKGGCLIS

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_1|210_bp
atggcccaggatctcagcgagaaggacctgttgaagatggaggtggagcagctgaagaaa
gaagtgaaaaacacaagaattccgatttccaaagcgggaaaggaaatcaaggagtacgtg
gaggcccaagcaggaaacgatccttttctcaaaggcatccctgaggacaagaatcccttc
aaggagaaaggtggctgtctgataagctga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_2|434_aa
MAELQQLQEFEIPTGREALRGNHSALLRVADYCEDNYVQVSPQVHPSEFPRVNPETQSLR
PHTDAGDHSISRPALVKEGPELNQNLLSCLVLISWVDFFAYSIFLVTYAEEIQPRTQWKM
LALRPGLCVFQATDKRKALEETMAFTTQALASVAYQVGNLAGHTLRMLDLQGAALRQVEA
RVSTLGQMVNMHMEKVARREIGTLATVQRLPPGQKVIAPENLPPLTPYCRRPLNFGCLDD
IGHGIKDLSTQLSRTGTLSRKSIKAPATPASATLGRPPRIPEPVHLPVVPDGRLSAASSA
FSLASAGSLDPPPPPAAVEVFQRPPTLEELSPPPPDEELPLPLDLPPPPPLDGDELGLPP
PPPGFGPDEPSWVPASYLEKVVTLYPYTSQKDNELSFSEGTVICVTRRYSDGWCEGVSSE
GTGFFPGNYVEPSC

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_2|1305_bp
atggcggagctacagcagctgcaggagtttgagatccccactggccgggaggctctgagg
ggcaaccacagtgccctgctgcgggtcgctgactactgcgaggacaactatgtgcaggtg
tcacctcaggttcatccttctgagttcccacgggtcaatccagagacccaaagcctccgt
cctcacacagatgctggcgatcattctatttccaggccggctctggtaaaggaaggtcct
gaactaaatcaaaatctgctctcctgtctggtgttaatatcctgggttgatttctttgcc
tattcaatcttcttggtgacctatgctgaggagatccagccaaggacccagtggaagatg
ctggcccttaggccagggctgtgtgtatttcaggccacagacaagcggaaggcgctggag
gagaccatggccttcactacccaggcactggccagcgtggcctaccaggtgggcaacctg
gccgggcacactctgcgcatgttggacctgcagggggccgccctgcggcaggtggaagcc
cgtgtaagcacgctgggccagatggtgaacatgcatatggagaaggtggcccgaagggag
atcggcaccttagccactgtccagcggctgccccccggccagaaggtcatcgccccagag
aacctaccccctctcacgccctactgcaggagacccctcaactttggctgcctggacgac
attggccatgggatcaaggacctcagcacgcagctgtcaagaacaggcaccctgtctcga
aagagcatcaaggcccctgccacacccgcctccgccaccttggggagaccaccccggatt
cccgagccagtgcacctgccggtggtgcccgacggcagactctccgccgcctcctctgcg
ttttccctggcctcggccggctccttggacccacctcctccaccagcagccgtcgaggtg
ttccagcggcctcccacgctggaggagttgtccccacccccaccggacgaagagctgccc
ctgccactggacctgcctcctcctccacccctggatggagatgaattggggctgcctcca
cccccaccaggatttgggcctgatgagcccagctgggtgcctgcctcatacttggagaaa
gtggtgacactgtacccatacaccagccagaaggacaatgagctctccttctctgagggc
actgtcatctgtgtcactcgccgctactccgatggctggtgcgagggcgtcagctcagag
gggactggattcttccctgggaactatgtggagcccagctgctga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_3|249_aa
MAAQGAPRFLLTFDFDETIVDENSDDSIVRAAPGQRLPESLRATYREGFYNEYMQRVFKY
LGEQGVRPRDLSAIYEAIPLSPGMSDLLQFVAKQGACFEVILISDANTFGVESSLRAAGH
HSLFRRILSNPSGPDARGLLALRPFHTHSCARCPANMCKHKVLSDYLRERAHDGVHFERL
FYVGDGANDFCPMGLLAGGDVAFPRRGYPMHRLIQEAQKAEPSSFRASVVPWETAADVRL
HLQQVLKSC

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_3|750_bp
atggccgcgcagggcgcgccgcgcttcctcctgaccttcgacttcgacgagactatcgtg
gacgaaaacagcgacgattcgatcgtgcgcgccgcgccgggccagcggctcccggagagc
ctgcgagccacctaccgcgagggcttctacaacgagtacatgcagcgcgtcttcaagtac
ctgggcgagcagggcgtgcggccgcgggacctgagcgccatctacgaagccatccctttg
tcgccaggcatgagcgacctgctgcagtttgtggcaaaacagggcgcctgcttcgaggtg
attctcatctccgatgccaacacctttggcgtggagagctcgctgcgcgccgccggccac
cacagcctgttccgccgcatcctcagcaacccgtcggggccggatgcgcggggactgctg
gctctgcggccgttccacacacacagctgcgcgcgctgccccgccaacatgtgcaagcac
aaggtgctcagcgactacctgcgcgagcgggcccacgacggcgtgcacttcgagcgcctc
ttctacgtgggcgacggcgccaacgacttctgccccatggggctgctggcgggcggcgac
gtggccttcccgcgccgcggctaccccatgcaccgcctcattcaggaggcccagaaggcc
gagcccagctcgttccgcgccagcgtggtgccctgggaaacggctgcagatgtgcgcctc
cacctgcaacaggtgctgaagtcgtgctga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_4|753_aa
MAQEDSRRGQVPSSFYHGANQELDLSTKVYKRESGSPYSVLVDTKMSKPHLHETEEQPYF
RETRAVSDVHAVKEDRENSDDTEEEEEEVSYKREQIIVEVNLNNQTLNVSKGEKGVSSQS
KETPVLKTSSEEEEEESEEEATDDSNDYGENEKQKKKEKIVEKVSVTQRRTRRAASVAAA
TTSPTPRTTRGRRKSVEPPKRKKRATKEPKAPVQKAKCEEKETLTCEKCPRVFNTRWYLE
KHMNVTHRRMQICDKCGKKFVLESELSLHQQTDCEKNIQCVSCNKSFKKLWSLHEHIKIV
HGYAEKKFSCEICEKKFYTMAHVRKHMVAHTKDMPFTCETCGKSFKRSMSLKVHSLQHSG
EKPFRCENCDERFQYKYQLRSHMSIHIGHKQFMCQWCGKDFNMKQYFDEHMKTHTGAYSL
VNGNDGVDDDDNSSAGITGLKQHTLPEVEVFKTVRGNRLKRRKTIWAQNSSRKMNMSHRE
KPFICEICGKSFTSRPNMKRHRRTHTGEKPYPCDVCGQRFRFSNMLKAHKEKCFRVTSPV
NVPPAVQIPLTTSPATPVPSVVNTATTPTPPINMNPATTITSIGGTRALGELWSMSKEML
LTFSAPKRYLLHQGTSPLGQALPPDQCNGHSSFWPHSHFDPLILTCPLAHGYVRNNGTHF
TGAKPKEKSEVANGTPQNTRCWRSRSLRGRREAWPRAARPFDVRRHRCDWTVACDVGDCG
GLRCCSSRSAGRGSGSGSGSRAFKGDAAAARGG

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_4|2262_bp
atggcacaagaagatagccgtcgtggtcaagtgccatcttccttttatcatggtgccaac
caagaacttgacctgtccaccaaagtgtacaaaagggaatcaggaagtccttattctgtg
ttagtggacaccaagatgagcaaaccgcatctccatgaaacagaagaacagccatatttc
agggagacaagagcagtgtctgacgtgcatgctgttaaggaagaccgggagaattctgat
gacacagaggaggaagaggaagaagtctcttacaaaagggagcagatcatagtggaggta
aaccttaataatcaaacattaaatgtatctaaaggggaaaagggtgtctcttctcagtcc
aaagagactcctgttcttaagacaagcagtgaggaggaagaggaagagagtgaggaagag
gccacagatgacagcaatgactatggagagaatgaaaagcagaagaaaaaggagaagata
gtagagaaagtcagcgttacacaaaggagaaccaggagagctgcctctgttgccgcagct
accacttcccctactcccagaactacaagaggtcgtaggaagagtgtagagccacctaag
cgtaagaagcgggccacaaaggagcccaaagcaccagtccagaaagctaagtgtgaagag
aaagagactctgacctgtgagaagtgccccagggtatttaacactcgctggtacctggag
aagcacatgaacgttactcataggcgcatgcagatttgtgataaatgtggcaagaagttt
gtcctggaaagtgagctgtcccttcaccagcaaacagactgtgaaaaaaacattcagtgt
gtttcctgtaacaaatcgttcaagaaactctggtcccttcatgaacatatcaagatcgtc
catggatatgcagaaaagaaattttcctgtgaaatttgtgagaagaaattctataccatg
gctcatgtgcggaaacacatggttgcacacacaaaagacatgccatttacatgcgaaacc
tgtggaaaatcattcaaacgcagtatgtcactcaaggtgcactccttgcagcattctgga
gagaagccctttagatgcgagaactgtgacgaaaggtttcagtacaagtaccagctacgc
tcccacatgagcattcatattgggcacaaacagttcatgtgccagtggtgtggcaaggat
ttcaacatgaagcagtacttcgacgaacacatgaaaacacacactggagcttatagtctg
gtgaatggtaatgatggtgttgatgatgatgataatagcagtgctgggattacaggcttg
aaacaacacaccctgccagaagttgaagtttttaagacggttagaggaaatcgattaaaa
agaaggaaaacaatctgggctcagaattcatccagaaagatgaatatgagtcacagagag
aaaccctttatctgtgaaatctgtggcaaaagcttcaccagccgccccaacatgaagaga
caccgcagaactcacacaggcgagaagccctatccatgtgatgtgtgtggccagcggttc
cgcttctcgaacatgcttaaggcccacaaggagaagtgctttcgggtgaccagccccgtg
aatgtgccacctgctgtccagatcccacttacaacttccccagccaccccagttccttct
gtggtgaacacagccacaaccccaacccctccaatcaatatgaatcctgccaccaccatc
acatctataggaggaacaagagcactgggggaactctggagtatgagtaaggaaatgctt
ctcaccttctctgctccaaagagatatctgttacatcagggaacaagtcctctaggtcag
gcacttcctcctgaccagtgcaacgggcactccagcttctggcctcatagccactttgac
cccttgattctgacatgtcctctggctcatgggtatgtcagaaataatggcacccatttt
acaggtgcaaaaccaaaggaaaaaagtgaagtggccaatggcacaccacaaaatacaaga
tgctggcgctcccggagcctccggggcaggagggaggcgtggcctcgggcggcccgcccc
tttgatgtgcgccggcaccgctgcgattggacagtcgcttgtgacgttggggactgcggt
gggctccgctgctgcagcagccgcagcgccggccgcggctccggctccggctccggctcc
cgggcatttaaaggggacgcggcggctgcccgggggggatga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_5|83_aa
MDDFEGFNTSVEEVTADVVEIATELELEVEPEEIATATPAFSNHHPDHAEAINIEARPPI
GKMIDSLKDQIIANIFNIYLFIY

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_5|252_bp
atggatgactttgagggtttcaacacttcagtggaggaagtcactgcagatgtggtggaa
atagcaacagaactagaattagaagtggagcctgaagaaattgccacagccaccccagcc
ttcagcaaccaccaccctgatcatgcagaagccatcaacattgaagcaagaccccccatc
ggcaaaatgattgattcactgaaggatcagataatcgctaacatttttaacatttattta
tttatttattga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_6|20_aa
MDRRGGGRGGGGGGGRPGLC

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_6|63_bp
atggaccgccgaggcggcggccggggcggcggaggcggaggcggccggcccgggctctgc
tga

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_7|125_aa
MGAVQKGIPHKHFHGKTQKSLQCYQQAVGIVGNKGKNLVKRMNVLIEHIKHSESWNRFLK
HIKINDQKKKDAKEKSMWVQLKRQPAPPREAHIVKTNDGSNLPVTSRASLCQPAQDVQKN
IPGKD

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_7|378_bp
atgggtgctgttcaaaaaggaattccccacaaacatttccatggcaaaactcaaaagagt
ctacagtgttaccagcaagctgttggcattgttggaaacaagggcaagaatcttgtcaag
agaatgaatgtacttattgagcacattaagcactctgagagctggaatcgcttcctgaaa
catatcaaaataaatgatcagaaaaagaaagacgccaaagagaaaagtatgtgggttcaa
ctgaagcgccagcctgctccacccagagaagcacacattgtgaaaaccaatgatggcagc
aacctgcctgttactagccgtgccagcctgtgtcaaccagcacaagatgttcagaaaaat
atcccggggaaagactag

>gi568815581r:49198416_49417725|GENSCAN_predicted_peptide_8|272_aa
MAAKVFESIGKFGLALAVAGGVVNSALYNVDAGHRAVIFDRFRGVQDIVVGEGTHFLIPW
VQKPIIFDCRSRPRNVPVITGSKDLQNVNITLRILFRPVASQLPRIFTSIGEDYDERVLP
SITTEILKSVVARFDAGELITQRELVSRQVSDDLTERAATFGLILDDVSLTHLTFGKEFT
EAVEAKQVAQQEAERARFVVEKAEQQKKAAIISAEGDSKAAELIANSLATAGDGLIELRK
LEAAEDIAYQLSRSRNITYLPAGQSVLLQLPQ

>gi568815581r:49198416_49417725|GENSCAN_predicted_CDS_8|819_bp
atggctgccaaagtgtttgagtccattggcaagtttggcctggccttagctgttgcagga
ggcgtggtgaactctgccttatataatgtggatgctgggcacagagctgtcatctttgac
cgattccgtggagtgcaggacattgtggtaggggaagggactcattttctcatcccgtgg
gtacagaaaccaattatctttgactgccgttctcgaccacgtaatgtgccagtcatcact
ggtagcaaagatttacagaatgtcaacatcacactgcgcatcctcttccggcctgtcgcc
agccagcttcctcgcatcttcaccagcatcggagaggactatgatgagcgtgtgctgccg
tccatcacaactgagatcctcaagtcagtggtggctcgctttgatgctggagaactaatc
acccagagagagctggtctccaggcaggtgagcgacgaccttacagagcgagccgccacc
tttgggctcatcctggatgacgtgtccttgacacatctgaccttcgggaaggagttcaca
gaagcggtggaagccaaacaggtggctcagcaggaagcagagagggccagatttgtggtg
gaaaaggctgagcaacagaaaaaggcggccatcatctctgctgagggcgactccaaggca
gctgagctgattgccaactcactggccactgcaggggatggcctgatcgagctgcgcaag
ctggaagctgcagaggacatcgcgtaccagctctcacgctctcggaacatcacctacctg
ccagcggggcagtccgtgctcctccagctgccccagtga