GENSCAN 1.0	Date run:  8-Nov-116	Time: 13:31:03

Sequence gi568815578r:661029_865774 : 204746 bp : 48.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    234    229    6                              -1.75
 1.02 Term -   3433   2643  791  2  2  125   52  1429 0.938 136.48
 1.01 Init -  14573  14441  133  2  1  100  113   143 0.980  18.30
 1.00 Prom -  14811  14772   40                              -2.36

 2.00 Prom +  17615  17654   40                              -2.86
 2.01 Init +  20500  20576   77  0  2   58  119    54 0.602   6.06
 2.02 Intr +  31688  31792  105  2  0   95   95    67 0.630   7.43
 2.03 Term +  46340  46370   31  2  1  116   48    28 0.014  -1.07
 2.04 PlyA +  50411  50416    6                               1.05

 3.06 PlyA -  51368  51363    6                               1.05
 3.05 Term -  61680  61501  180  0  0   96   51    60 0.027   0.71
 3.04 Intr -  67291  67172  120  1  0   85   47    77 0.208   3.99
 3.03 Intr -  67776  67569  208  2  1   76   46   110 0.423   4.68
 3.02 Intr -  73329  73246   84  2  0   86   31    79 0.426   0.94
 3.01 Init -  74908  74901    8  1  2  114   57     0 0.374  -0.00
 3.00 Prom -  91669  91630   40                              -3.06

 4.05 PlyA -  91691  91686    6                               1.05
 4.04 Term - 100210  99998  213  1  0  137   48   250 0.997  23.13
 4.03 Intr - 100796 100658  139  1  1  138   56   120 0.891  14.27
 4.02 Intr - 102975 102470  506  0  2  132   69   539 0.987  48.18
 4.01 Init - 104746 104180  567  1  0  111   80   877 0.998  84.27
 4.00 Prom - 112255 112216   40                              -4.96

 5.03 PlyA - 115145 115140    6                               1.05
 5.02 Term - 115915 115681  235  0  1  120   49   168 0.957  11.89
 5.01 Init - 135750 135578  173  0  2  101   84   105 0.126  10.32
 5.00 Prom - 152443 152404   40                              -2.16

 6.00 Prom + 152990 153029   40                              -4.26
 6.01 Init + 155329 155380   52  0  1   57  110    26 0.486   3.03
 6.02 Intr + 163889 164062  174  0  0   54   70    82 0.217   2.91
 6.03 Intr + 172711 172923  213  2  0   87   94   125 0.780  11.69
 6.04 Term + 173299 173474  176  2  2  108   43    16 0.777  -2.98
 6.05 PlyA + 174994 174999    6                               1.05

 7.05 PlyA - 176913 176908    6                               1.05
 7.04 Term - 177409 177219  191  2  2  107   41    37 0.361  -1.49
 7.03 Intr - 178978 178586  393  2  0   -1   98   411 0.189  27.83
 7.02 Intr - 181121 180853  269  1  2   53   97   107 0.235   5.28
 7.01 Init - 181537 181479   59  1  2   93   75    47 0.911   3.20
 7.00 Prom - 182479 182440   40                              -7.06

 8.00 Prom + 183244 183283   40                             -15.08
 8.01 Sngl + 183777 184664  888  2  0   99   48   978 0.956  88.99
 8.02 PlyA + 185224 185229    6                               1.05

 9.02 PlyA - 186143 186138    6                               1.05
 9.01 Term - 204264 204080  185  1  2   58   45   128 0.551   3.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_1|307_aa
MPRSFLVKKIKGDGFQCSGVPAPTYHPLETAYVLPGARGPPGDNGYAPHRLPPSSYDADQ
KPGLELAPAEPAYPPAAPEEYSDPESPQSSLSARYFRGEAAVTDSYSMDAFFISDGRSRR
RRGGGGGDAGGSGDAGGAGGRAGRAGAQAGGGHRHACAECGKTYATSSNLSRHKQTHRSL
DSQLARKCPTCGKAYVSMPALAMHLLTHNLRHKCGVCGKAFSRPWLLQGHMRSHTGEKPF
GCAHCGKAFADRSNLRAHMQTHSAFKHYRCRQCDKSFALKSYLHKHCEAACAKAAEPPPP
TPAGPAS

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_1|924_bp
atgccgcgctccttcctggtaaagaagatcaaaggggacggcttccagtgcagcggggtg
ccggcccccacctaccaccccttggagacagcctacgtgctgcctggcgcccgggggcct
cccggggacaacgggtacgccccgcaccgcctgcccccgagcagctacgatgcggaccag
aagccgggcctggagctggccccggccgagcccgcgtacccgccggcggcgccggaggag
tacagcgaccccgaaagcccgcagtcgagcctgtcggcgcgctacttccgaggggaggcg
gcagtgaccgacagctactccatggacgccttcttcatctcggacgggcgctcgcggcgg
cggcggggcgggggcggcggggacgcggggggctcgggagacgcggggggcgccgggggg
cgcgcggggcgcgcgggggcgcaggcgggcggcgggcaccggcacgcgtgcgccgagtgc
ggcaagacctacgccacgtcgtcgaacctgagccgccacaagcagacgcaccgcagcctg
gacagccagctggcgcgcaaatgcccgacgtgcggcaaggcctacgtgtccatgcccgcg
ctcgccatgcacctgctcacgcacaacctgcgccacaagtgcggcgtctgcggcaaggcc
ttctcgcggccctggctgctgcagggtcacatgcgctcgcacaccggcgaaaagccgttc
ggctgcgcgcactgcggcaaggccttcgccgaccgctccaacctgcgcgcgcacatgcag
acgcactcggccttcaagcactaccgctgccgccagtgcgacaagagcttcgcgctcaag
tcctacctccacaagcactgcgaggcggcctgcgccaaggcggccgagccacccccgccg
acccccgccggcccggccagctga

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_2|70_aa
MEFCMNSKEESSHKCIRSTLMGLSLRCREERRKFFDRKILEILLLSPIGLLEAGVSPEVF
SPQEAYWSIT

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_2|213_bp
atggaattctgtatgaattccaaggaggagagtagccacaaatgcatccgctccacgctg
atggggctctctctccgttgcagagaggaaaggaggaaatttttcgatagaaaaatcttg
gagatcctgttgctgtcacccatcgggctgttggaagctggggttagtccagaagtcttc
agcccccaagaggcctactggagcataacttga

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_3|199_aa
MPGISASEFRKAEDAAHFGAKRAVVWDALAGEGCILTVCFQVHLQREAERPFLVRLATQR
GPAPDRAFKGTARLPCSQHFTLSSYRRATRCKVEKKVNERLARSRPEPPGRDQGTPKPAG
APPVCSPPPPSAADPARPGRGQPAKDTEKRRRKECRPWAPCHDQAEDPAAATMSHSSATE
TLFIPHSGSMSTGGRDLKV

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_3|600_bp
atgcccggaatttctgcttcagaattccgtaaggctgaagatgctgcacactttggagcc
aagagggcagttgtctgggatgctctggctggggagggctgcattctgacggtctgcttc
caagtccatctgcagcgcgaggctgagcgccctttcctggtgaggttggccactcagagg
ggcccagctcccgacagggcatttaaaggtacggcccgcttgccctgttctcagcacttc
accctcagctcgtaccgtagagccacccgctgtaaggttgagaaaaaggtgaacgagcga
ctggcccggagtcggcccgagccccctggacgggaccaggggacccccaagccggccggc
gccccgcccgtctgctccccacccccaccgtcagctgcggacccggcccggccgggaagg
gggcagcctgccaaggacacagagaagaggaggagaaaggaatgtaggccatgggcaccc
tgccatgaccaagctgaagaccccgcagcagccaccatgtcccacagctcagcaactgag
acactgttcattcctcactcaggttccatgtccactgggggccgggatctgaaggtctga

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_4|474_aa
MAFLMHLLVCVFGMGSWVTINGLWVELPLLVMELPEGWYLPSYLTVVIQLANIGPLLVTL
LHHFRPSCLSEVPIIFTLLGVGTVTCIIFAFLWNMTSWVLDGHHSIAFLVLTFFLALVDC
TSSVTFLPFMSRLPTYYLTTFFVGEGLSGLLPALVALAQGSGLTTCVNVTEISDSVPSPV
PTRETDIAQGVPRALVSALPGMEAPLSHLESRYLPAHFSPLVFFLLLSIMMACCLVAFFV
LQRQPRCWEASVEDLLNDQVTLHSIRPREENDLGPAGTVDSSQGQGYLEEKAAPCCPAHL
AFIYTLVAFVNALTNGMLPSVQTYSCLSYGPVAYHLAATLSIVANPLASLVSMFLPNRSL
LFLGVLSVLGTCFGGYNMAMAVMSPCPLLQGHWGGEVLIVSIRPVASWVLFSGCLSYVKV
MLGVVLRDLSRSALLWCGAAVQLGSLLGALLMFPLVNVLRLFSSADFCNLHCPA

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_4|1425_bp
atggccttcctgatgcacctgctggtctgcgtcttcggaatgggctcctgggtgaccatc
aatgggctctgggtagagctgcccctgctggtgatggagctgcccgagggctggtacctg
ccctcctacctcacggtggtcatccagctggccaacatcgggcccctcctggtcaccctg
ctccatcacttccggcccagctgcctttccgaagtgcccatcatcttcaccctgctgggc
gtgggaaccgtcacctgcatcatctttgccttcctctggaatatgacctcctgggtgctg
gacggccaccacagcatcgccttcttggtcctcaccttcttcctggccctggtggactgc
acctcttcagtgaccttcctgccgttcatgagccggctgcccacctactacctcaccacc
ttctttgtgggtgaaggactcagcggcctcttgcccgccctggtggctcttgcccagggc
tccggtctcactacctgcgtcaatgtcactgagatatcagacagcgtaccaagccctgta
cccacgagggagactgacatcgcacagggagttcccagagctttggtgtccgccctcccc
ggaatggaagcacccttgtcccacctggagagccgctaccttcccgcccacttctcaccc
ctggtcttcttcctcctcctatccatcatgatggcctgctgcctcgtggcgttctttgtc
ctccagcgtcaacccaggtgctgggaggcttccgtggaagacctcctcaatgaccaggtc
accctccactccatccggccgcgggaagagaatgacttgggccctgcaggcacggtggac
agcagccagggccaggggtatctagaggagaaagcagccccctgctgcccggcgcacctg
gccttcatctataccctggtggccttcgtcaacgcgctcaccaacggcatgctgccctct
gtgcagacctactcctgcctgtcctatgggccagttgcctaccacctggctgccaccctc
agcattgtggccaaccctcttgcctcgttggtctccatgttcctgcctaacaggtctctg
ctgttcctgggggtcctctccgtgcttgggacctgctttgggggctacaacatggccatg
gcggtgatgagcccctgccccctcttgcagggccactggggtggggaagtcctcattgtg
agtatccggccggtggcctcgtgggtgcttttcagcggctgcctcagttacgtcaaggtg
atgctgggcgtggtcctgcgcgacctcagccgcagcgccctcttgtggtgcggggcggcg
gtgcagctgggctcgctgctcggagcgctgctcatgttccctctggtcaacgtgctgcgg
ctcttctcgtccgcggacttctgcaatctgcactgtccagcctag

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_5|135_aa
MDNSYFFRQLKIIQCLGEQLLVNKVLTFTINETSFSPNFPKCMGHSKGIPGICIDVVLFF
ADLTPMAPPKPDVSTRTANPAPVAPPPKAIQPVGAQLQPPMISSPPQPISSKHLLPGHPQ
LFPYTAFEKPLTYKL

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_5|408_bp
atggacaatagctatttcttccggcaactgaagattattcaatgcttgggtgaacagctc
cttgtgaacaaggtcttgacctttactattaatgagacctcgttcagtccaaattttccc
aaatgtatgggccactccaaaggcatacctggaatctgtatagatgtggttctgtttttt
gcagatttgacacccatggctccacctaaacctgatgtctccaccaggaccgccaaccct
gctcctgtggccccccccccaaaagcgattcagcctgtaggagcacagcttcaaccccct
atgatttcatctccaccccaaccaatcagcagcaagcacctgttacctggccacccccaa
ctcttcccctatactgcctttgaaaaacccctcacctacaagctttga

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_6|204_aa
MWEKKAERGPSEGASTPGSVLKALPASFQRLDVTSYIVGEIIVPIKDGPTVWLSQWGMSS
DLECSHRKLGPILSEGTFPAVGAFDSEGHPLSSTAEGSSKGDPKGAHAGLGPPGRRWERP
LPARVRAVSLSKACARGAGTRLARAPVRGWTRLIWRLSVYGDRGPLFGDRTGLTFLGWMQ
GQQILQPPEPLLASWESSPVWGRE

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_6|615_bp
atgtgggagaagaaggcagagagagggccttctgagggggcctccacgccaggctctgtg
ctaaaagctttgccagcatcatttcagagacttgatgtgacatcttacatcgtaggggaa
attattgttcccattaaagatgggcccacagtgtggctaagccagtggggcatgtcctca
gacctggagtgctcccataggaagctgggcccgattctttctgaagggactttccctgcg
gtgggggccttcgacagtgagggccacccgctcagctcgaccgcggagggcagctccaaa
ggggaccccaaaggtgcccacgcggggctggggcctcctgggcgtcgttgggagcggcca
ctaccggcccgggtccgagctgtcagcctctccaaagcctgcgcgagaggagccgggaca
cgcctagcgcgggctccagtccggggttggactcggctcatttggcgcctttctgtctac
ggggacagaggaccactctttggggatcggacgggactgacatttctgggctggatgcag
gggcagcagattctccagcccccagagccgctgctggcctcttgggaatcatccccagtt
tggggaagggaatag

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_7|303_aa
MEWMGKEASLHQGRPPGDLRLGYLIRETRLKIDLPEPPGHDKDGEIKQVEGFPRHLAKRR
SHEPGSWSSRRFRLTKSTDHPHAGVSKQAPRGSVPAHTADKAGPRQRLIAPAPDPTAAKM
LMPKKNRIAIHELLFKEGVMVAKKDVHMPKHPELADKNVPNLHVMKAMQSLKSRGCVKEQ
FAWRHFYWYLTNEGSQYLRDYLHLPPEIVPATLHLPPEIVPATLHRSRPETGRPRPKGLE
GTKHEEIPLNNRLFWTQPTSKAKATHQEVYIKCIKLFLISVRKREERKRGRKKGEKVKHI
CQE

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_7|912_bp
atggagtggatgggaaaggaagcctcgctccaccagggccgacccccaggagacctcaga
ctcggttaccttatccgtgaaacgaggttaaaaatagaccttcctgagcctccaggtcac
gataaggatggagagatcaaacaggtcgagggctttcctcggcatctggcaaaaaggcgg
agccacgaaccaggctcctggtcttcaagaagatttcgactaaccaagtccacagaccac
ccccacgcaggggtctctaagcaagccccacgagggtctgtcccagcccacaccgcggac
aaagcaggcccaagacagaggctgatcgccccagccccggaccctacagctgccaagatg
ctgatgcctaagaagaaccggattgccattcatgaactcctttttaaggagggagtcatg
gtggccaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgtgccc
aaccttcacgtcatgaaggccatgcagtctctcaagtcccggggctgcgtgaaggaacag
tttgcctggagacatttctactggtaccttaccaatgagggtagccagtatctccgtgat
taccttcatctgcccccagagattgttcctgccaccctacatctgcccccggagattgtt
cctgccaccctacaccgcagccgtccagagactggcaggcctcggcctaaaggtctggag
gggaccaaacacgaggaaataccgttgaacaaccggctgttctggacccagcccacctca
aaggccaaggccactcatcaagaggtatatattaaatgtataaagctttttcttatatca
gtaagaaagagagaagaaagaaagagaggaagaaagaaaggagagaaagttaaacatatc
tgtcaagaataa

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_8|295_aa
MPVHTLSPGAPSAPALPCRLRTRVPGYLLRGPADGGARKPSAVERLEADKAKYVKSLHVA
NTRQEPVQPLLSKQPLFSPETRRTVLTPSRRALPGPCRRPQLDLDILSSLIDLCDSPVSP
AEASRTPGRAEGAGRPPPATPPRPPPSTSAVRRVDVRPLPASPARPCPSPGPAAASSPAR
PPGLQRSKSDLSERFSRAAADLERFFNFCGLDPEEARGLGVAHLARASSDIVSLAGPSAG
PGSSEGGCSRRSSVTVEERARERVPYGVSVVERNARVIKWLYGLRQARESPAAEG

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_8|888_bp
atgcctgtgcacacgctgagccccggagccccgtccgcccccgccctaccttgccgcctg
cggaccagggtccctggctacctgctacgggggccggcagatggtggagcccggaaaccg
agcgctgtggagcgcctggaggccgacaaggccaagtacgtcaagagcctgcacgtggcc
aacacccgccaggagcctgtgcagcccctgctgtccaaacagccgctctttagccctgag
actcgccgcacagtgctcacgcccagccgccgagccctgcctggcccctgccgacggccc
cagctggacctggacatcctcagcagcctcatcgacttgtgtgacagccccgtgtcccct
gccgaggccagccgcactcctggacgggccgagggagccggccgtcctcccccagccacc
cctccgcgaccgccgcccagtacctctgcggtccgccgggtggacgtccgccccctgccc
gcctcgcctgcccggccctgcccatcacccggccctgccgccgcctccagcccagcccgg
ccgccgggtttgcaacgctccaagtcggacttgagcgagcgcttttctagggcagccgct
gatctcgagcgcttttttaacttctgcggcctggacccggaggaggcgagagggttgggt
gtggcccacctggcacgggccagctcggatatcgtgtccctggcagggcccagtgctggg
ccgggcagctctgaagggggctgctcccgccgcagctcggtgactgttgaggagcgggcc
cgggagcgcgttccctatggcgtgtcggtggtggagcgcaatgcccgcgtgatcaagtgg
ttgtatgggctaaggcaggctcgggagagcccagcagctgaaggctag

>gi568815578r:661029_865774|GENSCAN_predicted_peptide_9|61_aa
XAQMRMCVLRFNTGTYSATTDVSSGPSQQPGITHVPTEDVFGIAFCCFSAAFLLLRPHEL
A

>gi568815578r:661029_865774|GENSCAN_predicted_CDS_9|186_bp
ngtgcccagatgcgcatgtgtgtgctgaggttcaacactggaacttattcagctacaaca
gatgtgtcctctgggccctcgcagcaacctggtataacacacgtgcctacagaggatgtg
tttggcatcgcattctgctgcttttctgctgcttttctgcttcttcgaccacatgaattg
gcttag