GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:30:07 Sequence gi568815592r:139273135_139473944 : 200810 bp : 41.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6718 6820 103 0 1 58 44 138 0.243 6.85 1.02 Intr + 9970 10032 63 2 0 88 131 -8 0.092 1.27 1.03 Term + 13711 13724 14 2 2 125 33 35 0.088 -0.91 1.04 PlyA + 13830 13835 6 1.05 2.06 PlyA - 14787 14782 6 1.05 2.05 Term - 15779 15334 446 0 2 39 33 580 0.156 42.01 2.04 Intr - 19327 19182 146 0 2 20 42 142 0.148 1.81 2.03 Intr - 23551 23471 81 0 0 60 84 92 0.297 3.83 2.02 Intr - 24102 23920 183 2 0 22 45 142 0.306 1.28 2.01 Init - 25893 25769 125 0 2 50 49 153 0.624 7.29 2.00 Prom - 32398 32359 40 -6.85 3.00 Prom + 36651 36690 40 -3.65 3.01 Init + 47630 47851 222 1 0 74 22 134 0.508 4.00 3.02 Term + 57242 57463 222 1 0 54 48 165 0.851 5.03 3.03 PlyA + 57926 57931 6 1.05 4.00 Prom + 59982 60021 40 -4.65 4.01 Init + 63646 63682 37 0 1 83 102 30 0.476 4.26 4.02 Intr + 70950 71098 149 1 2 86 30 87 0.027 1.73 4.03 Intr + 76395 76542 148 2 1 3 80 120 0.035 1.59 4.04 Term + 82783 82955 173 2 2 83 49 159 0.740 8.51 4.05 PlyA + 83965 83970 6 1.05 5.00 Prom + 85743 85782 40 -5.25 5.01 Sngl + 88576 88989 414 0 0 88 45 165 0.667 8.25 5.02 PlyA + 89664 89669 6 1.05 6.08 PlyA - 90733 90728 6 1.05 6.07 Term - 100818 99998 821 1 2 115 41 1016 0.849 91.88 6.06 Intr - 101118 100923 196 0 1 3 22 57 0.354 -11.33 6.05 Intr - 101422 101279 144 1 0 4 121 306 0.755 25.06 6.04 Intr - 101676 101474 203 1 2 57 40 117 0.759 1.88 6.03 Intr - 104414 104294 121 2 1 41 93 79 0.490 2.85 6.02 Intr - 105927 105826 102 0 0 81 109 95 0.777 10.35 6.01 Init - 110163 110125 39 0 0 108 100 12 0.513 2.74 6.00 Prom - 121346 121307 40 -1.15 7.02 PlyA - 122641 122636 6 1.05 7.01 Sngl - 130157 129939 219 2 0 59 54 154 0.423 4.01 7.00 Prom - 132906 132867 40 -4.75 8.00 Prom + 135597 135636 40 -6.85 8.01 Sngl + 135689 135874 186 1 0 75 45 211 0.939 10.33 8.02 PlyA + 136461 136466 6 1.05 9.00 Prom + 138138 138177 40 -3.75 9.01 Init + 149579 149743 165 1 0 70 45 86 0.365 2.18 9.02 Intr + 153715 153856 142 0 1 65 -22 148 0.535 0.51 9.03 Intr + 153919 154095 177 2 0 114 87 46 0.542 6.07 9.04 Term + 154818 154999 182 1 2 39 55 125 0.559 1.09 9.05 PlyA + 155353 155358 6 1.05 10.00 Prom + 157338 157377 40 -9.25 10.01 Init + 163284 163326 43 2 1 98 82 76 0.952 8.64 10.02 Term + 163668 163816 149 1 2 46 48 135 0.928 2.38 10.03 PlyA + 166035 166040 6 1.05 11.03 PlyA - 166952 166947 6 1.05 11.02 Term - 170742 170662 81 0 0 67 42 123 0.689 2.31 11.01 Intr - 199461 199393 69 0 0 101 87 76 0.953 7.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 15765 15334 432 0 0 99 33 564 0.827 48.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_1|59_aa MLSVVGSLHESGTTCVADTALLHKAVVTGTVLLEDAKGKVTLPTREIFKYSPNCSDGTV >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_1|180_bp atgctcagcgtggtaggaagtctacatgaaagtggaaccacctgtgtagcagacacagcg ctgttgcataaggctgtggtcactggcacagttttattggaagatgccaaggggaaagtg actttacccacgagagagatttttaaatattcccccaactgctccgatgggaccgtctag >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_2|326_aa MVPASPSGESFRKFPLMVEGEGEQASHGKREEEGENMEEGTSKEEEEKRQKKGSSNVREA RRIESINPKPQRMRTRTQYIVVNDSTGSVLHRDELDVSGTGKQIPEDCSLKKLNQKGSEL SDMRLRGGQGNLTLIKLVQEEEEKEDESSTTLEITQEVRKTDLYPKRTTMVKDCICNGDT TPKMEANHSEQLSAERQSTPPGDSSSLPSHNGLEKEDGQDSPTPVQPPEKEASVHPDISE ELNRQLEDIINTYGSAASTAGKEGSARASEQPENAESPDNEDGDCEETTEEAGREPVASG EPPTVKEPVSNKEQKLEKKILKGLGK >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_2|981_bp atggtgccagcatctccttctggggagagtttcaggaagtttccactcatggtggaaggt gaaggggagcaggcatcacatggcaagagggaggaagaaggagagaacatggaggaaggt accagtaaagaagaagaagaaaaaaggcagaaaaagggaagctctaatgttagggaagcc agaagaattgaatccataaaccccaaaccacaacggatgaggactagaacacaatacata gtggtaaatgattcaacaggaagtgtgcttcacagggatgagttggatgtaagtgggaca ggcaagcagatacctgaggattgttctctgaagaagcttaaccagaaaggctctgaactg agcgatatgaggctcagaggaggacaaggaaacctgacactaattaagctagtgcaagaa gaagaagaaaaagaagatgaatcttccaccacactggagataacccaagaagtaagaaaa actgatctctacccaaaaaggacaacaatggtgaaagactgcatttgcaacggggatact actcccaagatggaggctaatcactctgaacagctctcagcggaacgacagtcaacacct ccaggtgacagttcatcattacccagtcacaatggcctggagaaggaagatggccaggat tctccaaccccagtccaaccaccagagaaagaggcaagtgtgcaccccgatatctctgaa gagctgaatcgacagctggaagacatcattaacacttatgggtctgctgccagcacagca gggaaagagggctctgccagggccagtgagcagcctgagaatgcagaatcacctgacaac gaggatggggactgtgaggaaacaactgaagaggctggaagagaacccgttgcttctgga gagccacccactgtcaaagagcccgtcagcaataaggagcaaaaattggaaaagaaaatc ctaaaaggattaggcaagtag >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_3|147_aa MILTQQGDFRVYISPMLHRHKSLGNTSKKALLPFPTFSFFLPEADVFVRATVAIFHSKAT VKMKAMGLGQHSREEFVEPISTLHSLIGSDIVQPPNLSQAGMLTFADFWEKVCFPAEQEA SWNGMPSAAALFCFHEEPKTESGTGEV >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_3|444_bp atgattctcactcaacaaggtgacttcagagtctatatctctcccatgctacataggcac aagtcattgggtaacacttctaagaaagctcttctgccctttcccactttctccttcttc ttacctgaagcagatgtatttgttagagctacagtggccatattccactctaaggcaact gtgaagatgaaagccatgggcttgggacagcacagcagagaagaatttgtggaaccaatc tctacgttgcattctctgattggttcagacattgtccagccacccaatctgagtcaagca ggtatgctgacattcgctgatttctgggaaaaggtctgcttcccagctgaacaagaagcc tcatggaatggaatgcctagtgcagctgccctattttgcttccatgaagagccaaagaca gagagcggcacaggggaagtctga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_4|168_aa MNLAVAAVAVRQGSCTASWNITGAAGELAPSSYWGYLPCMWRARAWTCLTQPPPGFAPPL ALWGSTPVLKEEGSVYVSPKTFSVGTGSSSSSSEFGMIDGGGWSGSNLLTQTHCILDRAF TTCLSPARQNTRMHKFYEAGLLPTDRQQGTTDAKDAGQAGPSGLRKAT >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_4|507_bp atgaatttggcagtggctgctgtagccgtgaggcaaggcagctgcactgcttcctggaat attaccggagcagccggagaactggcccccagctcctactggggttatttgccctgcatg tggagagccagagcatggacctgcctgacccagcccccacctggctttgctccaccactt gccctgtgggggtcgacgccagttctgaaagaagaaggaagtgtttatgtgagtcctaag acattctcagttggaactggaagtagttcttccagttctgagtttggcatgattgatgga ggtggttggtctggttctaacctgttaacacaaacccactgcatcttggaccgcgctttt actacttgcctgagtccagcaagacagaacactcgcatgcacaagttttatgaagcgggt ttattacccacagataggcagcaaggcacaacagatgccaaggatgcagggcaagctggt ccctcaggacttaggaaagctacctag >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_5|137_aa MAAQIHHFSWPELVQGLCSIAYNTWGFHREDATDAGTRQLDPGSTWRQSIVAITWEPWLL LLVDAPTHGFSLWHGFLTAWPCQGSWTSYIVAQSSRGVCPKRTKGKLCGVFLFSLESYTG RESISEPQIQGRGLMLF >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_5|414_bp atggctgcccagattcaccatttcagttggcctgagctagtgcaaggactttgctccata gcgtataatacctgggggttccacagggaagatgctacagatgctggcactcgacagctg gaccctggaagcacctggaggcagtcgatagtggctatcacctgggaaccttggctgctg ctgctggtggacgcacctacacatggcttttccctgtggcatggcttccttacagcatgg ccgtgtcagggtagctggacttcttatattgtagcccagagctccagaggcgtgtgtccc aagagaaccaaggggaagctatgcggtgttttcttattcagtcttgaaagttacacagga agagagtccataagtgaacctcagattcagggaagaggcctaatgctgttttga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_6|541_aa MVMPVISVLWEAKVLAKARDSPLKTCFSQKFTPMGTFVHYSAVSDAQLWEMNTTIQCCEG HRQITCSLTQVHGINPGYTELLGALGYSSAARAAGGGDAAPGPGPGCSRCSGIRVLAPPA LEALAPPFRNAYINCGQSPKKHSSLLAAAGRSCRARPLRDLGADIATAKRFARQEGPLYV LLSRSWTRRARPRSSEQKSQKRKGYGALPLLGHFGEQRGCGLGSPRPIAGEGAVCAASGC VMSGSSSVVASPGAQAVSGILGVVVRSRRLEMADHMMAMNHGRFPDGTNGLHHHPAHRMG MGQFPSPHHHQQQQPQHAFNALMGEHIHYGAGNMNATSGIRHAMGPGTVNGGHPPSALAP AARFNNSQFMGPPVASQGGSLPASMQLQKLNNQYFNHHPYPHNHYMPDLHPAAGHQMNGT NQHFRDCNPKHSGGSSTPGGSGGSSTPGGSGSSSGGGAGSSNSGGGSGSGNMPASVAHVP AAMLPPNVIDTDFIDEEVLMSLVIEMGLDRIKELPELWLGQNEFDFMTDFVCKQQPSRVS C >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_6|1626_bp atggtcatgcctgtaatctcagtgctttgggaggccaaggttttagctaaagccagagac tcacctctaaaaacttgtttcagtcaaaagttcacaccaatggggacctttgtccattac tcagcagtcagtgatgcccagctctgggaaatgaatacaaccattcagtgctgtgagggc cacagacagatcacttgctcgctcacccaggttcacgggataaaccctggttatacggaa cttctgggagccctgggttactcctctgcagcacgtgccgcgggcggcggggacgcggct ccgggacccggtccagggtgttcgcggtgttccggaatccgcgtcttggcgccgcccgcc ctggaggctctcgctccgcctttccgaaatgcctatattaactgtggccaaagccctaag aaacacagctcattgttggcagctgccgggcggtcctgccgagctcggccgctgcgagac ctcggcgccgacatcgcgacagcgaagcgctttgcacgccaggaaggtcccctctatgtg ctgctgagccggtcctggacgcgacgagcccgccctcggtcttcggagcagaaatcgcaa aaacggaagggctacggggcacttcctttattaggccacttcggggagcaaagggggtgt gggctcgggtccccccgcccgatcgcaggggaaggggctgtttgtgcagcgtccggctgt gttatgagtggtagctcttccgtggtggctagcccgggtgcacaggctgttagtgggatc ttgggggtggtggttcgcagccgacgactggaaatggcagaccatatgatggccatgaac cacgggcgcttccccgacggcaccaatgggctgcaccatcaccctgcccaccgcatgggc atggggcagttcccgagcccccatcaccaccagcagcagcagccccagcacgccttcaac gccctaatgggcgagcacatacactacggcgcgggcaacatgaatgccacgagcggcatc aggcatgcgatggggccggggactgtgaacggagggcaccccccgagcgcgctggccccc gcggccaggtttaacaactcccagttcatgggtcccccggtggccagccagggaggctcc ctgccggccagcatgcagctgcagaagctcaacaaccagtatttcaaccatcacccctac ccccacaaccactacatgccggatttgcaccctgctgcaggccaccagatgaacgggaca aaccagcacttccgagattgcaaccccaagcacagcggcggcagcagcacccccggcggc tcgggcggcagcagcacccccggcggctctggcagcagctcgggcggcggcgcgggcagc agcaacagcggcggcggcagcggcagcggcaacatgcccgcctccgtggcccacgtcccc gctgcaatgctgccgcccaatgtcatagacactgatttcatcgacgaggaagttcttatg tccttggtgatagaaatgggtttggaccgcatcaaggagctgcccgaactctggctgggg caaaacgagtttgattttatgacggacttcgtgtgcaaacagcagcccagcagagtgagc tgttga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_7|72_aa MHSIKCSKIVTDVSHSSHSSNRSVLRGREKPQASKRQWEKSITCSLKVLFLGLVTHQTCC YGVSMPRVSFPG >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_7|219_bp atgcacagcattaagtgctcaaagattgtgaccgatgttagtcattcttcccactcgtcc aacaggagtgttcttagaggacgagagaaaccacaggccagcaagaggcaatgggagaag tccatcacatgctctctaaaggtcctgttcctgggcctggttacacatcagacctgttgt tacggtgtctctatgccaagggtgtcatttcctggctga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_8|61_aa MQEEKVNHTAETLLEAGEFFFGTTLDCSHFDLEKTHRRKSPRGNVECGSFEKGSEREKGT S >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_8|186_bp atgcaggaagaaaaggtcaatcatactgctgagactttgctggaagctggtgaatttttc tttggtacaactttggactgtagccactttgatttagagaagacccatcgtcggaaaagc cctcgtgggaacgtggaatgcgggagctttgagaaggggtctgaaagagaaaagggaact agctag >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_9|221_aa MDGTLLRETPTTGPGEERGRPHGGFRCSSGWKSQKRMYQRVVSTHTPSVGRGAAEGRVAF QFESRPEMFERTKESSPMSLDVQNLPIADPDWQRSCHTHAEPESRCLSKTQIGCPLFSSG KPYPPPLKTGVICPAALHFYSTFGTSHTKTCVLVTCQPSQPGCLAISHRRGGAPGCSRRP KRAKEPKQIGETEFQPRSYLLNTQGPMVATHTNALIHNMKD >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_9|666_bp atggatggaactctgctaagggaaacgccaacaacaggcccaggggaagagagaggaaga ccccatggaggattcagatgttcttctgggtggaagagccaaaagcgaatgtaccagcgt gttgttagcacacacacacccagcgtgggaagaggagcagctgagggcagagttgctttt caatttgaaagcagacctgaaatgttcgaaagaaccaaggagtcaagtccaatgtccctg gacgttcaaaacctccccatcgccgaccccgattggcagagaagctgccacacccatgca gagcccgaatcccgctgtctttccaagacccagatagggtgccctctcttctcttctggg aaaccttatccccctcccctgaagactggagttatctgtcctgctgcccttcacttttat tcaacgtttggcacgtcccatactaagacgtgtgttttagttacctgccagccttcccaa ccaggatgtttggctatcagccacagaagaggaggggccccagggtgcagcagaagaccc aagagagccaaagaacccaaacagataggtgaaacagagttccaaccaagaagttaccta ctgaacactcagggacccatggttgctacacatactaatgctctaattcataatatgaag gactga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_10|63_aa MGMQAAVAIKLVLTDLQMPQATEVFGWTPRLCRRTKVLLFADCVGSRSHDDDDDDDDHDH AFS >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_10|192_bp atggggatgcaagcagctgtggccatcaaactggttctaacagatctgcagatgcctcag gccacagaggtatttggctggactccacgtctgtgtagaaggacaaaagtcttgctattt gcagattgtgtgggtagcaggagtcatgacgatgatgatgatgatgacgatcatgatcat gcttttagttga >gi568815592r:139273135_139473944|GENSCAN_predicted_peptide_11|49_aa LEIKPFWLPIVGEVETVNGVCNEHLKFVELNQRETCSKIRIQMNQVEAS >gi568815592r:139273135_139473944|GENSCAN_predicted_CDS_11|150_bp ctagaaatcaagcctttctggcttccgattgttggagaagtggaaacagttaatggagta tgcaatgagcatttaaagtttgtggaactaaaccagagggagacctgcagcaaaatccga attcaaatgaaccaggtggaagcaagttag