GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:30:25 Sequence gi568815594f:139566020_139804145 : 238126 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9304 9508 205 0 1 53 78 125 0.342 7.01 1.02 Term + 31644 31828 185 1 2 30 41 116 0.011 -1.19 1.03 PlyA + 32967 32972 6 1.05 2.00 Prom + 41207 41246 40 -1.56 2.01 Init + 59799 59848 50 2 2 38 96 50 0.705 1.25 2.02 Intr + 59968 60182 215 1 2 88 77 99 0.522 7.16 2.03 Term + 68012 68337 326 0 2 80 37 167 0.148 5.73 2.04 PlyA + 69062 69067 6 -0.45 3.00 Prom + 69756 69795 40 -4.46 3.01 Init + 74985 75036 52 2 1 110 94 18 0.170 5.96 3.02 Intr + 77470 77551 82 2 1 113 88 20 0.171 3.20 3.03 Intr + 84006 84171 166 0 1 24 93 81 0.039 2.16 3.04 Term + 90750 90896 147 2 0 119 53 10 0.093 -1.50 3.05 PlyA + 93642 93647 6 -0.45 4.00 Prom + 94979 95018 40 -6.06 4.01 Init + 100001 100058 58 1 1 87 69 110 0.925 8.48 4.02 Intr + 112524 112623 100 1 1 109 98 44 0.110 6.67 4.03 Term + 121828 121855 28 1 1 125 40 10 0.007 -2.55 4.04 PlyA + 122037 122042 6 1.05 5.03 PlyA - 122518 122513 6 1.05 5.02 Term - 130137 130072 66 0 0 118 39 88 0.785 4.84 5.01 Init - 130669 130508 162 1 0 21 8 169 0.406 1.93 5.00 Prom - 131846 131807 40 -4.16 6.02 PlyA - 132085 132080 6 -0.45 6.01 Sngl - 132535 132125 411 1 0 79 48 375 0.971 28.89 6.00 Prom - 136719 136680 40 -2.76 7.03 PlyA - 136909 136904 6 1.05 7.02 Term - 138481 138364 118 0 1 9 38 101 0.143 -4.79 7.01 Init - 142588 141696 893 1 2 39 53 256 0.304 11.04 7.00 Prom - 150157 150118 40 0.24 8.07 PlyA - 150339 150334 6 1.05 8.06 Term - 151698 151537 162 0 0 71 43 73 0.418 -0.96 8.05 Intr - 152657 152497 161 0 2 56 94 113 0.708 8.41 8.04 Intr - 154304 153315 990 0 0 92 19 340 0.768 18.33 8.03 Intr - 159816 159732 85 0 1 87 100 49 0.606 5.39 8.02 Intr - 164648 164397 252 2 0 105 92 365 0.755 36.23 8.01 Init - 169941 169861 81 0 0 53 -15 148 0.587 -0.03 8.00 Prom - 171553 171514 40 -5.26 9.00 Prom + 173309 173348 40 -4.66 9.01 Init + 197292 197436 145 2 1 16 84 195 0.715 12.18 9.02 Term + 213983 214008 26 0 2 120 38 19 0.014 -1.51 9.03 PlyA + 214086 214091 6 1.05 10.02 PlyA - 214557 214552 6 1.05 10.01 Term - 234687 234572 116 1 2 97 53 116 0.975 7.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 61882 61952 71 2 2 77 39 102 0.867 2.30 S.002 Term + 100187 100347 161 0 2 109 38 35 0.801 -1.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_1|129_aa MCIKYSLKGEESIQTLLSFIRQTHIEHLERMKEQQRKGSGSESAPKARQVEADHKNPWKG KCWEIFPGGLFTQLPTHQLHWMSDKHLNPDMSSTELLDFSPKLAFFSISVNGKFIFTVAQ IKIRVTVVS >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_1|390_bp atgtgcatcaagtactccctcaaaggggaagaatccatccagacactcttatctttcatt cgacaaacacatattgagcacctggaacgcatgaaggaacagcaaagaaaaggctcgggt tctgaaagtgctcccaaggcgaggcaggtggaggcagatcacaagaacccttggaagggc aagtgctgggagatttttccaggaggcttattcacccagctgcccactcatcagctccac tggatgtcagacaagcatctcaaccctgacatgtccagcaccgagctcctggacttctct cccaaactggcattcttctccatctcagttaatggcaagttcatctttacagttgctcag attaaaataagagtcactgtagtgagttga >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_2|196_aa MPKIGLSECQYLLQPIRLWVSHRRAEVYSMNVCVVYEKMVTITNLQGNANQNQNEILLTP VKVAIIKKTKYNRAGEDVEKGEHSLYTVAPHPPVISVPQPVTLPSTQPASLYPSSCMDAP AILVAQDCANNHQYAFTSSAPPMPLLTLSYQSDILNFSFSYLHILFLSLLCRLHLTCLLL KLPCNAYYAKAKKQVD >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_2|591_bp atgcctaaaattggcttgtctgagtgccaatatctgctgcagcccatcagattatgggtt tctcataggagagctgaagtctactcaatgaatgtctgtgtggtatatgaaaaaatggtc accatcactaatcttcaaggaaatgcaaatcaaaaccaaaatgagatattactcactcca gttaaagtggctattatcaaaaagacaaaatataacagagctggtgaggatgtggagaaa ggggaacactccttgtacactgttgcccctcatcctcctgtcatttcggtccctcaaccg gtcactttgccatccactcaacctgcttctctgtacccttcttcatgcatggatgcccct gccattcttgttgcacaggattgcgccaataaccaccagtatgcttttacctcttctgct cccccaatgccccttctcacactctcataccagtccgacatcctcaacttcagtttctct tatctacacatacttttcctgtcacttctatgccgactccatctcacgtgcctgctcttg aaacttccatgcaacgcttattacgccaaagcaaagaagcaagtagattag >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_3|148_aa MGPVQLWRALSKETRLAAPCATSLHDTERGHLCSQMLLSVFVTKALEKSQRMREFYELQE WSALEDIQRRVNRERTAFHKEVDTGDVLVINRDLQTRSIQFATGMPNVKCPFLTLLHGMY VSGAASVFGFRLALSGSGLTVKSWYFYI >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_3|447_bp atggggccagtgcagctgtggagagctctgagcaaagaaacaaggttggcagcaccctgt gcaaccagcttgcatgacactgagcgggggcacctctgctcacagatgttgctgtctgtc tttgtcaccaaggcgttagaaaaatctcagagaatgagggagttctatgaactccaggaa tggtctgctttggaggacatccaaaggagagtgaacagagaaaggacagcttttcacaaa gaagtggacactggtgatgttctggtcatcaatagagacctccaaaccagatctatccag tttgccacagggatgccaaacgtaaagtgtccttttctgacccttctccatggaatgtac gtctctggggcagcgtctgtgtttggcttccgtctagctctgagtggatcaggactaacg gttaaatcctggtatttttacatttga >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_4|61_aa MAGNSILLAAVSILSACQQSYFALQVGKARLKYKVTPPAVTGSPEFERVFRAQRGLTDDS L >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_4|186_bp atggccgggaactcgatcctgctggctgctgtctctattctctcggcctgtcagcaaagt tattttgctttgcaagttggaaaggcaagattaaaatacaaagttacgcccccagcagtc actgggtcaccagagtttgagagagtatttcgggcacagagaggacttacagatgacagt ctctag >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_5|75_aa MTLTGFTTEPIKEIMKEIVAVAKNVGDEGSQDMNLGEIQELIDTTPDELTEDDLLTQREN DEDEDLYDGPLPLNK >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_5|228_bp atgacactcacaggatttacaacagagccaatcaaggaaatcatgaaagagatagtggct gtggcaaagaatgtgggggatgaagggtctcaagatatgaatcttggagaaattcaagag ctaatagacaccacgccagatgaattaacagaagatgacttgcttactcaacgtgaaaat gatgaggatgaagacctttatgatggtccacttccacttaataaatag >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_6|136_aa MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTE LLVRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLAHSIRGERA >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_6|411_bp atggctcgtacaaagcagactgcccgcaaatcgaccggtggtaaagcacccaggaagcaa ctggctacaaaagccgctcgcaagagtgcgccctctactggaggggtgaagaaacctcat cgttacaggcctggtactgtggcgctccgtgaaattagacgttatcagaagtccactgaa cttctggttcgcaaacttcccttccagcgtctggtgcgagaaattgctcaggactttaaa acagatctgcgcttccagagcgcagctatcggtgctttgcaggaggcaagtgaggcctat ctggttggcctttttgaagacaccaacctgtgtgctatccatgccaaacgtgtaacaatt atgccaaaagacatccagctagcacacagcatacgtggagaacgtgcttaa >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_7|336_aa MTFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITIPDFKLYYKATVNKTAWYWYQK RDIDQWNRTEPSEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFL TPYTKINSRWNKDLRVRPKTIKTLEENLGNIIQDIGMGKDKDFMSKTPKAMATKDKMDKR DLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKK WAKDMNRHFSKEDIYAAKKYMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIQKSGNNSFC LKLSIKREAVRVAGYVSSPRNGQHTGQAAVSLHNRS >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_7|1011_bp atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccacatcaccaagtcaatcctaagccaaaagaacaaagccggaggcatcacgatacct gacttcaaactatactacaaggctacagtcaacaaaacagcatggtactggtaccaaaaa agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac catctgatctttgataaacctgacaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggaataaagacttacgtgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaatatcattcaggacataggcatgggcaaggac aaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaatggacaaacgg gatctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacagg caacctacagaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatcc agaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaag tgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaatac atgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgaga taccatctcacaccagttagaatggcgatcattcaaaagtcaggaaacaacagcttctgt ctgaaactttccatcaagagagaagcagtcagagtggcaggatatgtcagtagtcctcgg aacggccagcacacgggtcaggctgctgtgtccctacacaaccggtcatga >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_8|576_aa MRPLEHTAASRAPERRARAEDLQGPSREQHPVGLPRTTGPMQSSVPPGSGGMVSGASPAG PGFLGSQPQAAIMKQMLIDQRAQLIEQQKQQFLREQRQQQQQQQQQILAEQQLQQSHLPR QHLQPQRNPYPVQQVNQFQGSPQDIAAVRSQAALQSMRTSRLMAQNAGMMGIGPSQNPGT MATAAAQSEMGLAPYSTTPTSQPGMYNMSTGMTQMLQHPNQSGMSITHNQAQGPRQPASG QGVGMVSGFGQSMLVNSAITQQHPQMKGPVGQALPRPQAPPRLQSLMGTVQQGAQSWQQR SLQGMPGRTSGELGPFNNGASYPLQAGQPRLTKQHFPQGLSQSVVDANTGTVRTLNPAAM GRQMMPSLPGQQGTSQARPMVMSGLSQGVPGMPAFSQPPAQQQIPSGSFAPSSQSQAYER NAPQDVSYNYSGDGAGGSFPGLPDGADLVDSIIKGGPGDEWMQELDELFGIAKALLSQGE QAEFGAGSIRQSWDGFPLLVPLVAVPPGGAGACWRPIQNPYIQADHYGRVGACGVGLATL SPEKASCHGKLGAEVFLERRRQANIAFDLLSAYPGL >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_8|1731_bp atgcgcccgctggagcacaccgcagcctcgcgggctcccgagcggcgcgcgcgggccgag gacctccagggcccctcgcgggagcagcatccagttggacttccccgaaccacaggcccc atgcagtcctccgtgcccccaggctcaggtggcatggtctcaggagccagtcccgcaggc cccggcttcctgggcagccagccccaagcagccatcatgaagcagatgctcattgatcag cgggcccagttgatagagcagcagaagcaacagttcctgcgggagcaaaggcagcagcag cagcagcagcagcagcagattttggcggaacagcagttgcagcaatcacatctaccccgg cagcacctccagccacagcggaatccatacccagtgcagcaggtcaatcagtttcaaggt tctccccaggatatagcagccgtaagaagccaagcagccctccagagcatgcgaacgtca cggctgatggcacagaacgcaggcatgatgggaataggaccctcccagaaccctgggacg atggccaccgcagctgcgcagtcggagatgggactggccccttatagcaccacgcctacc agccaaccaggaatgtacaatatgagcacaggcatgacccaaatgttgcagcatccaaac caaagtggcatgagcatcacacataaccaagcccagggaccgaggcaacctgcctctggg cagggggttggaatggtgagtggctttggtcagagcatgctggtgaactcagccattacc cagcaacatccacagatgaaagggccagtaggccaggccttgcctaggccccaagcccct ccaaggctgcagagccttatgggaacagtccagcaaggagcacaaagctggcaacagagg agcttgcagggcatgcctgggaggactagtggagaattgggaccattcaacaatggcgcc agctaccctcttcaagctgggcagccgagactgaccaagcagcacttcccacagggactg agccagtcagtcgtggatgctaacacgggcacagtgaggaccctcaacccagctgccatg ggtcggcagatgatgccatcgctcccggggcagcaaggcaccagccaggcgaggccaatg gtcatgtctggcctgagccagggagtcccaggcatgccagcgttcagccagcccccagca cagcagcagatacccagtggcagctttgctccaagcagccagagccaagcctatgagcgg aatgcccctcaggacgtgtcatacaattacagtggcgacggagctgggggttccttccct ggcctcccggacggtgcagaccttgtggactccatcatcaaaggcgggccaggggacgag tggatgcaggagcttgatgaattgtttggaattgcaaaggctctgctgtcacaaggagag caggctgagtttggagcagggtccatccggcagtcctgggacggcttccctctgctggtg cccctggtggcagtccctccaggtggggctggagcctgctggcgcccaatacaaaaccca tacatccaggcagatcactacggaagagtcggagcctgtggggttggactggccacactc agtcctgagaaggcgagttgccatggaaagctgggggcagaggtgtttttggagaggagg cggcaggcaaacattgcctttgacttgctctccgcgtacccggggttgtag >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_9|56_aa MTKALGKAEEESGFCENEQGSCSESAAEGAICLSNEKYGAKEAHGIQTCQQMRCHS >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_9|171_bp atgactaaggccctaggcaaagctgaagaggagtctggattctgtgagaatgaacaaggc agctgctcggagtcggctgctgagggggccatctgtttaagcaacgagaagtacggagct aaagaggcacacggtattcaaacatgccagcagatgcgctgccattcctga >gi568815594f:139566020_139804145|GENSCAN_predicted_peptide_10|38_aa XIKADFEFSNENKSEAFLIVLSCVYNLPVTEASFVAKV >gi568815594f:139566020_139804145|GENSCAN_predicted_CDS_10|117_bp naaattaaagctgactttgagttttcaaatgaaaacaaatcggaggcctttctgattgtt ttatcttgcgtgtacaatcttccagtgacggaggcctccttcgttgctaaagtctga