GENSCAN 1.0    Date run: 5-Nov-116    Time: 17:59:02

Sequence gi568815594r:139598147_139798554 : 200408 bp : 43.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   9080   9119   40                               -1.56
 1.01 Init +  27672  27721   50  2  2   38   96    50 0.703   1.25
 1.02 Intr +  27841  28055  215  1  2   88   77    99 0.523   7.16
 1.03 Term +  35885  36210  326  0  2   80   37   167 0.148   5.73
 1.04 PlyA +  36935  36940    6                               -0.45

 2.00 Prom +  37629  37668   40                               -4.46
 2.01 Init +  42858  42909   52  2  1  110   94    18 0.170   5.96
 2.02 Intr +  45343  45424   82  2  1  113   88    20 0.171   3.20
 2.03 Intr +  51879  52044  166  0  1   24   93    81 0.039   2.16
 2.04 Term +  58623  58769  147  2  0  119   53    10 0.093  -1.50
 2.05 PlyA +  61515  61520    6                               -0.45

 3.00 Prom +  62852  62891   40                               -6.06
 3.01 Init +  67874  67931   58  1  1   87   69   110 0.925   8.48
 3.02 Intr +  80397  80496  100  1  1  109   98    44 0.110   6.67
 3.03 Term +  89701  89728   28  1  1  125   40    10 0.007  -2.55
 3.04 PlyA +  89910  89915    6                                1.05

 4.03 PlyA -  90391  90386    6                                1.05
 4.02 Term -  98010  97945   66  0  0  118   39    88 0.785   4.84
 4.01 Init -  98542  98381  162  1  0   21    8   169 0.406   1.93
 4.00 Prom -  99719  99680   40                               -4.16

 5.02 PlyA -  99958  99953    6                               -0.45
 5.01 Sngl - 100408  99998  411  1  0   79   48   375 0.971  28.89
 5.00 Prom - 104592 104553   40                               -2.76

 6.03 PlyA - 104782 104777    6                                1.05
 6.02 Term - 106354 106237  118  0  1    9   38   101 0.143  -4.79
 6.01 Init - 110461 109569  893  1  2   39   53   256 0.304  11.04
 6.00 Prom - 118030 117991   40                                0.24

 7.07 PlyA - 118212 118207    6                                1.05
 7.06 Term - 119571 119410  162  0  0   71   43    73 0.418  -0.96
 7.05 Intr - 120530 120370  161  0  2   56   94   113 0.708   8.41
 7.04 Intr - 122177 121188  990  0  0   92   19   340 0.768  18.33
 7.03 Intr - 127689 127605   85  0  1   87  100    49 0.606   5.39
 7.02 Intr - 132521 132270  252  2  0  105   92   365 0.755  36.23
 7.01 Init - 137814 137734   81  0  0   53  -15   148 0.587  -0.03
 7.00 Prom - 139426 139387   40                               -5.26

 8.00 Prom + 141182 141221   40                               -4.66
 8.01 Init + 165165 165309  145  2  1   16   84   195 0.716  12.18
 8.02 Intr + 196636 196764  129  2  0   36  116    25 0.018   0.77
 8.03 Intr + 197774 197827   54  0  0  106   91    13 0.021   2.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  29755  29825   71  2  2   77   39   102 0.867   2.30
S.002 Term +  68060  68220  161  0  2  109   38    35 0.801  -1.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_1|196_aa
MPKIGLSECQYLLQPIRLWVSHRRAEVYSMNVCVVYEKMVTITNLQGNANQNQNEILLTP
VKVAIIKKTKYNRAGEDVEKGEHSLYTVAPHPPVISVPQPVTLPSTQPASLYPSSCMDAP
AILVAQDCANNHQYAFTSSAPPMPLLTLSYQSDILNFSFSYLHILFLSLLCRLHLTCLLL
KLPCNAYYAKAKKQVD

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_1|591_bp
atgcctaaaattggcttgtctgagtgccaatatctgctgcagcccatcagattatgggtt
tctcataggagagctgaagtctactcaatgaatgtctgtgtggtatatgaaaaaatggtc
accatcactaatcttcaaggaaatgcaaatcaaaaccaaaatgagatattactcactcca
gttaaagtggctattatcaaaaagacaaaatataacagagctggtgaggatgtggagaaa
ggggaacactccttgtacactgttgcccctcatcctcctgtcatttcggtccctcaaccg
gtcactttgccatccactcaacctgcttctctgtacccttcttcatgcatggatgcccct
gccattcttgttgcacaggattgcgccaataaccaccagtatgcttttacctcttctgct
cccccaatgccccttctcacactctcataccagtccgacatcctcaacttcagtttctct
tatctacacatacttttcctgtcacttctatgccgactccatctcacgtgcctgctcttg
aaacttccatgcaacgcttattacgccaaagcaaagaagcaagtagattag

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_2|148_aa
MGPVQLWRALSKETRLAAPCATSLHDTERGHLCSQMLLSVFVTKALEKSQRMREFYELQE
WSALEDIQRRVNRERTAFHKEVDTGDVLVINRDLQTRSIQFATGMPNVKCPFLTLLHGMY
VSGAASVFGFRLALSGSGLTVKSWYFYI

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_2|447_bp
atggggccagtgcagctgtggagagctctgagcaaagaaacaaggttggcagcaccctgt
gcaaccagcttgcatgacactgagcgggggcacctctgctcacagatgttgctgtctgtc
tttgtcaccaaggcgttagaaaaatctcagagaatgagggagttctatgaactccaggaa
tggtctgctttggaggacatccaaaggagagtgaacagagaaaggacagcttttcacaaa
gaagtggacactggtgatgttctggtcatcaatagagacctccaaaccagatctatccag
tttgccacagggatgccaaacgtaaagtgtccttttctgacccttctccatggaatgtac
gtctctggggcagcgtctgtgtttggcttccgtctagctctgagtggatcaggactaacg
gttaaatcctggtatttttacatttga

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_3|61_aa
MAGNSILLAAVSILSACQQSYFALQVGKARLKYKVTPPAVTGSPEFERVFRAQRGLTDDS
L

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_3|186_bp
atggccgggaactcgatcctgctggctgctgtctctattctctcggcctgtcagcaaagt
tattttgctttgcaagttggaaaggcaagattaaaatacaaagttacgcccccagcagtc
actgggtcaccagagtttgagagagtatttcgggcacagagaggacttacagatgacagt
ctctag

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_4|75_aa
MTLTGFTTEPIKEIMKEIVAVAKNVGDEGSQDMNLGEIQELIDTTPDELTEDDLLTQREN
DEDEDLYDGPLPLNK

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_4|228_bp
atgacactcacaggatttacaacagagccaatcaaggaaatcatgaaagagatagtggct
gtggcaaagaatgtgggggatgaagggtctcaagatatgaatcttggagaaattcaagag
ctaatagacaccacgccagatgaattaacagaagatgacttgcttactcaacgtgaaaat
gatgaggatgaagacctttatgatggtccacttccacttaataaatag

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_5|136_aa
MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTE
LLVRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTI
MPKDIQLAHSIRGERA

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_5|411_bp
atggctcgtacaaagcagactgcccgcaaatcgaccggtggtaaagcacccaggaagcaa
ctggctacaaaagccgctcgcaagagtgcgccctctactggaggggtgaagaaacctcat
cgttacaggcctggtactgtggcgctccgtgaaattagacgttatcagaagtccactgaa
cttctggttcgcaaacttcccttccagcgtctggtgcgagaaattgctcaggactttaaa
acagatctgcgcttccagagcgcagctatcggtgctttgcaggaggcaagtgaggcctat
ctggttggcctttttgaagacaccaacctgtgtgctatccatgccaaacgtgtaacaatt
atgccaaaagacatccagctagcacacagcatacgtggagaacgtgcttaa

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_6|336_aa
MTFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITIPDFKLYYKATVNKTAWYWYQK
RDIDQWNRTEPSEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFL
TPYTKINSRWNKDLRVRPKTIKTLEENLGNIIQDIGMGKDKDFMSKTPKAMATKDKMDKR
DLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKK
WAKDMNRHFSKEDIYAAKKYMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIQKSGNNSFC
LKLSIKREAVRVAGYVSSPRNGQHTGQAAVSLHNRS

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_6|1011_bp
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccacatcaccaagtcaatcctaagccaaaagaacaaagccggaggcatcacgatacct
gacttcaaactatactacaaggctacagtcaacaaaacagcatggtactggtaccaaaaa
agagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaac
catctgatctttgataaacctgacaaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggaataaagacttacgtgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcaatatcattcaggacataggcatgggcaaggac
aaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaatggacaaacgg
gatctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacagg
caacctacagaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatcc
agaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaag
tgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaatac
atgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgaga
taccatctcacaccagttagaatggcgatcattcaaaagtcaggaaacaacagcttctgt
ctgaaactttccatcaagagagaagcagtcagagtggcaggatatgtcagtagtcctcgg
aacggccagcacacgggtcaggctgctgtgtccctacacaaccggtcatga

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_7|576_aa
MRPLEHTAASRAPERRARAEDLQGPSREQHPVGLPRTTGPMQSSVPPGSGGMVSGASPAG
PGFLGSQPQAAIMKQMLIDQRAQLIEQQKQQFLREQRQQQQQQQQQILAEQQLQQSHLPR
QHLQPQRNPYPVQQVNQFQGSPQDIAAVRSQAALQSMRTSRLMAQNAGMMGIGPSQNPGT
MATAAAQSEMGLAPYSTTPTSQPGMYNMSTGMTQMLQHPNQSGMSITHNQAQGPRQPASG
QGVGMVSGFGQSMLVNSAITQQHPQMKGPVGQALPRPQAPPRLQSLMGTVQQGAQSWQQR
SLQGMPGRTSGELGPFNNGASYPLQAGQPRLTKQHFPQGLSQSVVDANTGTVRTLNPAAM
GRQMMPSLPGQQGTSQARPMVMSGLSQGVPGMPAFSQPPAQQQIPSGSFAPSSQSQAYER
NAPQDVSYNYSGDGAGGSFPGLPDGADLVDSIIKGGPGDEWMQELDELFGIAKALLSQGE
QAEFGAGSIRQSWDGFPLLVPLVAVPPGGAGACWRPIQNPYIQADHYGRVGACGVGLATL
SPEKASCHGKLGAEVFLERRRQANIAFDLLSAYPGL

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_7|1731_bp
atgcgcccgctggagcacaccgcagcctcgcgggctcccgagcggcgcgcgcgggccgag
gacctccagggcccctcgcgggagcagcatccagttggacttccccgaaccacaggcccc
atgcagtcctccgtgcccccaggctcaggtggcatggtctcaggagccagtcccgcaggc
cccggcttcctgggcagccagccccaagcagccatcatgaagcagatgctcattgatcag
cgggcccagttgatagagcagcagaagcaacagttcctgcgggagcaaaggcagcagcag
cagcagcagcagcagcagattttggcggaacagcagttgcagcaatcacatctaccccgg
cagcacctccagccacagcggaatccatacccagtgcagcaggtcaatcagtttcaaggt
tctccccaggatatagcagccgtaagaagccaagcagccctccagagcatgcgaacgtca
cggctgatggcacagaacgcaggcatgatgggaataggaccctcccagaaccctgggacg
atggccaccgcagctgcgcagtcggagatgggactggccccttatagcaccacgcctacc
agccaaccaggaatgtacaatatgagcacaggcatgacccaaatgttgcagcatccaaac
caaagtggcatgagcatcacacataaccaagcccagggaccgaggcaacctgcctctggg
cagggggttggaatggtgagtggctttggtcagagcatgctggtgaactcagccattacc
cagcaacatccacagatgaaagggccagtaggccaggccttgcctaggccccaagcccct
ccaaggctgcagagccttatgggaacagtccagcaaggagcacaaagctggcaacagagg
agcttgcagggcatgcctgggaggactagtggagaattgggaccattcaacaatggcgcc
agctaccctcttcaagctgggcagccgagactgaccaagcagcacttcccacagggactg
agccagtcagtcgtggatgctaacacgggcacagtgaggaccctcaacccagctgccatg
ggtcggcagatgatgccatcgctcccggggcagcaaggcaccagccaggcgaggccaatg
gtcatgtctggcctgagccagggagtcccaggcatgccagcgttcagccagcccccagca
cagcagcagatacccagtggcagctttgctccaagcagccagagccaagcctatgagcgg
aatgcccctcaggacgtgtcatacaattacagtggcgacggagctgggggttccttccct
ggcctcccggacggtgcagaccttgtggactccatcatcaaaggcgggccaggggacgag
tggatgcaggagcttgatgaattgtttggaattgcaaaggctctgctgtcacaaggagag
caggctgagtttggagcagggtccatccggcagtcctgggacggcttccctctgctggtg
cccctggtggcagtccctccaggtggggctggagcctgctggcgcccaatacaaaaccca
tacatccaggcagatcactacggaagagtcggagcctgtggggttggactggccacactc
agtcctgagaaggcgagttgccatggaaagctgggggcagaggtgtttttggagaggagg
cggcaggcaaacattgcctttgacttgctctccgcgtacccggggttgtag

>gi568815594r:139598147_139798554|GENSCAN_predicted_peptide_8|110_aa
MTKALGKAEEESGFCENEQGSCSESAAEGAICLSNEKYGAKEAHGIQTYDTANSLCSSVE
LAKIVSVFVLLNRLGSGAKRLPVLKASFLNEDRLSHLPHYKCDCIFFSTX

>gi568815594r:139598147_139798554|GENSCAN_predicted_CDS_8|330_bp
atgactaaggccctaggcaaagctgaagaggagtctggattctgtgagaatgaacaaggc
agctgctcggagtcggctgctgagggggccatctgtttaagcaacgagaagtacggagct
aaagaggcacacggtattcaaacatatgacactgccaacagtctttgctcctctgtggag
ctggccaagattgtttctgtcttcgtactgcttaacagactagggtcaggtgccaagagg
ttgcctgtcctgaaagcttcttttctgaatgaagacagactttcccatttaccccactat
aagtgtgactgtatcttctttagcacagnn