GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:17:44 Sequence gi568815581f:37935054_38114590 : 179537 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1363 1520 158 2 2 95 39 134 0.970 7.30 1.02 PlyA + 4812 4817 6 1.05 2.00 Prom + 22789 22828 40 -4.06 2.01 Init + 28642 28644 3 0 0 113 81 0 0.652 1.80 2.02 Intr + 31167 31256 90 2 0 110 0 65 0.058 0.09 2.03 Term + 39036 39167 132 2 0 105 43 126 0.732 7.89 2.04 PlyA + 40106 40111 6 1.05 3.15 PlyA - 40382 40377 6 1.05 3.14 Term - 42057 41900 158 1 2 95 39 129 0.098 6.80 3.13 Intr - 43985 43773 213 0 0 101 49 81 0.082 4.19 3.12 Intr - 45119 44967 153 0 0 85 80 74 0.107 6.34 3.11 Intr - 45589 45490 100 1 1 91 89 78 0.118 7.88 3.10 Intr - 46995 46943 53 1 2 100 38 82 0.094 3.03 3.09 Intr - 47431 47311 121 1 1 92 110 146 0.994 17.27 3.08 Intr - 48023 47975 49 1 1 97 117 62 0.987 8.68 3.07 Intr - 49381 49272 110 1 2 102 82 146 0.996 14.48 3.06 Intr - 49818 49711 108 0 0 102 101 153 0.993 18.48 3.05 Intr - 50342 50262 81 2 0 52 92 42 0.607 0.83 3.04 Intr - 50864 50825 40 1 1 80 116 -10 0.947 -0.77 3.03 Intr - 52233 52148 86 0 2 81 79 118 0.945 8.82 3.02 Intr - 52468 52396 73 0 1 90 82 60 0.941 5.01 3.01 Init - 52668 52499 170 0 2 86 58 92 0.738 4.16 3.00 Prom - 58144 58105 40 -3.26 4.00 Prom + 64831 64870 40 -3.26 4.01 Init + 70307 70476 170 1 2 86 58 92 0.737 4.16 4.02 Intr + 70507 70579 73 1 1 90 82 60 0.941 5.01 4.03 Intr + 70742 70827 86 1 2 81 79 118 0.945 8.82 4.04 Intr + 72111 72150 40 0 1 80 116 -10 0.947 -0.77 4.05 Intr + 72633 72713 81 2 0 52 92 42 0.607 0.83 4.06 Intr + 73157 73264 108 1 0 102 101 153 0.993 18.48 4.07 Intr + 73594 73703 110 0 2 102 82 146 0.996 14.48 4.08 Intr + 74952 75000 49 0 1 97 117 62 0.987 8.68 4.09 Intr + 75544 75664 121 0 1 92 110 146 0.994 17.27 4.10 Intr + 75980 76032 53 0 2 100 38 77 0.306 2.53 4.11 Intr + 77386 77485 100 0 1 91 89 78 0.399 7.88 4.12 Intr + 77856 78008 153 1 0 85 80 82 0.365 7.14 4.13 Intr + 78972 79184 213 1 0 101 49 81 0.263 4.19 4.14 Term + 80900 81057 158 0 2 95 39 129 0.327 6.80 4.15 PlyA + 84352 84357 6 1.05 5.00 Prom + 102315 102354 40 -4.06 5.01 Init + 108167 108169 3 1 0 113 81 0 0.640 1.80 5.02 Term + 118574 118705 132 1 0 105 43 126 0.585 7.89 5.03 PlyA + 119644 119649 6 1.05 6.15 PlyA - 119920 119915 6 1.05 6.14 Term - 121595 121438 158 0 2 95 39 129 0.186 6.80 6.13 Intr - 123523 123311 213 2 0 101 49 81 0.156 4.19 6.12 Intr - 124657 124505 153 2 0 85 80 88 0.221 7.74 6.11 Intr - 125127 125028 100 0 1 91 89 78 0.242 7.88 6.10 Intr - 126534 126482 53 1 2 100 38 77 0.184 2.53 6.09 Intr - 126970 126850 121 1 1 92 110 146 0.994 17.27 6.08 Intr - 127562 127514 49 1 1 97 117 85 0.987 10.98 6.07 Intr - 128927 128818 110 2 2 102 82 146 0.997 14.48 6.06 Intr - 129364 129257 108 1 0 102 101 153 0.993 18.48 6.05 Intr - 129888 129808 81 0 0 52 92 42 0.607 0.83 6.04 Intr - 130410 130371 40 2 1 80 116 -10 0.947 -0.77 6.03 Intr - 131779 131694 86 1 2 81 79 118 0.945 8.82 6.02 Intr - 132014 131942 73 1 1 90 82 60 0.941 5.01 6.01 Init - 132214 132045 170 1 2 86 58 92 0.737 4.16 6.00 Prom - 137690 137651 40 -3.26 7.00 Prom + 137979 138018 40 -6.96 7.01 Init + 141749 141751 3 1 0 71 101 0 0.699 -0.40 7.02 Intr + 143209 143293 85 0 1 131 116 54 0.956 11.89 7.03 Intr + 166350 166427 78 1 0 88 110 55 0.912 7.22 7.04 Term + 176348 176490 143 0 2 80 38 67 0.210 -1.01 7.05 PlyA + 177618 177623 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 46995 46838 158 1 2 100 48 133 0.883 8.60 S.002 Init + 53731 53782 52 0 1 123 105 85 0.968 15.02 S.003 Init - 69244 69193 52 1 1 123 105 85 0.962 15.02 S.004 Init + 133277 133328 52 1 1 123 105 85 0.950 15.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_1|52_aa XCRKAGVNAIVNARRRKLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_1|159_bp ncgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaagctgactgtt agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg gcacaggccagtgtaatgccaggatggaatgagctgtga >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_2|74_aa MVETELKLICGDVLDVLDKHLIPAATTGKSKAVCEMFHVRGKQHIQIPKLYTSSVTRHLH HFRLMQDSQPLDLS >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_2|225_bp atggttgagactgagctaaagttaatctgtggcgacgttctggatgtactggacaaacac ctcattccagcagctacaactggcaagtccaaggcagtctgtgagatgtttcatgtccga ggcaaacagcacattcagatccccaagctctacacctccagtgtgaccaggcacctgcac cacttcaggctcatgcaggactcacagcctttggacctcagctaa >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_3|504_aa MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHERADQAQISL GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV LKHLRASMKKLTRKQGDLQPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA RLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_3|1515_bp atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtataacccggag gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct gcccgggctcctgctgccatcggtgcccacgaacgggccgaccaagcccagatctctctc gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg ctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggacctgcaaccc ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga tggaatgagctgtga >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_4|504_aa MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISL GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV LKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA RLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_4|1515_bp atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtataacccggag gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct gcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctc gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg ctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggacctgccaccc ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga tggaatgagctgtga >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_5|44_aa MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_5|135_bp atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct ttggacctcagctaa >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_6|504_aa MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISL GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV LKHLRASMKKLTRKKGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA RLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_6|1515_bp atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtacaacccggag gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct gcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctc gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg ctcaagcatcttagggcctctatgaagaaactaacaagaaagaagggggacctgccaccc ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga tggaatgagctgtga >gi568815581f:37935054_38114590|GENSCAN_predicted_peptide_7|102_aa MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFEVWVILL >gi568815581f:37935054_38114590|GENSCAN_predicted_CDS_7|309_bp atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacaggaacctta aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat actaccccttctggagaggtgcgctatgctgctgtaacacagtttgaggtatgggttatt cttctctaa