GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:17:46 Sequence gi568815581f:38029496_38238550 : 209055 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13725 13727 3 2 0 113 81 0 0.670 1.80 1.02 Term + 24132 24263 132 2 0 105 43 126 0.585 7.89 1.03 PlyA + 25202 25207 6 1.05 2.15 PlyA - 25478 25473 6 1.05 2.14 Term - 27153 26996 158 1 2 95 39 129 0.185 6.80 2.13 Intr - 29081 28869 213 0 0 101 49 81 0.156 4.19 2.12 Intr - 30215 30063 153 0 0 85 80 88 0.221 7.74 2.11 Intr - 30685 30586 100 1 1 91 89 78 0.241 7.88 2.10 Intr - 32092 32040 53 2 2 100 38 77 0.183 2.53 2.09 Intr - 32528 32408 121 2 1 92 110 146 0.994 17.27 2.08 Intr - 33120 33072 49 2 1 97 117 85 0.987 10.98 2.07 Intr - 34485 34376 110 0 2 102 82 146 0.997 14.48 2.06 Intr - 34922 34815 108 2 0 102 101 153 0.993 18.48 2.05 Intr - 35446 35366 81 1 0 52 92 42 0.607 0.83 2.04 Intr - 35968 35929 40 0 1 80 116 -10 0.947 -0.77 2.03 Intr - 37337 37252 86 2 2 81 79 118 0.945 8.82 2.02 Intr - 37572 37500 73 2 1 90 82 60 0.941 5.01 2.01 Init - 37772 37603 170 2 2 86 58 92 0.737 4.16 2.00 Prom - 43248 43209 40 -3.26 3.00 Prom + 43537 43576 40 -6.96 3.01 Init + 47307 47309 3 2 0 71 101 0 0.699 -0.40 3.02 Intr + 48767 48851 85 1 1 131 116 54 0.957 11.89 3.03 Intr + 71908 71985 78 2 0 88 110 55 0.973 7.22 3.04 Intr + 81906 82027 122 1 2 80 64 53 0.764 2.31 3.05 Intr + 88045 88245 201 0 0 92 84 171 0.980 16.58 3.06 Intr + 93305 93469 165 1 0 110 55 43 0.878 3.36 3.07 Intr + 94544 94648 105 1 0 73 93 122 0.984 11.61 3.08 Intr + 99695 99969 275 1 2 88 58 43 0.065 -2.36 3.09 Intr + 100000 100072 73 1 1 90 82 60 0.932 5.01 3.10 Intr + 100235 100320 86 1 2 81 79 118 0.944 8.82 3.11 Intr + 101604 101643 40 0 1 80 116 -10 0.946 -0.77 3.12 Intr + 102126 102206 81 2 0 52 92 42 0.606 0.83 3.13 Intr + 102650 102757 108 1 0 102 101 154 0.993 18.58 3.14 Intr + 103087 103196 110 0 2 102 82 163 0.997 16.18 3.15 Intr + 104452 104500 49 1 1 97 117 85 0.987 10.98 3.16 Intr + 105044 105164 121 1 1 92 44 146 0.646 10.67 3.17 Intr + 105480 105532 53 1 2 100 38 77 0.409 2.53 3.18 Intr + 106886 106985 100 1 1 91 89 93 0.541 9.38 3.19 Intr + 107356 107508 153 2 0 85 80 88 0.496 7.74 3.20 Intr + 108490 108702 213 2 0 101 49 81 0.416 4.19 3.21 Term + 110418 110575 158 1 2 95 39 134 0.498 7.30 3.22 PlyA + 113867 113872 6 1.05 4.00 Prom + 131841 131880 40 -4.06 4.01 Init + 137696 137698 3 1 0 113 81 0 0.651 1.80 4.02 Term + 148098 148229 132 2 0 105 43 126 0.594 7.89 4.03 PlyA + 149168 149173 6 1.05 5.20 PlyA - 149444 149439 6 1.05 5.19 Term - 151119 150962 158 1 2 95 39 134 0.161 7.30 5.18 Intr - 153047 152835 213 0 0 101 49 81 0.134 4.19 5.17 Intr - 154181 154029 153 0 0 85 80 88 0.181 7.74 5.16 Intr - 154651 154552 100 1 1 91 89 78 0.197 7.88 5.15 Intr - 156057 156005 53 1 2 100 38 82 0.158 3.03 5.14 Intr - 156493 156373 121 1 1 92 110 146 0.994 17.27 5.13 Intr - 157085 157037 49 1 1 97 117 85 0.987 10.98 5.12 Intr - 158448 158339 110 0 2 102 82 146 0.997 14.48 5.11 Intr - 158885 158778 108 2 0 102 101 153 0.993 18.48 5.10 Intr - 159409 159329 81 1 0 52 92 42 0.607 0.83 5.09 Intr - 159931 159892 40 0 1 80 116 -10 0.947 -0.77 5.08 Intr - 161300 161215 86 2 2 81 79 118 0.945 8.82 5.07 Intr - 161535 161463 73 2 1 90 82 60 0.935 5.01 5.06 Intr - 161840 161566 275 2 2 88 58 49 0.097 -1.76 5.05 Intr - 166976 166872 105 2 0 73 93 122 0.984 11.61 5.04 Intr - 168215 168051 165 2 0 110 55 43 0.878 3.36 5.03 Intr - 173486 173286 201 2 0 92 84 171 0.980 16.58 5.02 Intr - 179621 179500 122 0 2 80 64 46 0.747 1.61 5.01 Intr - 189628 189551 78 2 0 88 110 55 0.967 7.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 38835 38886 52 2 1 123 105 85 0.950 15.02 S.002 Term - 156057 155900 158 1 2 100 48 133 0.806 8.60 S.003 Init - 190685 190637 49 2 1 83 58 48 0.909 0.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:38029496_38238550|GENSCAN_predicted_peptide_1|44_aa MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581f:38029496_38238550|GENSCAN_predicted_CDS_1|135_bp atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct ttggacctcagctaa >gi568815581f:38029496_38238550|GENSCAN_predicted_peptide_2|504_aa MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISL GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV LKHLRASMKKLTRKKGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA RLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:38029496_38238550|GENSCAN_predicted_CDS_2|1515_bp atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtacaacccggag gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct gcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctc gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg ctcaagcatcttagggcctctatgaagaaactaacaagaaagaagggggacctgccaccc ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga tggaatgagctgtga >gi568815581f:38029496_38238550|GENSCAN_predicted_peptide_3|792_aa MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFENVIDRKPYPDDENLVEVKFARTPV TSTYLVAFVVGEYDFVETRSKDGVCVCVYTPVGKAEQGKFALEEWWTHLWLNEGFASWIE YLCVDHCFPEYDIWTQFVSADYTRAQELDALDNSHPIEVSVGHPSEVDEIFDAISYSKGA SVIRMLHDYIGDKVETIQPAGAGVSPFHGIGGQQDSLSRSHCTQVLGKMPTPAWDLRLQR LEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDVVEVAGSWWAQER EDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRREISRK SKWVDMLGDWEKYKSSRKLIDQAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIMKEKG KKSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLSHIAA LFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISLGLTLRLWDVYLV EGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLT RKKGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPR SSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRKLTVRPGFSRVARLLGDGCDPEDR AQASVMPGWNEL >gi568815581f:38029496_38238550|GENSCAN_predicted_CDS_3|2379_bp atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacaggaacctta aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat actaccccttctggagaggtgcgctatgctgctgtaacacagtttgagaatgtaattgac cggaaaccataccctgatgatgaaaatttagtggaagtgaagtttgcccgcacacctgtt acatctacatatctggtggcatttgttgtgggtgaatatgactttgtagaaacaaggtca aaagatggtgtgtgtgtctgtgtttacactcctgttggcaaagcagaacaaggaaaattt gcattagaggaatggtggactcatctttggttaaatgaaggttttgcatcctggattgaa tatctgtgtgtagaccactgcttcccagagtatgatatttggactcagtttgtttctgct gattacacccgtgcccaggagcttgacgccttagataacagccatcctattgaagtcagt gtgggccatccatctgaggttgatgagatatttgatgctatatcatatagcaaaggtgca tctgtcatccgaatgctgcatgactacattggggataaggtggaaaccattcaacctgct ggggccggtgtgtccccatttcatggcattgggggacaacaggattctctgtctaggtcc cactgtactcaagtccttgggaagatgcccacccctgcttgggacttgagactccagaga ctggagcagctgtgggccactgggtctggcccctttttccctgggggcggcggtggaatg ggggttacgcagccagccagcatctgggagcccggcgagagcggttcaggtgttctccga agccgccgcgtacggatggacgtggtagaggtcgcgggcagttggtgggcacaagagcga gaggacatcattatgaaatacgaaaagggacaccgagctgggctgccagaggacaagggg cctaagccttttcgaagctacaacaacaacgtcgatcatttggggattgtacatgagacg gagctgcctcctctgactgcgcgggaggcgaagcaaattcggcgggagatcagccgaaag agcaagtgggtggatatgctgggagactgggagaaatacaaaagcagcagaaagctcata gatcaagcgtacaagggaatgcccatgaacatccggggcccgatgtggtcagtcctcctg aacactgaggaaatgaagttgaaaaaccccggaagataccagatcatgaaggagaagggc aagaagtcatctgagcacatccagcgcatcgaccgggacgtaagcgggacattaaggaag catatattcttcagggatcgatacggaaccaagcagcgggaactactccacatcctcctg gcatatgaggagtacaacccggaggtgggctactgcagggacctgagccacatcgccgcc ttgttcctcctctatcttcctgaggaggatgcattctgggcactggtgcagctgctggcc agtgagaggcactccctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgg gccgaccaagcccagatctctctcgggctcaccctgcgcctgtgggacgtgtatctggta gaaggcgaacaggcgctgatgccgataacaagaatcgcctttaaggttcagcagaagcgc ctcacgaagacgtccaggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacc tgggccagggatgaggacactgtgctcaagcatcttagggcctctatgaagaaactaaca agaaagaagggggacctgccacccccagccaaacccgagcaagggtcgtcggcatccagg cctgtgccggcttcacgtggcgggaagaccctctgcaagggggacaggcaggcccctcca ggcccaccagcccggttcccgcggcccatttggtcagcttccccgccacgggcacctcgt tcttccacaccctgtcctggtggggctgtccgggaagacacctaccctgtgggcactcag gcgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaagctgactgtt agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg gcacaggccagtgtaatgccaggatggaatgagctgtga >gi568815581f:38029496_38238550|GENSCAN_predicted_peptide_4|44_aa MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581f:38029496_38238550|GENSCAN_predicted_CDS_4|135_bp atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct ttggacctcagctaa >gi568815581f:38029496_38238550|GENSCAN_predicted_peptide_5|763_aa XIHATGFNYQNEDEKVTLSFPSTLQTGTGTLKIDFVGELNDKMKGFYRSKYTTPSGEVPY AAVTQFENVIDRKPYPDDENLVEVKFARTPVTSTYLVAFVVGEYDFVETRSKDGVCVCVY TPVGKAEQGKFALEEWWTHLWLNEGFASWIEYLCVDHCFPEYDIWTQFVSADYTRAQELD ALDNSHPIEVSVGHPSEVDEIFDAISYSKGASVIRMLHDYIGDKVETIQPAGASVSPFHG IGGQQDSLSRSHCTQVLGKMPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIW EPGESGSGVLRSRRVRMDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNN NVDHLGIVHETELPPLTAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPM NIRGPMWSVLLNTEEMKLKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYG TKQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAA RAPAAIGAHERADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGP WARFCNRFVDTWARDEDTVLKHLRASMKKLTRKKGDLPPPAKPEQGSSASRPVPASRGGK TLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNA IVNARRRKLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:38029496_38238550|GENSCAN_predicted_CDS_5|2292_bp naaatacatgctacaggatttaactatcagaatgaagatgaaaaagtcaccttgtctttc cctagtactctgcaaacaggtacaggaaccttaaagatagattttgttggagagctgaat gacaaaatgaaaggtttctatagaagtaagtatactaccccttctggagaggtgccctat gctgctgtaacacagtttgagaatgtaattgaccggaaaccataccctgatgatgaaaat ttagtggaagtgaagtttgcccgcacacctgttacatctacatatctggtggcatttgtt gtgggtgaatatgactttgtagaaacaaggtcaaaagatggtgtgtgtgtctgtgtttac actcctgttggcaaagcagaacaaggaaaatttgcattagaggaatggtggactcatctt tggttaaatgaaggttttgcatcctggattgaatatctgtgtgtagaccactgcttccca gagtatgatatttggactcagtttgtttctgctgattacacccgtgcccaggagcttgac gccttagataacagccatcctattgaagtcagtgtgggccatccatctgaggttgatgag atatttgatgctatatcatatagcaaaggtgcatctgtcatccgaatgctgcatgactac attggggataaggtggaaaccattcaacctgctggggccagtgtgtccccatttcatggc attgggggacaacaggattctctgtctaggtcccactgtactcaagtccttgggaagatg cccacccctgcttgggacttgagactccagagactggagcagctgtgggccactgggtct ggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatctgg gagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtggta gaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaaaag ggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaacaac aacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgggag gcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctgggagac tgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgcccatg aacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaaaac cccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccagcgc atcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatacgga accaagcagcgggaactactccacatcctcctggcatatgaggagtacaacccggaggtg ggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggag gatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcc cgggctcctgctgccatcggtgcccacgaacgggccgaccaagcccagatctctctcggg ctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccgata acaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccg tgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctc aagcatcttagggcctctatgaagaaactaacaagaaagaagggggacctgccaccccca gccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcgggaag accctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcggccc atttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggct gtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgcc attgttaatgcacggaggaggaagctgactgttagacctgggttttccagggttgcacgg cttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatgg aatgagctgtga