GENSCAN 1.0    Date run: 4-Nov-116    Time: 14:17:49

Sequence gi568815581r:37987521_38167066 : 179546 bp : 45.26% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1264   1315   52  0  1  123  105    85 0.871  15.02
 1.02 Term +   4485   4498   14  1  2  115   37    10 0.410  -3.04
 1.03 PlyA +   4892   4897    6                               1.05

 2.00 Prom +  12364  12403   40                              -3.26
 2.01 Init +  17840  18009  170  1  2   86   58    92 0.737   4.16
 2.02 Intr +  18040  18112   73  1  1   90   82    60 0.941   5.01
 2.03 Intr +  18275  18360   86  1  2   81   79   118 0.945   8.82
 2.04 Intr +  19644  19683   40  0  1   80  116   -10 0.947  -0.77
 2.05 Intr +  20166  20246   81  2  0   52   92    42 0.607   0.83
 2.06 Intr +  20690  20797  108  1  0  102  101   153 0.993  18.48
 2.07 Intr +  21127  21236  110  0  2  102   82   146 0.996  14.48
 2.08 Intr +  22485  22533   49  0  1   97  117    62 0.987   8.68
 2.09 Intr +  23077  23197  121  0  1   92  110   146 0.994  17.27
 2.10 Intr +  23513  23565   53  0  2  100   38    77 0.306   2.53
 2.11 Intr +  24919  25018  100  0  1   91   89    78 0.399   7.88
 2.12 Intr +  25389  25541  153  1  0   85   80    82 0.365   7.14
 2.13 Intr +  26505  26717  213  1  0  101   49    81 0.263   4.19
 2.14 Term +  28433  28590  158  0  2   95   39   129 0.327   6.80
 2.15 PlyA +  31885  31890    6                               1.05

 3.00 Prom +  49848  49887   40                              -4.06
 3.01 Init +  55700  55702    3  1  0  113   81     0 0.640   1.80
 3.02 Term +  66107  66238  132  1  0  105   43   126 0.585   7.89
 3.03 PlyA +  67177  67182    6                               1.05

 4.15 PlyA -  67453  67448    6                               1.05
 4.14 Term -  69128  68971  158  0  2   95   39   129 0.186   6.80
 4.13 Intr -  71056  70844  213  2  0  101   49    81 0.156   4.19
 4.12 Intr -  72190  72038  153  2  0   85   80    88 0.221   7.74
 4.11 Intr -  72660  72561  100  0  1   91   89    78 0.242   7.88
 4.10 Intr -  74067  74015   53  1  2  100   38    77 0.184   2.53
 4.09 Intr -  74503  74383  121  1  1   92  110   146 0.994  17.27
 4.08 Intr -  75095  75047   49  1  1   97  117    85 0.987  10.98
 4.07 Intr -  76460  76351  110  2  2  102   82   146 0.997  14.48
 4.06 Intr -  76897  76790  108  1  0  102  101   153 0.993  18.48
 4.05 Intr -  77421  77341   81  0  0   52   92    42 0.607   0.83
 4.04 Intr -  77943  77904   40  2  1   80  116   -10 0.947  -0.77
 4.03 Intr -  79312  79227   86  1  2   81   79   118 0.945   8.82
 4.02 Intr -  79547  79475   73  1  1   90   82    60 0.941   5.01
 4.01 Init -  79747  79578  170  1  2   86   58    92 0.737   4.16
 4.00 Prom -  85223  85184   40                              -3.26

 5.00 Prom +  85512  85551   40                              -6.96
 5.01 Init +  89282  89284    3  1  0   71  101     0 0.699  -0.40
 5.02 Intr +  90742  90826   85  0  1  131  116    54 0.957  11.89
 5.03 Intr + 113883 113960   78  1  0   88  110    55 0.973   7.22
 5.04 Intr + 123881 124002  122  0  2   80   64    53 0.764   2.31
 5.05 Intr + 130020 130220  201  2  0   92   84   171 0.980  16.58
 5.06 Intr + 135280 135444  165  0  0  110   55    43 0.878   3.36
 5.07 Intr + 136519 136623  105  0  0   73   93   122 0.984  11.61
 5.08 Intr + 141670 141944  275  0  2   88   58    43 0.065  -2.36
 5.09 Intr + 141975 142047   73  0  1   90   82    60 0.932   5.01
 5.10 Intr + 142210 142295   86  0  2   81   79   118 0.944   8.82
 5.11 Intr + 143579 143618   40  2  1   80  116   -10 0.946  -0.77
 5.12 Intr + 144101 144181   81  1  0   52   92    42 0.606   0.83
 5.13 Intr + 144625 144732  108  0  0  102  101   154 0.993  18.58
 5.14 Intr + 145062 145171  110  2  2  102   82   163 0.997  16.18
 5.15 Intr + 146427 146475   49  0  1   97  117    85 0.987  10.98
 5.16 Intr + 147019 147139  121  0  1   92   44   146 0.646  10.67
 5.17 Intr + 147455 147507   53  0  2  100   38    77 0.359   2.53
 5.18 Intr + 148861 148960  100  0  1   91   89    93 0.475   9.38
 5.19 Intr + 149331 149483  153  1  0   85   80    88 0.435   7.74
 5.20 Intr + 150465 150677  213  1  0  101   49    81 0.367   4.19
 5.21 Term + 152393 152550  158  0  2   95   39   134 0.441   7.30
 5.22 PlyA + 155842 155847    6                               1.05

 6.03 PlyA - 156720 156715    6                               1.05
 6.02 Term - 158542 158522   21  1  0   62   48    22 0.012  -6.09
 6.01 Intr - 175029 174920  110  1  2   49  111   112 0.918   9.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  16777  16726   52  1  1  123  105    85 0.962  15.02
S.002 Init +  80810  80861   52  1  1  123  105    85 0.950  15.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_1|21_aa
METKEPIVYTGSVERAPASWF

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_1|66_bp
atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgcttcctgg
ttctag

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_2|504_aa
MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV
VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR
EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK
NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE
VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISL
GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV
LKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR
PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA
RLLGDGCDPEDRAQASVMPGWNEL

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_2|1515_bp
atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg
tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc
tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg
gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa
aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac
aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg
gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga
gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc
atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa
aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag
cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac
ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtataacccggag
gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag
gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct
gcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctc
gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg
ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc
ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg
ctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggacctgccaccc
ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg
aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg
cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg
gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac
gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca
cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga
tggaatgagctgtga

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_3|44_aa
MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_3|135_bp
atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc
tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct
ttggacctcagctaa

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_4|504_aa
MPTPAWDLRLQRLEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDV
VEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAR
EAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEMKLK
NPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPE
VGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISL
GLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTV
LKHLRASMKKLTRKKGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPR
PIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVA
RLLGDGCDPEDRAQASVMPGWNEL

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_4|1515_bp
atgcccacccctgcttgggacttgagactccagagactggagcagctgtgggccactggg
tctggcccctttttccctgggggcggcggtggaatgggggttacgcagccagccagcatc
tgggagcccggcgagagcggttcaggtgttctccgaagccgccgcgtacggatggacgtg
gtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatgaaatacgaa
aagggacaccgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaac
aacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgg
gaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggatatgctggga
gactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgccc
atgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatgaagttgaaa
aaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgagcacatccag
cgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagggatcgatac
ggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtacaacccggag
gtgggctactgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgag
gaggatgcattctgggcactggtgcagctgctggccagtgagaggcactccctgcaggct
gcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctc
gggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccg
ataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggc
ccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtg
ctcaagcatcttagggcctctatgaagaaactaacaagaaagaagggggacctgccaccc
ccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcggg
aagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcgg
cccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggg
gctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaac
gccattgttaatgcacggaggaggaacctgactgttagacctgggttttccagggttgca
cggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccagga
tggaatgagctgtga

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_5|792_aa
MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL
KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFENVIDRKPYPDDENLVEVKFARTPV
TSTYLVAFVVGEYDFVETRSKDGVCVCVYTPVGKAEQGKFALEEWWTHLWLNEGFASWIE
YLCVDHCFPEYDIWTQFVSADYTRAQELDALDNSHPIEVSVGHPSEVDEIFDAISYSKGA
SVIRMLHDYIGDKVETIQPAGAGVSPFHGIGGQQDSLSRSHCTQVLGKMPTPAWDLRLQR
LEQLWATGSGPFFPGGGGGMGVTQPASIWEPGESGSGVLRSRRVRMDVVEVAGSWWAQER
EDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRREISRK
SKWVDMLGDWEKYKSSRKLIDQAYKGMPMNIRGPMWSVLLNTEEMKLKNPGRYQIMKEKG
KKSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLSHIAA
LFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISLGLTLRLWDVYLV
EGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLT
RKKGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPR
SSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRKLTVRPGFSRVARLLGDGCDPEDR
AQASVMPGWNEL

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_5|2379_bp
atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca
gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat
gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacaggaacctta
aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat
actaccccttctggagaggtgcgctatgctgctgtaacacagtttgagaatgtaattgac
cggaaaccataccctgatgatgaaaatttagtggaagtgaagtttgcccgcacacctgtt
acatctacatatctggtggcatttgttgtgggtgaatatgactttgtagaaacaaggtca
aaagatggtgtgtgtgtctgtgtttacactcctgttggcaaagcagaacaaggaaaattt
gcattagaggaatggtggactcatctttggttaaatgaaggttttgcatcctggattgaa
tatctgtgtgtagaccactgcttcccagagtatgatatttggactcagtttgtttctgct
gattacacccgtgcccaggagcttgacgccttagataacagccatcctattgaagtcagt
gtgggccatccatctgaggttgatgagatatttgatgctatatcatatagcaaaggtgca
tctgtcatccgaatgctgcatgactacattggggataaggtggaaaccattcaacctgct
ggggccggtgtgtccccatttcatggcattgggggacaacaggattctctgtctaggtcc
cactgtactcaagtccttgggaagatgcccacccctgcttgggacttgagactccagaga
ctggagcagctgtgggccactgggtctggcccctttttccctgggggcggcggtggaatg
ggggttacgcagccagccagcatctgggagcccggcgagagcggttcaggtgttctccga
agccgccgcgtacggatggacgtggtagaggtcgcgggcagttggtgggcacaagagcga
gaggacatcattatgaaatacgaaaagggacaccgagctgggctgccagaggacaagggg
cctaagccttttcgaagctacaacaacaacgtcgatcatttggggattgtacatgagacg
gagctgcctcctctgactgcgcgggaggcgaagcaaattcggcgggagatcagccgaaag
agcaagtgggtggatatgctgggagactgggagaaatacaaaagcagcagaaagctcata
gatcaagcgtacaagggaatgcccatgaacatccggggcccgatgtggtcagtcctcctg
aacactgaggaaatgaagttgaaaaaccccggaagataccagatcatgaaggagaagggc
aagaagtcatctgagcacatccagcgcatcgaccgggacgtaagcgggacattaaggaag
catatattcttcagggatcgatacggaaccaagcagcgggaactactccacatcctcctg
gcatatgaggagtacaacccggaggtgggctactgcagggacctgagccacatcgccgcc
ttgttcctcctctatcttcctgaggaggatgcattctgggcactggtgcagctgctggcc
agtgagaggcactccctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgg
gccgaccaagcccagatctctctcgggctcaccctgcgcctgtgggacgtgtatctggta
gaaggcgaacaggcgctgatgccgataacaagaatcgcctttaaggttcagcagaagcgc
ctcacgaagacgtccaggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacc
tgggccagggatgaggacactgtgctcaagcatcttagggcctctatgaagaaactaaca
agaaagaagggggacctgccacccccagccaaacccgagcaagggtcgtcggcatccagg
cctgtgccggcttcacgtggcgggaagaccctctgcaagggggacaggcaggcccctcca
ggcccaccagcccggttcccgcggcccatttggtcagcttccccgccacgggcacctcgt
tcttccacaccctgtcctggtggggctgtccgggaagacacctaccctgtgggcactcag
gcgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaagctgactgtt
agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg
gcacaggccagtgtaatgccaggatggaatgagctgtga

>gi568815581r:37987521_38167066|GENSCAN_predicted_peptide_6|43_aa
XQSEEDVSQFDSKFTRQTPVDSPDDTTLSESANQVFLVYIQKK

>gi568815581r:37987521_38167066|GENSCAN_predicted_CDS_6|132_bp
nagcaatctgaagaggatgtaagtcagtttgattccaagtttacacgtcagacacctgtc
gacagcccagatgacacaactctcagtgaaagtgccaatcaggtgtttttggtttatatc
caaaagaagtga