GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:04:58 Sequence gi568815577f:32926863_33127831 : 200969 bp : 46.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 291 286 6 -0.45 1.03 Term - 1808 1640 169 1 1 20 54 146 0.808 1.75 1.02 Intr - 3851 3795 57 1 0 133 101 26 0.830 6.40 1.01 Init - 25683 25556 128 0 2 69 87 25 0.028 0.13 1.00 Prom - 29166 29127 40 -4.26 2.03 PlyA - 31743 31738 6 1.05 2.02 Term - 39973 39720 254 2 2 84 44 138 0.527 4.80 2.01 Init - 47761 47623 139 1 1 58 68 91 0.367 4.41 2.00 Prom - 53579 53540 40 0.94 3.00 Prom + 64670 64709 40 -2.06 3.01 Init + 80033 80062 30 1 0 73 78 48 0.613 1.83 3.02 Intr + 84707 84817 111 1 0 102 78 91 0.980 10.08 3.03 Intr + 93782 93928 147 1 0 87 1 178 0.193 9.43 3.04 Intr + 96785 96947 163 1 1 64 -35 119 0.031 -3.25 3.05 Intr + 99240 99398 159 1 0 47 93 100 0.436 6.36 3.06 Term + 99939 100972 1034 1 2 55 52 1526 0.982 137.99 3.07 PlyA + 101668 101673 6 1.05 4.10 PlyA - 102067 102062 6 1.05 4.09 Term - 103823 103678 146 0 2 83 48 46 0.074 -1.83 4.08 Intr - 105856 105771 86 0 2 92 55 45 0.105 1.06 4.07 Intr - 106781 106622 160 0 1 48 69 110 0.259 4.15 4.06 Intr - 107122 106944 179 0 2 26 35 164 0.414 4.46 4.05 Intr - 109023 108926 98 0 2 28 100 70 0.139 1.01 4.04 Intr - 114752 114502 251 0 2 145 75 42 0.316 5.76 4.03 Intr - 118616 118593 24 0 0 146 58 5 0.446 1.30 4.02 Intr - 126152 125984 169 2 1 44 48 117 0.140 2.82 4.01 Init - 133524 133462 63 0 0 66 44 98 0.131 4.35 4.00 Prom - 135319 135280 40 -5.66 5.00 Prom + 138436 138475 40 -4.26 5.01 Init + 139536 139589 54 2 0 54 51 87 0.407 2.88 5.02 Term + 143385 144200 816 2 0 70 55 1267 0.432 114.74 5.03 PlyA + 145531 145536 6 1.05 6.00 Prom + 150481 150520 40 -6.56 6.01 Init + 150654 150781 128 2 2 73 87 136 0.782 9.65 6.02 Intr + 152221 152302 82 1 1 38 89 46 0.493 -0.66 6.03 Intr + 159344 159406 63 1 0 16 94 80 0.010 0.31 6.04 Intr + 171482 171588 107 1 2 91 105 42 0.759 5.21 6.05 Intr + 197737 197930 194 1 2 76 80 149 0.756 12.04 6.06 Term + 198202 198362 161 2 2 48 49 78 0.639 -2.00 6.07 PlyA + 200321 200326 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 153579 153725 147 2 0 95 48 64 0.840 1.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_1|117_aa MVTHLQSRSSKKATEAFSPGISIGNLFAFKVPSPPIEMKGELKWDHLVAGKQAQVPTDST LCKRVRENEEVIFGKPDPDQKDFIKEMRHNLGLKLRDMQEGERDDVRTKRKNNSLGG >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_1|354_bp atggtcacccaccttcagtctcgcagcagcaagaaagcaacagaagccttctctccagga attagtataggaaacctgtttgcttttaaagttccgagtcctcccatagagatgaaaggg gaattaaaatgggaccacctagttgcaggaaaacaagctcaggttcctactgattctaca ttatgtaagagggtgagagagaacgaagaagttatctttgggaagcccgaccctgatcag aaagacttcatcaaggagatgcggcataacctggggcttaaacttagggatatgcaagaa ggagaaagggatgatgtgagaacaaagagaaaaaacaactcactgggaggctga >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_2|130_aa MTSQPDVQVRFMSVPTWFGEIPLEIPVLVVSVTISEVPCSQSAIDTAIMLFLSHMEDDCD SGEKLSHAMIRSGVLLLALKSQVTTTVCGGQTAGGGSQEVCQSFPGMLGFREEVNGFEAP VPITNSKSPH >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_2|393_bp atgacttcccagcctgatgtgcaagtcagattcatgagtgttccaacatggtttggagaa attccacttgaaattccagtcttggtcgtctctgtgactatcagcgaggtgccctgttct caatcggccattgacactgcaatcatgctattcctaagtcatatggaagatgactgtgat tctggggagaagctaagtcatgctatgatcaggagtggggtcctgctcctggctctgaaa tcccaagtcacaaccacagtgtgcggtggtcagacagctggtggaggaagtcaggaggtc tgccagtcctttcccggaatgctgggcttccgggaagaagtgaatggctttgaggctcct gtgcccatcaccaactccaaaagtccccattga >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_3|547_aa MIADDKPAAMAGESAGQSSESGVGANFFGITFQTTETLMSTGHLNGAECKAGPGTVKTLA VEEEASRLWRKPDPYNTRREPDLRGGALDATGAQGGPLDRARKEPETACSHRRKSVAGQT PVGSNKKRNRVPGHVPAAGLTSAAATKERAQCSDTGGLDPYSAGHFSRATLLFCPPHTHR CISLTKSEKSERQQLFLPKPQSAVFGSEGRRTLRKLRRLSSPGAMDSDASLVSSRPSSPE PDDLFLPARSKGSSGSAFTGGTVSSSTPSDCPPELSAELRGAMGSAGAHPGDKLGGSGFK SSSSSTSSSTSSAAASSTKKDKKQMTEPELQQLRLKINSRERKRMHDLNIAMDGLREVMP YAHGPSVRKLSKIATLLLARNYILMLTNSLEEMKRLVSEIYGGHHAGFHPSACGGLAHSA PLPAATAHPAAAAHAAHHPAVHHPILPPAAAAAAAAAAAAAVSSASLPGSGLPSVGSIRP PHGLLKSPSAAAAAPLGGGGGGSGASGGFQHWGGMPCPCSMCQVPPPHHHVSAMGAGSLP RLTSDAK >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_3|1644_bp atgatcgctgatgataagccagctgcgatggctggtgagtcagctggacagtcatctgag tctggagttggggccaacttctttggcatcacattccagacaacagaaacactgatgagc acagggcatctgaatggggccgaatgcaaagcaggtccaggcactgtgaagaccctggcg gtggaggaagaggcttcccggctgtggaggaagccagacccttacaacacaagacgagaa ccagacctgcgtgggggagctctggatgctacaggggctcaaggagggccactggaccga gcgcgcaaagaacctgagaccgcttgctctcaccgccgcaagtcggtcgcaggacagaca ccagtgggcagcaacaaaaaaagaaaccgggttccgggacacgtgccggcggctggacta acctcagcggctgcaaccaaggagcgcgcacaatgctccgatacagggggtctggatccc tactctgcgggccatttctccagagcgactttgctcttctgtcctccccacactcaccgc tgcatctccctcaccaaaagcgagaagtcggagcgacaacagctctttctgcccaagccc cagtcagctgttttcgggtccgagggaaggaggaccctgcgaaagctgcgacgactatct tcccctggggccatggactcggacgccagcctggtgtccagccgcccgtcgtcgccagag cccgatgacctttttctgccggcccggagtaagggcagcagcggcagcgccttcactggg ggcaccgtgtcctcgtccaccccgagtgactgcccgccggagctgagcgccgagctgcgc ggcgctatgggctctgcgggcgcgcatcctggggacaagctaggaggcagtggcttcaag tcatcctcgtccagcacctcgtcgtctacgtcgtcggcggctgcgtcgtccaccaagaag gacaagaagcaaatgacagagccggagctgcagcagctgcgtctcaagatcaacagccgc gagcgcaagcgcatgcacgacctcaacatcgccatggatggcctccgcgaggtcatgccg tacgcacacggcccttcggtgcgcaagctttccaagatcgccacgctgctgctggcgcgc aactacatcctcatgctcaccaactcgctggaggagatgaagcgactggtgagcgagatc tacgggggccaccacgctggcttccacccgtcggcctgcggcggcctggcgcactccgcg cccctgcccgccgccaccgcgcacccggcagcagcagcgcacgccgcacatcaccccgcg gtgcaccaccccatcctgccgcccgccgccgcagcggctgctgccgccgctgcagccgcg gctgtgtccagcgcctctctgcccggatccgggctgccgtcggtcggctccatccgtcca ccgcacggcctactcaagtctccgtctgctgccgcggccgccccgctggggggcgggggc ggcggcagtggggcgagcgggggcttccagcactggggcggcatgccctgcccctgcagc atgtgccaggtgccgccgccgcaccaccacgtgtcggctatgggcgccggcagcctgccg cgcctcacctccgacgccaagtga >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_4|391_aa MKDAYVNVCLAISGYQERPTVGKFGHRDRYVWKGDNMKTHGEDSPLEAKERGLEQILPSR KEPTQLTAGFYTSGFQNWSYSVAQAGKIYLSTLSVPYFMLSLVLYRLTSASDLLCIPSQA MGVYRLTSASDLLCIPSQAMGVLTGLPGDTMLIHTVASTEGMLSVTDRQSMGPPRPQCNA KDIPRELTVHHAQYQAPQGMQRRFWRALRVSFLGDLQPFPQPVDGLWPHRRCNVAKGKLR ASKPAASILADAGRAGVSSRWEPCQIVLVDRARFQGLQTEGPLACLQGPRCAVKEGLRSP WQESAHSQQPRDWRSSSLQSSTSNLFHRQTASSNHWGPRGEIRNTKHTQSLHQYTCLAVG VEEPKLQSRPLPPPARFSAPTGLRSSLLESL >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_4|1176_bp atgaaggatgcctacgtgaacgtctgcctagcaatatcaggttaccaagaaaggccaaca gtggggaaatttggacacagagacagatacgtatggaagggagataacatgaagacccat ggagaagacagccctctagaagccaaggagaggggcctggaacagatccttccctcacgg aaggaaccaacacaattgacagctggattttatacttctggcttccagaactggtcttat tctgttgcccaggctggtaaaatttacctttccacgctgtcagtgccctacttcatgtta tcactagtactttacagactaaccagtgccagtgacttgctgtgtattccttcccaggca atgggcgtttatagactaaccagtgccagtgacttgctgtgtattccttcccaggcaatg ggcgttttgacagggctccctggagacactatgcttatccacacagtggcatcaacagaa ggcatgctgagtgtgacagacaggcagagcatgggccctcctcgtccacaatgcaatgca aaggatattcccagagaactaaccgtgcatcatgctcagtaccaggcaccccagggcatg caaaggcggttctggagagctctgcgtgtctccttcttgggggacctccaaccgttccca caaccagtggacggattgtggccgcaccgacgctgcaatgtcgccaaaggaaaactgcgc gcgtccaagccagctgcttcaatcctggcggatgcgggccgtgccggggtctccagccgg tgggagccctgccagatcgtcctggtggaccgcgcccgctttcagggtttgcaaactgaa ggcccgctcgcgtgtctgcagggccctcggtgtgctgtaaaggagggtctgaggtccccc tggcaggagagcgcgcactcgcagcagccgcgggactggcgcagttcctcgctgcagtcc tccacctccaacctcttccacagacaaacagccagtagcaatcactggggccctagagga gagatacggaacaccaagcacactcagagcctgcaccagtacacctgcctggcagtgggt gtggaagagccaaaactgcaatcaagacccctacccccaccggccaggttctcagcgccc actggtctacgttccagcctcctggagtccctctga >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_5|289_aa MGSSIPKESSLSKNNYRRMYYAVSQARVNAVPGTMLRPQRPGDLQLGASLYELVGYRQPP SSSSSSTSSTSSTSSSSTTAPLLPKAAREKPEAPAEPPGPGPGSGAHPGGSARPDAKEEQ QQQLRRKINSRERKRMQDLNLAMDALREVILPYSAAHCQGAPGRKLSKIATLLLARNYIL LLGSSLQELRRALGEGAGPAAPRLLLAGLPLLAAAPGSVLLAPGAVGPPDALRPAKYLSL ALDEPPCGQFALPGGGAGGPGLCTCAVCKFPHLVPASLGLAAVQAQFSK >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_5|870_bp atgggatcaagcatccccaaggagtcctccctgtccaagaacaactaccgaaggatgtac tatgcggtttcccaggcgcgcgtgaacgcggtccccgggaccatgctgcggccacagcgg cccggagacttgcagctcggggcctccctctacgagctggtgggctacaggcagccgccc tcctcctcctcctcctccacctcctccacctcctccacttcctcctcctccacgacggcc cccctcctccccaaggctgcgcgcgagaagccggaggcgccggccgagcctccaggcccc gggcccgggtcaggcgcgcacccgggcggcagcgcccggccggacgccaaggaggagcag cagcagcagctgcggcgcaagatcaacagccgcgagcggaagcgcatgcaggacctgaac ctggccatggacgccctgcgcgaggtcatcctgccctactcagcggcgcactgccagggc gcgcccggccgcaagctctccaagatagccacgctgctgctcgcccgcaactacatccta ctgctgggcagctcgctgcaggagctgcgccgcgcgctgggcgagggcgccgggcccgcc gcgccgcgcctgctgctggccgggctgcccctgctcgccgccgcgcccggctccgtgctg ctggcgcccggcgccgtaggaccccccgacgcgctgcgccccgccaagtacctgtcgctg gcgctggacgagccgccgtgcggccagttcgctctccccggcggcggcgcaggcggcccc ggcctctgcacctgcgccgtgtgcaagttcccgcacctggtcccggccagcctgggcctg gccgccgtgcaggcgcaattctccaagtga >gi568815577f:32926863_33127831|GENSCAN_predicted_peptide_6|244_aa MVVMVVVVVVVVVVLVLFHLLASGGFTHMLPLQSWFPTAMGRRGHKLTACHSPPAERHLG VAPNSINRFVSGDVTGDVAEMQCREGTGYRQRFSEFDLPLEKNRLIICKDVAIRCQTNPI EEKGWESKGGQDNEESIGKAEKTGYIACQAPGILRRCFSCDSGHWYGKRLTIPGQNSEGG GMFLQKRSRGSGKCLWSGAMETRATDCEATQPSGPSSCDRKSLFCPVPPEPAAQTTGGLH YVSK >gi568815577f:32926863_33127831|GENSCAN_predicted_CDS_6|735_bp atggtggtgatggtggtggtggtggtggtggtggtcgtggtgctggttttgttccatttg cttgcttccggtggcttcactcacatgcttcctctgcagtcctggttccccacagccatg ggcagaagaggccacaagctgactgcatgtcattctccaccagcagagcgtcacctcggg gtagctccaaacagtatcaaccggtttgtgtcaggtgatgtaacaggcgatgtggctgaa atgcagtgcagagaagggaccggctaccgccagagattttctgagtttgatttgcccctt gaaaagaatagattgatcatctgtaaagacgttgcaatcagatgtcaaaccaatccaatc gaggaaaaaggctgggagagcaaaggaggacaggacaatgaagaatctattggcaaagca gagaaaacaggctatatcgcttgccaagcccctggaattctgagaaggtgcttctcctgt gatagcggtcactggtacggcaagaggttgacaatccctggccaaaactccgaagggggt ggcatgtttctgcagaaacgcagcagagggtcaggcaagtgtctgtggtcaggagccatg gagactcgtgccacggactgcgaagccacccagccatctggcccaagctcctgtgaccgg aagagcttgttttgccctgttcccccagagccagcagcgcagaccacagggggcttgcat tatgttagcaaatga