GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:05:04 Sequence gi568815581f:36111142_36312566 : 201425 bp : 45.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2213 2368 156 1 0 75 77 60 0.495 3.81 1.02 Intr + 6584 6637 54 1 0 108 72 30 0.183 2.58 1.03 Intr + 7851 7926 76 2 1 22 113 55 0.327 0.49 1.04 Intr + 8147 8204 58 0 1 109 101 42 0.373 5.54 1.05 Term + 10040 10154 115 2 1 53 44 78 0.232 -2.06 1.06 PlyA + 11545 11550 6 1.05 2.00 Prom + 12442 12481 40 -1.56 2.01 Init + 18343 18454 112 0 1 68 72 69 0.526 3.67 2.02 Intr + 36439 36554 116 2 2 68 109 20 0.083 2.27 2.03 Intr + 38439 38570 132 2 0 91 72 21 0.021 1.64 2.04 Intr + 45810 45860 51 2 0 62 79 51 0.049 0.70 2.05 Term + 50472 50603 132 2 0 105 43 104 0.247 5.69 2.06 PlyA + 51542 51547 6 1.05 3.14 PlyA - 51818 51813 6 1.05 3.13 Term - 53494 53337 158 2 2 95 39 129 0.728 6.80 3.12 Intr - 55422 55210 213 1 0 101 49 71 0.634 3.19 3.11 Intr - 56555 56403 153 0 0 85 80 82 0.818 7.14 3.10 Intr - 57025 56926 100 1 1 91 89 78 0.700 7.88 3.09 Intr - 58434 58382 53 1 2 100 38 77 0.764 2.53 3.08 Intr - 58870 58750 121 1 1 92 110 146 0.994 17.27 3.07 Intr - 59462 59414 49 1 1 97 117 62 0.988 8.68 3.06 Intr - 60823 60714 110 1 2 92 82 150 0.999 13.88 3.05 Intr - 61260 61153 108 0 0 107 101 162 0.994 19.88 3.04 Intr - 61784 61704 81 2 0 52 92 42 0.599 0.83 3.03 Intr - 62306 62267 40 1 1 80 116 -10 0.935 -0.77 3.02 Intr - 63675 63590 86 0 2 81 79 118 0.921 8.82 3.01 Init - 63909 63838 72 0 0 71 82 50 0.673 3.77 3.00 Prom - 64502 64463 40 -14.47 4.00 Prom + 64728 64767 40 -14.86 4.01 Init + 65173 65224 52 0 1 123 105 85 0.997 15.02 4.02 Intr + 67589 67621 33 0 0 80 82 35 0.546 0.29 4.03 Intr + 68310 68448 139 1 1 70 59 125 0.809 7.32 4.04 Intr + 71313 71437 125 0 2 94 84 22 0.735 2.63 4.05 Intr + 71491 71835 345 2 0 29 59 283 0.436 14.86 4.06 Term + 76014 76117 104 1 2 54 42 75 0.212 -2.06 4.07 PlyA + 77490 77495 6 1.05 5.04 PlyA - 82876 82871 6 1.05 5.03 Term - 84235 84145 91 0 1 128 55 78 0.976 5.69 5.02 Intr - 84771 84657 115 1 1 126 103 111 0.999 15.91 5.01 Init - 85532 85457 76 2 1 82 94 103 0.940 9.66 5.00 Prom - 98865 98826 40 -0.66 6.00 Prom + 99435 99474 40 -7.66 6.01 Init + 100001 100076 76 1 1 68 108 73 0.943 6.59 6.02 Intr + 108622 108798 177 2 0 51 58 85 0.166 1.69 6.03 Intr + 109139 109401 263 0 2 69 77 138 0.359 8.01 6.04 Intr + 130558 130647 90 0 0 110 0 65 0.075 0.09 6.05 Term + 138432 138563 132 2 0 105 43 163 0.842 11.59 6.06 PlyA + 139502 139507 6 1.05 7.14 PlyA - 139778 139773 6 1.05 7.13 Term - 141453 141296 158 1 2 95 39 96 0.006 3.50 7.12 Intr - 143381 143169 213 0 0 101 49 81 0.005 4.19 7.11 Intr - 144496 144344 153 2 0 85 80 82 0.020 7.14 7.10 Intr - 144966 144867 100 0 1 91 89 78 0.022 7.88 7.09 Intr - 146372 146320 53 0 2 100 38 72 0.016 2.03 7.08 Intr - 146808 146688 121 0 1 92 110 146 0.994 17.27 7.07 Intr - 147400 147352 49 0 1 97 117 62 0.987 8.68 7.06 Intr - 148758 148649 110 0 2 102 82 146 0.996 14.48 7.05 Intr - 149195 149088 108 2 0 99 101 160 0.975 18.88 7.04 Intr - 149719 149639 81 1 0 52 92 42 0.602 0.83 7.03 Intr - 150241 150202 40 0 1 80 116 -10 0.939 -0.77 7.02 Intr - 151610 151525 86 2 2 81 79 118 0.923 8.82 7.01 Init - 151844 151773 72 2 0 71 82 55 0.663 4.27 7.00 Prom - 157520 157481 40 -3.26 8.00 Prom + 157809 157848 40 -6.96 8.01 Init + 161598 161600 3 2 0 71 101 0 0.685 -0.40 8.02 Intr + 163058 163142 85 1 1 131 116 54 0.961 11.89 8.03 Intr + 186168 186245 78 1 0 88 110 55 0.924 7.22 8.04 Term + 196156 196298 143 2 2 80 38 64 0.242 -1.31 8.05 PlyA + 197426 197431 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 146372 146215 158 0 2 100 48 123 0.977 7.60 S.002 Init + 153108 153159 52 2 1 123 105 85 0.939 15.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_1|152_aa GYINSPVLCHNLIQRDLEHFLLLQDITLVHYNDDIMMTRSSEQEVANTLDLLAVSMDDDD CWQNRGVVVKNSSPQAQATDWYWSMTCYEPSHTAEDGTIKLQENKLRAPTDSTLCTMYNS KDMEPTQIAINDRRDKENVAYKHPGILRSHKK >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_1|459_bp gggtatatcaactctccggttttgtgtcataatcttattcagagagaccttgaacacttt ttgcttctgcaagatatcacactggtccattacaatgatgacattatgatgactagatcc agtgaacaagaagtagcaaacacactggacttattggcagtgagcatggatgatgatgat tgctggcaaaacaggggagtggtagtgaagaacagcagtccccaggcccaggccacagat tggtactggtccatgacctgttacgaaccgagccacacagcagaagatgggaccatcaag ttgcaggaaaacaagctcagggctcccactgattctacattatgcactatgtacaatagc aaagacatggaaccaacccaaattgccatcaacgatagaagggataaagaaaatgtggca tataaacaccctggaatactacgcagccataaaaaatga >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_2|180_aa MEEVRVSWIEDQTSHNIPLSQNLIKALPVFNSVKAERSICVHQCRKDGDYKEGKCCIRKG CWHIFVPPPQNLIPKQTHAFLLSAKSELEKTNSVPVGFPIFGWLAGLPPQPLSRLLSRSW ASVCISEEKTGTVEGKTAVCEMFHVRGKQHIQIPKLSTSSVTRHLHHFRLMQDSQPLDLS >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_2|543_bp atggaggaagttcgagtgtcctggatagaagatcaaaccagccacaacattcccttaagc caaaacctaatcaaagccctacctgtcttcaactctgtcaaggctgagagaagcatctgt gtgcaccaatgcagaaaagacggggactacaaggaaggaaaatgttgcatcaggaagggc tgctggcacatctttgttccccctccccaaaacctcatccccaagcagacccatgcgttc ctgctctctgccaagtccgaactggagaaaacaaattctgtcccagtggggtttcccatc tttggctggctggctggacttcctccacaaccactctctcgccttctaagcaggagctgg gcttctgtgtgcatcagtgaagagaagacggggactgtggaagggaaaacagcagtctgt gagatgtttcatgtccgaggcaaacagcacattcagatccccaagctctccacctccagt gtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcctttggacctcagc taa >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_3|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNIEEM KLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGRKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_3|1344_bp atggacgtggtagaggtcgcgggtagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacattgaggaaatg aagttgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacataagcgggacattaaggaagcatatgttcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcaggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaacgccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_4|265_aa METKEPIVYTGSVERAPGPFGALATSSLDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG EHGTPHTPHFEDAGRCERVDASTTCLHMCVCTCIVLAHLRVSRESGAQHVLAPAGTLSRR RGWGPYLRSLRIQHRSSPVQPPAKPPEDEPDAEGYEWTIAVSFQLADFAPLHWLRLDGPG FGVLSVPPHRVVAFSLGITPSRGAADPKSSMSSPRPAHRPRPRPLPRPDQRALEQCPLEK AERSHPPCTRAAKNSHVPEYVSAAV >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_4|798_bp atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt ggagccctggccacgtcctccctggatcacacgatggaagctggcagccccgtgggcacc actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga gagcacggcacaccccacacccctcattttgaggatgctgggaggtgtgagcgagtggat gcttccactacgtgtctgcacatgtgtgtgtgcacttgcattgtccttgcacacctgcgt gtgtcccgtgagagcggagcccagcatgtgctggcacctgcaggaacattgtcacgacga cgaggatggggaccatatcttcggagcctgaggatccaacacaggtccagccctgtgcag ccgcccgccaagccgccagaggacgagccggacgccgaaggctacgagtggacgattgca gttagtttccaactcgccgacttcgcgcccctccactggctccggcttgatggtcccggc ttcggggtgctctcggtccctccccatcgcgtcgtcgctttctcccttggcataaccccc agccgcggggccgcagaccctaagagctccatgagctctccgcgccctgcccaccggccc cggccccgacccctccccagaccggaccagagagctttggaacaatgtcctctagaaaaa gctgaacgtagccacccaccttgtacgagagccgcaaagaatagccatgtgcctgaatat gtgtcagccgctgtgtga >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_5|93_aa MQVSTAALAVLLCTMALCNQVLSAPLAADTPTACCFSYTSRQIPQNFIADYFETSSQCSK PSVIFLTKRGRQVCADPSEEWVQKYVSDLELSA >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_5|282_bp atgcaggtctccactgctgcccttgccgtcctcctctgcaccatggctctctgcaaccag gtcctctctgcaccacttgctgctgacacgccgaccgcctgctgcttcagctacacctcc cgacagattccacagaatttcatagctgactactttgagacgagcagccagtgctccaag cccagtgtcatcttcctaaccaagagaggccggcaggtctgtgctgaccccagtgaggag tgggtccagaaatacgtcagtgacctggagctgagtgcctga >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_6|245_aa MKLCVTVLSLLVLVAAFCSLALSAPIKLGAYGGQVINGVLAQVQLTVGPVGPWIHPVVIS PMTECVIGIDIFNSWQNPYIGSLTGIDLANAFFSIPVHKAHQKQFAFSWQGQQYTFTVLP HGYINSPALRHNLIQRDLDHFLLLKDITLVHYNDIMMIGSSEQKVANTLDLLVETELKLI CGDVLDVLDKHLIPAATTGKSKAVCEMFDVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQ PLDRS >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_6|738_bp atgaagctctgcgtgactgtcctgtctctcctcgtgctagtagctgccttctgctctcta gcactctcagcaccaattaaattaggggcttatggaggtcaggtaattaatggagtttta gctcaggtccaacttacagtgggcccagtgggtccctggattcatcctgtggtcatttcc ccaatgacagaatgtgtaattggcatagatatattcaacagctggcagaacccctacatt ggctccctgactggtattgacttagcaaatgcctttttctccattcctgtccataaggcc caccagaagcaatttgccttcagctggcaaggtcagcaatatacctttactgtcctacct cacgggtatatcaactctccagctttgcgtcataatcttattcagagagaccttgatcac tttttgcttctgaaagatatcacactggtccattacaatgacattatgatgattggatcc agtgagcaaaaagtagcaaacacactggacttattggttgagactgagctaaagttaatc tgtggcgacgttctggatgtactggacaaacacctcattccagcagctacaactggcaag tccaaggcagtctgtgagatgtttgatgtccgaggcaaacagcacattcagatccccaag ctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacag cctttggaccgcagctaa >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_7|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM KMKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAQAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNSIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_7|1344_bp atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg aagatgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccaggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaactccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581f:36111142_36312566|GENSCAN_predicted_peptide_8|102_aa MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFEVWVILL >gi568815581f:36111142_36312566|GENSCAN_predicted_CDS_8|309_bp atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacgggaacctta aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat actaccccttctggagaggtgcgctatgctgctgtaacacagtttgaggtatgggttatt cttctctaa