GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:52:45

Sequence gi568815585r:20041984_20243288 : 201305 bp : 43.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -    679    478  202  1  1   66  121   144 0.623  14.54
 1.00 Prom -   5383   5344   40                              -1.36

 2.00 Prom +   6903   6942   40                              -6.26
 2.01 Init +  10323  10328    6  2  0   74   98     0 0.566   0.51
 2.02 Intr +  16592  16721  130  1  1  114   93    37 0.946   7.07
 2.03 Intr +  17464  17579  116  2  2   92   95    48 0.985   6.07
 2.04 Intr +  19070  19241  172  1  1   25   53   189 0.991   8.62
 2.05 Intr +  20863  20988  126  2  0   39   63   125 0.846   5.75
 2.06 Intr +  22468  22562   95  2  2   40   83    82 0.815   2.58
 2.07 Intr +  24868  25036  169  0  1   50   89    78 0.975   3.62
 2.08 Intr +  41673  41793  121  1  1   37  113    93 0.990   6.25
 2.09 Term +  43839  44031  193  0  1   80   42   146 0.993   6.09
 2.10 PlyA +  44484  44489    6                               1.05

 3.05 PlyA -  44955  44950    6                               1.05
 3.04 Term -  49529  49509   21  2  0  103   39     1 0.244  -4.99
 3.03 Intr -  51209  51051  159  2  0   69   89    71 0.247   5.48
 3.02 Intr -  57519  57326  194  1  2   53    7   103 0.116  -2.09
 3.01 Init -  59954  59810  145  2  1   65   32   147 0.510   5.31
 3.00 Prom -  65643  65604   40                              -3.76

 4.06 PlyA -  66193  66188    6                               1.05
 4.05 Term -  71288  71208   81  2  0  125   48    67 0.289   4.09
 4.04 Intr -  76640  76445  196  1  1   78   56   101 0.356   5.32
 4.03 Intr -  77247  77165   83  0  2  105   60    20 0.104  -0.66
 4.02 Intr -  85692  85593  100  2  1   97   34    63 0.137   1.91
 4.01 Init -  86966  86782  185  2  2   82   80    89 0.273   6.35
 4.00 Prom -  93130  93091   40                              -4.96

 5.00 Prom +  94531  94570   40                              -3.36
 5.01 Init +  94752  94862  111  2  0   58   81   106 0.708   5.77
 5.02 Term +  95575  95670   96  0  0  105   44    35 0.782  -1.23
 5.03 PlyA +  96850  96855    6                               1.05

 6.09 PlyA -  97086  97081    6                               1.05
 6.08 Term - 101317  99998 1320  1  0   20   39  2355 0.877 215.37
 6.07 Intr - 101484 101323  162  0  0   45   66   147 0.787   8.37
 6.06 Intr - 102649 102521  129  1  0  100   91    12 0.682   3.59
 6.05 Intr - 132144 132029  116  1  2  100   68     2 0.050  -0.43
 6.04 Intr - 134438 134412   27  0  0   99   96    15 0.729   1.49
 6.03 Intr - 135348 135208  141  1  0   96   71   163 0.951  15.72
 6.02 Intr - 135830 135701  130  2  1   50  -11    68 0.101  -6.63
 6.01 Init - 137479 137111  369  1  0   92   35   267 0.144  18.79
 6.00 Prom - 137878 137839   40                              -5.56

 7.03 PlyA - 138616 138611    6                               1.05
 7.02 Term - 139755 139552  204  0  0   61   55   145 0.965   5.87
 7.01 Init - 142661 142608   54  2  0   90   59    82 0.926   4.85
 7.00 Prom - 145473 145434   40                              -6.66

 8.02 PlyA - 145580 145575    6                               1.05
 8.01 Sngl - 147598 146918  681  1  0   62   42  1047 0.992  93.69
 8.00 Prom - 179332 179293   40                               0.94

 9.02 PlyA - 180010 180005    6                               1.05
 9.01 Sngl - 181497 180712  786  0  0   80   41   655 0.958  55.95
 9.00 Prom - 184555 184516   40                              -6.86

10.00 Prom + 184746 184785   40                              -6.56
10.01 Init + 185872 185959   88  0  1   76   59    20 0.490  -1.30
10.02 Term + 189421 189689  269  2  2  142   41   147 0.689  11.06
10.03 PlyA + 193297 193302    6                               1.05

11.03 PlyA - 194673 194668    6                               1.05
11.02 Term - 195857 195741  117  2  0   67   48    73 0.447  -0.26
11.01 Init - 199013 198807  207  2  0   81   58   118 0.271   6.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  25256  25407  152  0  2   78   88    49 0.884   3.78
S.002 Init +  40989  41049   61  2  1   64   67    34 0.817   0.31
S.003 Term - 126722 126553  170  0  2   54   37   154 0.902   4.94
S.004 Init - 177183 177111   73  0  1   81   98    80 0.974   7.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_1|68_aa
MKKNSHKNPDNSKSQSAFFLPNNCTTSPVRVLNQAEMAEITEIEFRIWKGMKITEKQEHT
ETQSKEQX

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_1|204_bp
atgaaaaagaactcgcacaagaaccctgacaactcaaaaagccagagtgccttctttctt
ccaaataactgtactacctctccagtaagggttctgaatcaggcagagatggctgaaata
acagaaatagaattcagaatatggaaaggcatgaagatcactgagaagcaggagcacact
gaaactcaatccaaggaacaagnn

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_2|375_aa
MTGSAPPPSPTPNKEMKNKAVLCKPLTMTKATYCKPHMQTKSCQTDDTWRTEYVPVPIPV
PVYIPVPMHMYSQNIPVPTTVPVPVPVPVFLPAPLDSSEKIPAAIEELKSKVSSDALDTE
LLTMTDMMSEDEGKTETTNINSVIIETDIIGSDLLKNSDPETQSSMPDVPYEPDLDIEID
FPRAAEELDMENEFLLPPVFGEEYEEQPRPRSKKKGAKRKAVSGYQSHDDSSDNSECSFP
FKYTYGVNAWKHWVKTRQLDEDLLVLDELKSYKITTGKRKHEDDEPVFEQIENTANPSRC
PVKMFECYLSKSPQNLNQRMDVFYLQPECSSSTDSPVWYTSTSLDRNTLENMLVRVLLVK
DIYDKDNYELDEDTD

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_2|1128_bp
atgacaggttcagcaccacccccttctccaacacctaacaaagagatgaagaacaaagca
gttctttgcaaacctttaacaatgacaaaagctacttactgtaaacctcacatgcagacc
aaatcttgtcagacagatgatacttggaggacagaatatgttccagtgcctatccctgtg
cctgtgtatatcccagttcctatgcacatgtacagtcagaatattcctgttcctactaca
gttcctgttcctgtgccagttcctgtttttctgcctgctccattggacagcagtgagaag
attcctgcagcaattgaggagctaaaaagcaaggtttcttcagatgctcttgatacagag
ttgcttacaatgacggatatgatgagtgaagacgaggggaaaacagagacaaccaacatc
aacagtgtaattattgaaacagatataattggttcagaccttttgaagaactctgaccca
gagacacagtccagcatgcctgatgtaccatatgaaccagatttggatatcgaaatagat
tttcccagagctgctgaggagcttgatatggaaaatgaatttttattaccacctgttttt
ggcgaagaatatgaggaacagcccagacctcgatctaaaaaaaagggagccaagagaaag
gctgtatcaggataccagtctcatgatgatagttctgacaattcagaatgcagctttcct
ttcaaatatacgtatggcgtaaatgcatggaaacactgggtcaaaactaggcaacttgat
gaagatcttctggtattagatgagttaaaatcttataaaattactactggaaaaagaaaa
catgaagatgatgagccagtatttgaacaaattgaaaacacagccaatccttccagatgt
cctgtgaaaatgtttgaatgctacttgtctaaaagtccacagaatcttaatcagaggatg
gatgttttttatttgcaaccagaatgctctagttctacagatagccctgtctggtatacg
tctacttcactggaccgaaacaccttggaaaatatgcttgtacgggttcttctagtaaaa
gatatttatgataaagacaattatgaactggatgaagacacagactaa

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_3|172_aa
MGQVFKGWWAASASPWFLWLYDKPSLAFEDLIPALCSAGPKLLSTKLRGKLPPGVLLAAS
PRKQEKQGLSPKKTLPSCNVFPCPLLTRLAIATAGKGEILTGSSSSSTKNGRRYLQNILN
AASWCPRPKILTLWAFTEKADQPLSVKNGEVFMCGRKGMGQQSSCQNGNRLV

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_3|519_bp
atggggcaggtgttcaaaggatggtgggcagcctcagcttcaccctggttcctgtggctg
tacgacaaaccgtcactcgcctttgaagacctcatccccgcgctctgtagtgctggcccc
aagctcctctctaccaagctgagaggaaaactgccccctggggttctgctggcagcctct
ccacgaaaacaagaaaaacaaggcctgagccccaagaaaacccttccctcttgcaatgtc
ttcccgtgccctttactgacaaggcttgccattgcgacggccggcaaaggggaaatactt
acagggtccagctccagtagcacaaagaacggcagaaggtacttacagaatatcctcaat
gctgcctcttggtgtccaagacctaaaatactgactctctgggcctttacagaaaaagct
gaccaacccctgagtgtaaagaatggagaagtgttcatgtgtggccggaaaggtatggga
cagcaatccagctgccagaatgggaatagactggtgtaa

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_4|214_aa
MGNRAAARWPPPGKGPQLEGVEWSRGCFNKIPQMRWFTRRTVQLHSHRVLGVRNRRHLAA
VRARLLSNFPSPMDTRGGILMFQLGGTRENQDLVQKKMRRPQSCRRLLRKALCTQEWKRC
GSGPPGTNAKSLEAQLEKTGGPPGSLPGAPGSTRPSAHSEQRDGDPAEDPGHLGRPGAQD
AARAVPSKLSVNSSAPIIRVEMQCGRVQEHGLRT

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_4|645_bp
atggggaaccgggctgctgccaggtggccgccgccgggtaaaggaccacagctggagggg
gtggagtggtcccgtggctgctttaacaaaatcccacagatgcggtggtttacacgccgc
acagtacagctgcattcccacagagttctgggcgtccggaatcggcgccacctggctgca
gtcagggcaaggttgctcagcaacttccccagccccatggacacccggggcggcatcctg
atgtttcagttgggaggaaccagagagaatcaagacttggtccagaaaaaaatgcgccgg
ccccagagctgccgccgactgttaaggaaggccctttgcacccaagaatggaagcggtgt
gggtcgggcccgccaggcacaaacgcgaagtccctggaagcgcagctcgaaaagacaggc
ggccctccaggctccctccctggcgctccgggctccacgcggccctcggcgcactcggag
cagagggatggggaccccgccgaggatccaggccacctgggcaggccgggtgctcaggac
gcagcccgggctgtcccatcgaagctttctgtcaactcttctgctcccatcatcagagtt
gagatgcagtgcggtagagtccaggaacatgggcttcggacctga

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_5|68_aa
MLWEKRGAGFSHLSMTTTAHLDSGVELSTVQDRYGFLAATVFPKLSIRTGVCTCRQYGKL
PPICVWKS

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_5|207_bp
atgctgtgggagaagagaggtgcaggattcagccacttgtccatgaccacaacggctcac
ctggactcgggcgtggaactgtccaccgtacaggatcgttacgggttcctggccgcgact
gtcttccccaaactgtccatcaggacaggtgtttgcacttgtcggcaatatgggaaactc
ccaccaatttgtgtttggaagagctga

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_6|797_aa
MVNSAMFYDIAEPLSHISSELAADKVPNIAGNIHAVRSGENGFGHKGSCFHRIIPGLMCQ
GGDFTRHHDTGSKSIYGQKFDGENFILKHSGPGILCMANAGPDTNGSQVFICTAKTEWWA
SSQLERRARPCRQARAGRADQAQRRSPTEGQALLGSEAWAPVTGPWAWDFAAGTVLHGSK
MCLCFRHVPARGGHIGACLQTGGLRGEGGNQVARSLPVYQVSVWHCLKAWGWWGVLVHPW
LQPKLPGAGCATKKEASSCEQAERGSCLPGDVYLTLAELGLGVGLQLVPPPPWAAFFGVT
LLGQCLLVQVLGSCDACPVEKLPISPIPVPSRPLPPPMGADPALSRCSPVFMSIFLLQES
EAMGDWSFLGRLLENAQEHSTVIGKVWLTVLFIFRILVLGAAAEDVWGDEQSDFTCNTQQ
PGCENVCYDRAFPISHIRFWALQIIFVSTPTLIYLGHVLHIVRMEEKKKEREEEEQLKRE
SPSPKEPPQDNPSSRDDRGRVRMAGALLRTYVFNIIFKTLFEVGFIAGQYFLYGFELKPL
YRCDRWPCPNTVDCFISRPTEKTIFIIFMLAVACASLLLNMLEIYHLGWKKLKQGVTSRL
GPDASEAPLGTADPPPLPPSSRPPAVAIGFPPYYAHTAAPLGQARAVGYPGAPPPAADFK
LLALTEARGKGQSAKLYNGHHHLLMTEQNWANQAAERQPPALKAYPAASTPAAPSPVGSS
SPPLAHEAEAGAAPLLLDGSGSSLEGSALAGTPEEEEQAVTTAAQMHQPPLPLGDPGRAS
KASRASSGRARPEDLAI

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_6|2394_bp
atggtcaattctgccatgttttatgacattgctgagcccttaagccacatctcttctgag
ctagctgcagacaaagttccaaacatagcaggaaacattcatgctgtgaggtctggagag
aatggatttggccataagggctcctgctttcacagaattattccagggcttatgtgccag
ggtggtgacttcacacgccatcatgacactggcagcaagtccatctatgggcagaaattt
gatggtgagaacttcatcctgaagcattcaggtcctggcatcttgtgcatggcaaatgct
ggacccgacacaaatggttcccaggttttcatctgtactgccaaaactgagtggtgggcc
agcagccagctggagcgcagggcgcggccgtgtcgtcaggcccgggctggcagggccgac
caggctcaaaggcgcagccccacggaagggcaggcgctgctgggcagcgaggcctgggca
ccggtcaccgggccttgggcctgggactttgccgccggcaccgtcctccacggctccaag
atgtgtctctgcttccggcacgtgcccgcgagagggggccacattggggcgtgtctccag
acagggggtctccgaggggagggcggcaaccaggtggcaagaagcctccctgtgtaccag
gtctcagtgtggcactgcctgaaggcctgggggtggtggggtgtcctggttcacccttgg
ctgcagccgaagctgcctggggcaggttgtgccaccaagaaagaggccagcagctgtgag
caggctgagagaggaagctgcctgcctggggatgtttacctaacacttgctgaattgggc
ctgggggtgggcctgcagctggtaccaccccctccttgggctgccttcttcggagtcaca
cttctggggcagtgcctgctcgtccaggtcctggggagctgcgatgcctgtcctgtggag
aagctgcccatcagccccatcccagtaccatccaggccgctgccgccgcccatgggtgcg
gacccggcactcagccgttgcagcccggtgttcatgagcattttcctcttacaggaatct
gaagcaatgggcgactggagctttctgggaagactcttagaaaatgcacaggagcactcc
acggtcatcggcaaggtttggctgaccgtgctgttcatcttccgcatcttggtgctgggg
gccgcggcggaggacgtgtggggcgatgagcagtcagacttcacctgcaacacccagcag
ccgggctgcgagaacgtctgctacgacagggccttccccatctcccacatccgcttctgg
gcgctgcagatcatcttcgtgtccacgcccaccctcatctacctgggccacgtgctgcac
atcgtgcgcatggaagagaagaagaaagagagggaggaggaggagcagctgaagagagag
agccccagccccaaggagccaccgcaggacaatccctcgtcgcgggacgaccgcggcagg
gtgcgcatggccggggcgctgctgcggacctacgtcttcaacatcatcttcaagacgctg
ttcgaggtgggcttcatcgccggccagtactttctgtacggcttcgagctgaagccgctc
taccgctgcgaccgctggccctgccccaacacggtggactgcttcatctccaggcccacg
gagaagaccatcttcatcatcttcatgctggcggtggcctgcgcgtccctgctgctcaac
atgctggagatctaccacctgggctggaagaagctcaagcagggcgtgaccagccgcctc
ggcccggacgcctccgaggccccgctggggacagccgatcccccgcccctgccccccagc
tcccggccgcccgccgttgccatcgggttcccaccctactatgcgcacaccgctgcgccc
ctgggacaggcccgcgccgtgggctaccccggggccccgccaccagccgcggacttcaaa
ctgctagccctgaccgaggcgcgcggaaagggccagtccgccaagctctacaacggccac
caccacctgctgatgactgagcagaactgggccaaccaggcggccgagcggcagcccccg
gcgctcaaggcttacccggcagcgtccacgcctgcagcccccagccccgtcggcagcagc
tccccgccactcgcgcacgaggctgaggcgggcgcggcgcccctgctgctggatgggagc
ggcagcagtctggaggggagcgccctggcagggacccccgaggaggaggagcaggccgtg
accaccgcggcccagatgcaccagccgcccttgcccctcggagacccaggtcgggccagc
aaggccagcagggccagcagcgggcgggccagaccggaggacttggccatctag

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_7|85_aa
MGPRRAWLGQMAQATQAAERDLNVNRYLYIHVHCSSVHNSQEVEGAQVPISEWTEKQNVV
YACNGMLFSLKKEGISDTCYNMDEP

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_7|258_bp
atggggccaaggagggcgtggctgggacagatggcgcaggccacgcaggccgctgagaga
gatctcaatgtgaacagatatttgtacatccatgttcattgcagctctgttcacaacagc
caagaggtggaaggagcccaagtgcccatcagcgaatggacagagaagcaaaatgtagtc
tatgcatgcaatggaatgttattcagccttaaaaaggaaggaatttctgacacatgctac
aacatggatgaaccttga

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_8|226_aa
MDWGTLQTILGGVNKHSTSIGKIWLTVLFIFRIMILVVAAKEVWGDEQADFVCNTLQPGC
KNVCYDHYFPISHIRLWALQLIFVSTPALLVAMHVAYRRHEKKRKFIKGEIKSEFKDIEE
IKTQKVRIEGSLWWTYTSSIFFRVIFEAAFMYVFYVMYDGFSMQRLVKCNAWPCPNTVDC
FVSRPTEKTVFTVFMIAVSGICILLNVTELCYLLIRYCSGKSKKPV

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_8|681_bp
atggattggggcacgctgcagacgatcctggggggtgtgaacaaacactccaccagcatt
ggaaagatctggctcaccgtcctcttcatttttcgcattatgatcctcgttgtggctgca
aaggaggtgtggggagatgagcaggccgactttgtctgcaacaccctgcagccaggctgc
aagaacgtgtgctacgatcactacttccccatctcccacatccggctatgggccctgcag
ctgatcttcgtgtccacgccagcgctcctagtggccatgcacgtggcctaccggagacat
gagaagaagaggaagttcatcaagggggagataaagagtgaatttaaggacatcgaggag
atcaaaacccagaaggtccgcatcgaaggctccctgtggtggacctacacaagcagcatc
ttcttccgggtcatcttcgaagccgccttcatgtacgtcttctatgtcatgtacgacggc
ttctccatgcagcggctggtgaagtgcaacgcctggccttgtcccaacactgtggactgc
tttgtgtcccggcccacggagaagactgtcttcacagtgttcatgattgcagtgtctgga
atttgcatcctgctgaatgtcactgaattgtgttatttgctaattagatattgttctggg
aagtcaaaaaagccagtttaa

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_9|261_aa
MDWGTLHTFIGGVNKHSTSIGKVWITVIFIFRVMILVVAAQEVWGDEQEDFVCNTLQPGC
KNVCYDHFFPVSHIRLWALQLIFVSTPALLVAMHVAYYRHETTRKFRRGEKRNDFKDIED
IKKQKVRIEGSLWWTYTSSIFFRIIFEAAFMYVFYFLYNGYHLPWVLKCGIDPCPNLVDC
FISRPTEKTVFTIFMISASVICMLLNVAELCYLLLKVCFRRSKRAQTQKNHPNHALKESK
QNEMNELISDSGQNAITGFPS

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_9|786_bp
atggattgggggacgctgcacactttcatcgggggtgtcaacaaacactccaccagcatc
gggaaggtgtggatcacagtcatctttattttccgagtcatgatcctcgtggtggctgcc
caggaagtgtggggtgacgagcaagaggacttcgtctgcaacacactgcaaccgggatgc
aaaaatgtgtgctatgaccactttttcccggtgtcccacatccggctgtgggccctccag
ctgatcttcgtctccaccccagcgctgctggtggccatgcatgtggcctactacaggcac
gaaaccactcgcaagttcaggcgaggagagaagaggaatgatttcaaagacatagaggac
attaaaaagcagaaggttcggatagaggggtcgctgtggtggacgtacaccagcagcatc
tttttccgaatcatctttgaagcagcctttatgtatgtgttttacttcctttacaatggg
taccacctgccctgggtgttgaaatgtgggattgacccctgccccaaccttgttgactgc
tttatttctaggccaacagagaagaccgtgtttaccatttttatgatttctgcgtctgtg
atttgcatgctgcttaacgtggcagagttgtgctacctgctgctgaaagtgtgttttagg
agatcaaagagagcacagacgcaaaaaaatcaccccaatcatgccctaaaggagagtaag
cagaatgaaatgaatgagctgatttcagatagtggtcaaaatgcaatcacaggtttccca
agctaa

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_10|118_aa
MAVNAGDSASFKEDSTPLTAPVLTLVGGQGREVSPVCKTQDARDTGDRSSSPTGSAFWCL
HVLTKMCRQLLLITVVHAIVISISQVSRKPITQPIPCGSPGTYASAQVEGLSELKPAS

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_10|357_bp
atggcagtaaacgcaggagactcagcaagttttaaggaggactcaactccactgactgct
cctgtcctgactttggtgggtggacagggaagggaggtcagccccgtttgcaaaacacag
gatgcccgtgacaccggagacaggtcttcttcaccgacaggaagtgccttctggtgcctg
cacgttttaactaagatgtgtcgccaattacttttaattactgtcgtccacgctattgtc
atcagcatttcacaagtttctcggaagcccatcacgcagcccataccctgcggttctccg
gggacttatgcatcggcccaagttgagggtttgtctgaactgaaacccgcatcctag

>gi568815585r:20041984_20243288|GENSCAN_predicted_peptide_11|107_aa
MALFSSSPSQITVFGNMPGLRLPYHKTCILETGPGKKVNNNNNNNKKPSQKLMAVKAAFV
VNEWDAYQQSRGDFSQGTQRGKGKEAPDSGETQATPPQQGVQAEHQQ

>gi568815585r:20041984_20243288|GENSCAN_predicted_CDS_11|324_bp
atggctctgttcagctcaagccctagccagataactgtctttggtaatatgcctgggcta
agactcccgtaccataagacttgtattctggaaactggtccaggaaaaaaagtaaacaac
aacaacaacaacaacaaaaagccatctcagaagctcatggccgtaaaggctgcatttgta
gttaatgaatgggatgcttatcagcagagcaggggtgacttctcccaaggaacacagcgt
ggaaaggggaaagaggcccctgacagtggagagacccaagccacaccacctcagcaaggg
gtccaagctgagcatcagcagtga