GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:34:15 Sequence gi568815594f:55246337_55470088 : 223752 bp : 40.31% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 16951 17370 420 0 0 64 32 280 0.566 16.05 1.02 PlyA + 17751 17756 6 1.05 2.08 PlyA - 18066 18061 6 1.05 2.07 Term - 22161 21944 218 1 2 15 40 180 0.329 2.22 2.06 Intr - 23947 23777 171 2 0 88 49 78 0.179 2.89 2.05 Intr - 26366 26298 69 0 0 74 93 50 0.462 2.34 2.04 Intr - 34100 33838 263 1 2 91 59 163 0.326 9.91 2.03 Intr - 45114 45077 38 0 2 113 103 12 0.410 1.34 2.02 Intr - 45630 45549 82 2 1 102 65 45 0.413 2.32 2.01 Init - 51254 51139 116 2 2 74 65 64 0.364 2.47 2.00 Prom - 54826 54787 40 -9.25 3.00 Prom + 55594 55633 40 -6.15 3.01 Init + 55838 56062 225 1 0 74 75 146 0.643 10.42 3.02 Term + 59140 59241 102 0 0 129 45 46 0.862 1.70 3.03 PlyA + 59300 59305 6 1.05 4.03 PlyA - 61639 61634 6 1.05 4.02 Term - 77934 77821 114 0 0 66 43 78 0.680 -1.31 4.01 Init - 81594 81529 66 0 0 95 103 80 0.924 11.32 4.00 Prom - 83033 82994 40 -3.75 5.03 PlyA - 84726 84721 6 1.05 5.02 Term - 93252 92784 469 2 1 96 44 217 0.216 11.56 5.01 Init - 96078 96071 8 0 2 45 93 0 0.458 -3.40 5.00 Prom - 97363 97324 40 -5.25 6.00 Prom + 99176 99215 40 -6.55 6.01 Init + 100001 100221 221 1 2 94 76 194 0.895 14.95 6.02 Intr + 105712 105871 160 1 1 46 79 74 0.014 1.37 6.03 Intr + 106111 106161 51 0 0 43 81 87 0.021 1.69 6.04 Intr + 113010 113152 143 2 2 110 86 41 0.095 4.33 6.05 Intr + 117793 117935 143 1 2 22 101 130 0.051 6.78 6.06 Term + 119127 119233 107 1 2 86 42 35 0.064 -3.71 6.07 PlyA + 119773 119778 6 1.05 7.06 PlyA - 120003 119998 6 -1.95 7.05 Term - 120551 120237 315 2 0 56 42 267 0.508 12.86 7.04 Intr - 135477 135389 89 1 2 42 84 59 0.543 -0.33 7.03 Intr - 140087 139973 115 2 1 88 91 119 0.966 11.30 7.02 Intr - 140554 140367 188 2 2 82 82 73 0.899 4.59 7.01 Init - 141023 140936 88 2 1 58 62 88 0.901 4.05 7.00 Prom - 145698 145659 40 -4.25 8.00 Prom + 146028 146067 40 -7.25 8.01 Init + 149854 150060 207 0 0 97 83 345 0.991 31.77 8.02 Intr + 165278 165503 226 1 1 73 109 172 0.626 14.44 8.03 Intr + 170736 170911 176 1 2 67 74 172 0.778 12.44 8.04 Intr + 171467 171649 183 1 0 64 105 61 0.764 4.46 8.05 Intr + 178202 178307 106 1 1 64 119 99 0.545 9.47 8.06 Term + 185572 185636 65 2 2 58 47 53 0.029 -4.73 8.07 PlyA + 186120 186125 6 1.05 9.14 PlyA - 187277 187272 6 1.05 9.13 Term - 189189 189079 111 0 0 78 42 113 0.624 3.28 9.12 Intr - 192201 191946 256 2 1 112 91 247 0.995 23.92 9.11 Intr - 196298 196096 203 2 2 74 63 49 0.880 -1.84 9.10 Intr - 197560 197351 210 1 0 49 95 102 0.890 5.19 9.09 Intr - 198443 198297 147 2 0 2 91 197 0.115 10.91 9.08 Intr - 203160 203060 101 1 2 72 115 20 0.460 2.01 9.07 Intr - 203896 203755 142 1 1 56 76 95 0.636 4.11 9.06 Intr - 206793 206718 76 2 1 84 53 129 0.928 7.60 9.05 Intr - 207488 207341 148 0 1 62 85 -1 0.614 -4.63 9.04 Intr - 209667 209561 107 2 2 94 93 61 0.778 6.14 9.03 Intr - 209964 209882 83 0 2 66 91 46 0.953 0.22 9.02 Intr - 212674 212556 119 2 2 83 95 60 0.877 5.46 9.01 Init - 215138 214901 238 2 1 64 78 77 0.188 2.52 9.00 Prom - 216711 216672 40 -3.25 10.00 Prom + 217056 217095 40 -3.95 10.01 Init + 219977 219982 6 1 0 93 93 0 0.148 1.91 10.02 Term + 223113 223241 129 2 0 28 47 147 0.148 1.80 10.03 PlyA + 223331 223336 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 107355 107240 116 1 2 89 49 120 0.844 5.95 S.002 Term + 150575 150664 90 1 0 54 41 115 0.860 0.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_1|139_aa MLMKDKGEREQELTQRAFNPRFWSDTGERRQGRTKTEARASDCSELCRSLSQANGKPQSK DGPLKSFTLDRSGPAVALCHAQSLTASSLGKRPECECQDRTPKVQHLLGNRQLTPFYNAR LLLKGDLSSISASLPQVDF >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_1|420_bp atgcttatgaaggacaaaggagaaagggagcaggagctaacacagagagccttcaacccg cgtttctggtctgacaccggtgaaaggagacagggaaggacaaagacagaagcaagagcc tcagactgcagtgagctgtgtagaagtcttagccaggctaatgggaagcctcagagtaaa gacggcccattgaaaagcttcacattggacagaagtggcccagctgtagcactttgtcat gcccagtcactgactgcgagcagtctggggaaaaggcctgagtgtgaatgccaagacaga actccaaaggtgcaacacctgctgggtaacaggcaacttactcccttctataatgcacgt cttctcttgaagggagatctgagcagcatatctgcatcattgccacaggttgatttttaa >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_2|318_aa MVDAPSPANLQCRRLISDCCASSEQGSVGVGPIKPGTGSFPPLSQEEKASIIPNSVILSH IDYKAEDEFISQEASQTLMTQPACTQVIRSFTAHTKPVWWSLHTDSHESFSWSLRLGSLT SHNTYDLLRLNYEEMENLNRPIMSKEIESVVKNLPTKKSPRLDDYPEYFLGNVKPSSAAG ASVSWCSHSGMQQLCLGSSTPARLPMSFPSNCHSPRRQALELHPGPLVETGCNVSKRVEQ WLLIYSAQSNPGKGSAQPSSDFSHMRDSKNCLTELTQPTEPREGRANDSFKPQSCGVVCY QQQITGKITLGQEAKLLG >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_2|957_bp atggtggacgccccttctcctgccaatctgcagtgtcgtaggttgatctcagactgctgt gctagcagtgagcaaggctccgtgggtgtgggacccatcaaaccaggcacgggaagcttt cctccactttcacaagaggaaaaagcatccattatccctaactctgttatactctcacac attgattacaaagcagaggatgagttcatttcccaagaggcttctcagaccctcatgact cagcccgcctgcacccaggtgattagaagctttactgctcacacaaagcctgtttggtgg tctcttcacacggactcgcatgaaagcttcagctggtccctccgtttggggtccctgact tcccacaacacatatgatctactaagactgaattatgaagaaatggaaaatctgaacaga ccaataatgagtaaagagattgaatcagtagtcaaaaacctcccaacaaagaaaagccca agattagatgattaccctgaatatttcctggggaatgtgaaaccttcatccgctgctggt gcgagtgtgagctggtgcagccattctgggatgcagcagctgtgcctagggtccagcacc cctgctaggcttccaatgagcttcccatccaactgccacagccctagacggcaggctctg gagctccacccaggacccttggtggaaacaggatgtaatgtttccaagcgggtcgagcag tggctgcttatctacagtgcccagtcaaaccctggaaaaggttcggcccagccatcatct gactttagccacatgagagactccaagaactgcctcactgagctcactcaacccacagaa ccacgagagggaagagcaaatgacagctttaagccacaaagttgtggggtggtttgttac caacaacagataacaggaaaaattactctgggacaagaagccaaacttctaggctag >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_3|108_aa MKTLEENLGNTIQDIGMGKDFMTKAPRAMATKAKIDKWDLIKLKSFCTAKETTIRVKRQP TEWEKIFAIYPSDKEISPISLQETAANAPSVSVVCLGLPGHTHFQDWA >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_3|327_bp atgaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaagac ttcatgactaaagcaccaagagcaatggcaacaaaagccaaaatagacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaagaggcagcct acagaatgggagaaaatttttgcaatctacccatctgacaaagagatatcacccatttcc cttcaggaaacagcagccaatgcaccttcagtttctgtggtttgcttagggctgcctggc catactcactttcaggactgggcatag >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_4|59_aa MEEKRERAKQYDARRKKENEEQRKIQSPYSGCKGLADLGPATSLAVSTPVLPLRFSSTC >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_4|180_bp atggaggagaaaagggagagagccaaacagtatgatgccagaagaaaaaaagaaaatgaa gaacagaggaaaatccaaagtccttacagtggctgcaagggcctggctgatctgggccct gctacctctctggctgtgtccactccagtcctccctctacgtttttcctcaacctgctag >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_5|158_aa MPRSLAHFPLWPLPRGRIRPLLVLACRYKSQRRIRPKPYEVARGPRRGPTNSVQQQDLSP STQTTLQTGLSVIIHTHTHSALRNFTIKEVLYRLPWLLLRWSVHRVVAMVDEDPLPQVSG QFLSVLLRVQIYSSHRVGLSPLPLRPPQEAAWRLMRED >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_5|477_bp atgccaaggtcccttgcacacttcccactctggccgcttccccgagggagaattaggccc ctcttagtgttggcatgccggtataaatcccaacgcaggatccgccctaagccatatgag gtagctaggggaccgcggagaggacccactaactccgtccagcagcaggacttgtcacca tccacacaaacaacactgcaaacagggttgtctgtgatcattcacacacatacacattca gccctccggaatttcaccatcaaggaagtactttatcgactcccgtggcttctccttcgt tggtctgtgcacagagtcgtcgccatggtagatgaggatcctttaccccaggttagtggc cagtttctttccgtgttgctgagagtccagatttattcatcacaccgggtgggtctcagc cccttacccctaaggccaccacaagaggcggcatggcgcctcatgagagaggactag >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_6|274_aa MAPWAEAEHSALNPLRAVWLTLTAAFLLTLLLQLLPPGLLPGCAIFQDLIRYGKTKCGEP SRPAACRAFDVPKRATPYFCSNFSISGITQSVIQGIHLVYRQSIIDCTKSGLAFMEKLIK VGINEDVKGCSLHETAEGVHDGEMIFFPLLYHLSAVEWLPALVPYSISVPGSTFSKLASW FAQNSRGGTVPGLRRLFECLYVSVFSNVMIHVVQYCFGLVYYVLVGLTVLSQVPMDGRNG MSVVAFCQPVPWKGLSFLVGSAGSPPKGTLQSAW >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_6|825_bp atggctccctgggcggaggccgagcactcggcgctgaacccgctgcgcgcggtgtggctc acgctgaccgccgccttcctgctgaccctactgctgcagctcctgccgcccggcctgctc ccgggctgcgcgatcttccaggacctgatccgctatgggaaaaccaagtgtggggagccg tcgcgccccgccgcctgccgagcctttgatgtccccaagagagccactccatacttctgc tccaatttctcaatttcaggcattacacaatcagttatacaaggaatacacttggtatac agacagtccatcattgactgcactaagtctggtttggcttttatggaaaagttgataaag gttggtatcaatgaggatgtgaagggctgtagccttcacgaaactgccgaaggggtccat gatggagaaatgatatttttcccacttttatatcatctcagtgctgtggaatggcttcct gctttggtgccttactcaatctctgttcctgggagcaccttttccaagctggcttcatgg tttgctcagaattctcggggcggcacagttccaggcttacgaagactcttcgagtgcctc tacgtcagtgtcttctccaatgtcatgattcacgtcgtgcagtactgttttggacttgtc tattatgtccttgttggcctaactgtgctgagccaagtgccaatggatggcaggaatggc atgtctgtggtggcattctgccagccagtaccatggaaggggctgtcattcctggtgggc agtgctggttctccccctaaaggaactttgcagagcgcttggtga >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_7|264_aa MKLQRKFNKQKKKALHSGEGTWRRVAAFTVSSLVDYTSVEMFLLKMMQGERQSYKVGIHN FSILPHFPIKMKEKKLVRSPSLSEAGIHYPAQGVTFIFMAQCDCKSSSIQVPSRRMEDWL EKQGDTGYASEPFPVFGAAAIAVLTVTPAARHHHHCNWAQEEGKNTDLLRTCGANHWLQK SKQGLPSAQQEAVSLVGGLTEETNVSTDNGQEPEALEPRGNPKKLIQVREPRIMQFHMAT SKQESVNDSKGLMKIHRSTCVSLF >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_7|795_bp atgaagttgcagcgaaagtttaataagcaaaagaagaaagctctccacagtggagagggg acctggaggagggttgctgcttttacagtatcttcccttgtggactacaccagtgtcgaa atgttcttgctaaaaatgatgcagggagaaagacaaagttacaaagttggcatccacaat ttttccatcttgccccacttcccaattaaaatgaaggaaaaaaaattggtgaggtctcca tccctgtctgaagcaggcatccattatcctgcccagggggtgacctttatcttcatggcc cagtgtgactgcaagagctctagcatccaagttcccagcagaaggatggaggactggctg gaaaaacaaggagacacagggtatgcatcagagccctttcctgtctttggagccgcagcc attgctgtcctgactgttactcctgctgcccgccaccatcaccactgcaactgggcccag gaagaagggaaaaacacagaccttctgaggacctgcggagcaaaccactggcttcagaaa agcaaacagggacttccctctgcgcagcaggaagcagttagcctagtcggaggcttgact gaagaaacaaatgtatccactgataatgggcaagagcctgaggctctggagccaaggggg aaccctaaaaagctgattcaagtaagagaaccaaggatcatgcaattccacatggccacc tcgaagcaggaatccgttaatgatagtaaaggactgatgaaaatccaccggtctacctgt gtcagtttattttaa >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_8|320_aa MAAAAPGNGRASAPRLLLLFLVPLLWAPAAVRAGPDEDLSHRNKEPPAPAQQLQPQPVAV QGPEPARVEKIFTPAAPVHTNKEDPATQTNLGFIHAFVAAISVIIVSELGDKTFFIAAIM AMRYNRLTVLAGAMLALGLMTCLSVLFGYATTVIPRVYTYYVSTVLFAIFGIRMLREGLK MSPDEGQEELEEVQAELKKKDEEFQRTKLLNGPGDVETGTSITVPQKKWLHFISPIFVQA LTLTFLAEWGDRSQLTTIVLAAREDPYGVAVGGTVGHCLCTGLAVIGGRMIAQKISVRTV LLMELDCFSAPSVKQTALRQ >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_8|963_bp atggcggccgcggctccagggaacggccgcgcatcggcgccccggctgcttctgctcttt ctggttccgctgctgtgggccccggctgcggtccgggccggcccagatgaagaccttagc caccggaacaaagaaccgccggcgccggcccagcagctgcagccgcagcctgtggctgtg cagggccccgagccggcccgggtcgagaaaatatttacaccagcagctccagttcatacc aataaagaagatcctgctacccaaactaatttgggatttatccatgcatttgtcgctgcc atatcagttattattgtatctgaattgggtgataagacattttttatagcagccatcatg gcaatgcgctataaccgcctgaccgtgctggctggtgcaatgcttgccttgggactaatg acatgcttgtcagttttgtttggctatgccaccacagtcatccccagggtctatacatac tatgtttcaactgtattatttgccatttttggcattagaatgcttcgggaaggcttaaag atgagccctgatgagggtcaagaggaactggaagaagttcaagctgaattaaagaagaaa gatgaagaatttcaacgaaccaaacttttaaatggaccgggagatgttgaaacgggtaca agcataacagtacctcagaaaaagtggttgcattttatttcacccatttttgttcaagct cttacattaacattcttagcagaatggggtgatcgctctcaactaactacaattgtattg gcagctagagaggacccctatggtgtagccgtgggtggaactgtggggcactgcctgtgc acgggattggcagtaattggaggaagaatgatagcacagaaaatctctgtcagaactgtg ctcttgatggagctggactgcttcagtgctcctagcgtcaagcagactgctttaaggcaa tga >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_9|646_aa MTTFPVTAVLPHGGPGLHCNESIFHQHLWPRKSSSTLLIKVAATFVIWFGRLKKITQVEN FEKDQTGRQKGKPYNPSNKISSSAHNGFEGTIQRTHRPSYEDRVCFVATVRLATPQFIKE MCTVEEPNEEFTSRHSLEWKFLFLDHRAPPIIGYLPFEVLGTSGYDYYHVDDLENLAKCH EHLMQYGKGKSCYYRFLTKGQQWIWLQTHYYITYHQWNSRPEFIVCTHTVVSYAEVRAER RRELGIEESLPETAADKSQDSGSDNRINTVSLKEALERFDHSPTPSASSRSSRKSSHTAV SDPSSTPTKIPTDTSTPPRQHLPAHEKMVQRRSSFSSQFSAQLGAMQHLKDQLEQRTRMI EANIHRQQEELRKIQEQLQMVHGQGLQMFLQQSNPGLNFGSVQLSSGNSSNIQQLAPINM QGQVVPTNQIQSGMNTGHIGTTQHMIQQQTLQSTSTQSQQNVLSGHSQQTSLPSQTQSTL TAPLYNTMVISQPAAGSMVQIPSSMPQNSTQSAAVTTFTQDRQIRFSQGQQLVTKLVTAP VACGAVMVPSTMLMGQVVTAYPTFATQQQQSQTLSVTQQQQQQSSQEQQLTSVQQPSQAQ LTQPPQQFLQSTFPQSHHQQHQSQQQQQLSRHRTDSLPDPSKVQPQ >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_9|1941_bp atgactacttttcctgtaacagctgttttgcctcatggaggtccaggtcttcattgcaat gaaagcatctttcaccagcatctctggcctagaaagagcagcagtaccttgcttataaaa gttgctgccactttcgtcatctggtttgggagattaaaaaaaatcactcaggtagaaaat tttgaaaaggaccagactgggagacaaaaaggaaaaccatataatcctagcaataaaata tcctcttcagcacacaatggttttgaaggaactatacaacgcacacataggccatcttat gaagatagagtttgttttgtagctactgtcaggttagctacacctcagttcatcaaggaa atgtgcactgttgaagaacccaatgaagagtttacatctagacatagtttagaatggaag tttctgtttctagatcacagggcaccacccataatagggtatttgccatttgaagttctg ggaacatcaggctatgattactatcatgtggatgacctagaaaatttggcaaaatgtcat gagcacttaatgcaatatgggaaaggcaaatcatgttattataggttcctgactaagggg caacagtggatttggcttcagactcattattatatcacttaccatcagtggaattcaagg ccagagtttattgtttgtactcacactgtagtaagttatgcagaagttagggctgaaaga cgacgagaacttggcattgaagagtctcttcctgagacagctgctgacaaaagccaagat tctgggtcagataatcgtataaacacagtcagtctcaaggaagcattggaaaggtttgat cacagcccaaccccttctgcctcttctcggagttcaagaaaatcatctcacacggccgtc tcagacccttcctcaacaccaaccaagatcccgacggatacgagcactccacccaggcag catttaccagctcatgagaagatggtgcaaagaaggtcatcatttagtagtcagttttca gctcaattaggagccatgcaacatctgaaagaccaattggaacaacggacacgcatgata gaagcaaatattcatcggcaacaagaagaactaagaaaaattcaagaacaacttcagatg gtccatggtcaggggctgcagatgtttttgcaacaatcaaatcctgggttgaattttggt tccgttcaactttcttctggaaattcatctaatatccagcaacttgcacctataaatatg caaggccaagttgttcctactaaccagattcaaagtggaatgaatactggacacattggc acaactcagcacatgatacaacaacagactttacagagtacatcaactcagagtcaacaa aatgtactgagtgggcacagtcagcaaacatctctacccagtcagacacagagcactctt acagccccactgtataacactatggtgatttctcagcctgcagccggaagcatggtccag attccatctagtatgccacaaaacagcacccagagtgctgcagtaactacattcactcag gacaggcagataagattttctcaaggtcaacaacttgtgaccaaattagtgactgctcct gtagcttgtggggcagtcatggtacctagtactatgcttatgggccaggtggtgactgca tatcctacttttgctacacaacagcaacagtcacagacattgtcagtaacgcagcagcag cagcagcagagctcccaggagcagcagctcacttcagttcagcaaccatctcaggctcag ctgacccagccaccgcaacaatttttacagagcaccttccctcagtcacatcaccagcaa catcagtctcagcaacagcagcaactcagccggcacaggactgacagcttgcccgaccct tccaaggttcaaccacagtag >gi568815594f:55246337_55470088|GENSCAN_predicted_peptide_10|44_aa MTEVFQKGTVITGDDSMPIVSPEDPPMGQDVEVEDSDVDDPDPA >gi568815594f:55246337_55470088|GENSCAN_predicted_CDS_10|135_bp atgactgaggtatttcaaaagggcactgttattacaggagatgactccatgcctattgtt tctcctgaagaccctccaatgggacaagatgtggaggtggaggacagtgatgttgatgat cctgaccccgcgtag