GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:55:20 Sequence gi568815594f:153170332_153434963 : 264632 bp : 41.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 402 520 119 2 2 30 56 123 0.190 1.64 1.02 Intr + 11236 11413 178 1 1 71 64 103 0.921 5.20 1.03 Intr + 11698 11865 168 0 0 87 78 112 0.943 9.32 1.04 Term + 16164 16283 120 2 0 66 44 131 0.750 3.99 1.05 PlyA + 18872 18877 6 1.05 2.00 Prom + 32473 32512 40 -4.45 2.01 Init + 35441 35678 238 1 1 106 75 143 0.777 13.02 2.02 Term + 42836 42945 110 0 2 133 44 107 0.826 8.49 2.03 PlyA + 44173 44178 6 1.05 3.06 PlyA - 44503 44498 6 1.05 3.05 Term - 52832 52491 342 2 0 64 49 216 0.052 8.83 3.04 Intr - 74103 73769 335 1 2 8 31 1224 0.780 103.07 3.03 Intr - 76600 76483 118 1 1 49 94 97 0.430 5.52 3.02 Intr - 83751 83638 114 0 0 106 32 49 0.020 0.82 3.01 Init - 90478 90389 90 1 0 58 3 193 0.209 8.34 3.00 Prom - 90705 90666 40 -6.95 4.00 Prom + 96825 96864 40 -4.65 4.01 Init + 100055 100188 134 1 2 64 99 93 0.932 7.66 4.02 Intr + 105562 105799 238 1 1 93 101 233 0.931 21.79 4.03 Intr + 122651 122802 152 1 2 93 81 178 0.774 15.64 4.04 Intr + 123974 124154 181 2 1 69 69 196 0.977 14.75 4.05 Term + 124982 125737 756 1 0 114 49 1086 0.973 99.74 4.06 PlyA + 136004 136009 6 1.05 5.04 PlyA - 136886 136881 6 1.05 5.03 Term - 138076 137378 699 1 0 13 42 736 0.482 54.05 5.02 Intr - 138447 138111 337 2 1 16 -1 305 0.242 9.00 5.01 Init - 139261 139206 56 1 2 72 58 48 0.496 1.01 5.00 Prom - 141281 141242 40 -8.25 6.00 Prom + 142404 142443 40 -3.65 6.01 Init + 143493 143541 49 2 1 86 91 36 0.990 2.86 6.02 Intr + 145154 145257 104 0 2 88 80 35 0.995 1.67 6.03 Intr + 145501 145668 168 0 0 53 116 191 0.993 17.62 6.04 Intr + 152317 152485 169 0 1 62 86 195 0.692 15.30 6.05 Intr + 153747 153817 71 1 2 100 74 4 0.708 -1.82 6.06 Intr + 158199 158339 141 2 0 40 76 164 0.812 10.03 6.07 Term + 164483 164635 153 1 0 119 39 136 0.999 8.74 6.08 PlyA + 166072 166077 6 1.05 7.03 PlyA - 166347 166342 6 1.05 7.02 Term - 173838 173765 74 1 2 107 54 39 0.695 -0.51 7.01 Init - 174408 174285 124 0 1 88 59 181 0.897 13.58 7.00 Prom - 182647 182608 40 -4.55 8.00 Prom + 183081 183120 40 -8.35 8.01 Init + 188148 188291 144 2 0 40 115 148 0.859 12.87 8.02 Term + 192104 192211 108 1 0 70 46 65 0.429 -1.97 8.03 PlyA + 193747 193752 6 1.05 9.04 PlyA - 194388 194383 6 1.05 9.03 Term - 195838 195641 198 1 0 -21 54 212 0.018 3.32 9.02 Intr - 218548 218437 112 0 1 80 80 92 0.040 7.06 9.01 Init - 242139 242036 104 0 2 77 81 96 0.239 7.56 9.00 Prom - 252312 252273 40 -4.05 10.04 PlyA - 252334 252329 6 1.05 10.03 Term - 260258 260076 183 2 0 76 43 117 0.652 2.56 10.02 Intr - 262931 262790 142 1 1 72 85 21 0.488 -0.37 10.01 Intr - 264448 264333 116 1 2 48 67 109 0.289 3.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 10976 11095 120 2 0 105 23 127 0.962 6.59 S.002 Sngl - 195796 195641 156 1 0 63 54 182 0.913 6.45 S.003 Intr + 223931 224005 75 1 0 49 127 96 0.803 8.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_1|194_aa GAQPGGKNSCPFAHQALHPVEGYVSPEKGTPFSNHTKGYWLKTALAFNLPSCARDEMYVW GIGLDTLRAQEVGCGSDESPGPFLSRVDSSQQPCTPGLEGTRCMRLPTHLSLPSAQAAHF TDEETEALREIRCGRGEVRKKLFASESEPEMKPEGELYKELKGPRSSALSLRKEFLLALK GQPRAESSPRVTQD >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_1|585_bp ggagctcagccagggggtaaaaacagctgtccctttgctcatcaagcccttcaccctgtt gagggctatgtcagccctgagaaagggacacctttttctaatcacacaaagggctactgg ctgaagacagctctggcgttcaaccttccctcatgcgccagggatgagatgtatgtatgg ggcatagggctagacacgctcagagcacaggaggtgggatgtggcagcgatgagtcccca gggcccttcttgtctcgtgtggactcatcccagcagccttgtacacctgggctggaggga acaagatgcatgaggctccccacccatctcagtctaccgagtgcccaagcagcccatttt acagatgaggaaactgaggccctcagagagataagatgtggtagaggggaggtgaggaag aagctctttgcttcagaatcagaaccagaaatgaagccagaaggggagctgtacaaggag ctgaaagggcctcgttctagtgctttgtctctaaggaaggaatttttgttggcgctcaag ggacagcccagagcagaatcttctcctcgagttacacaggactaa >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_2|115_aa MLFADPGGIKSEYKGCFSNEFDAKQHECLNPLYENRQSLSEARFSWCGPRALERDESPHE EVLTRNGNSDSASELLTLPGEETEAQIVEKIVPSRTAGQGQASIQFPEPVFLTIM >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_2|348_bp atgctgtttgcggatcctggaggtattaagtcagaatacaaagggtgtttttcaaatgaa tttgacgccaagcaacatgaatgcttgaacccattgtatgaaaacaggcagtctctaagt gaggccaggttctcctggtgcggtcctagagccctggagagggatgagtcaccccacgag gaggtcctgacaaggaatggaaattctgactctgccagtgaacttttgactctcccaggt gaggaaactgaggctcagattgttgagaaaattgtcccaagccgcacagctggtcagggg caggccagcattcagtttccagagcctgtgtttttaaccattatgtga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_3|332_aa MGHFVNQKARCIIFPDDDDDDHDDDDDDVCSHSVHKTFCYWKAVPIQTPREGSWISCKKE FRASPQCKVGIASGPVPGSHKATRGRGEEQGRKNRSQAAADGMCARSEEEEEEEEEEEEE EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEERRGRRREEGGEE EGEGKGGEEEGGEEEEEEEEEQEEQEEEEEEDGSGTVAHAGCQPRSARPEMDSGQASRTL RPPDLARAGGFPGHSPSHDLLRDGDSLRGNSARSGVAWASPLGPGPRCAPSQGRGALAHD IRQRAGDPGLRTRSSPPPPSSPAAALLQPQAI >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_3|999_bp atgggccattttgtaaatcagaaggctagatgcattatctttcctgatgatgatgatgat gatcatgatgatgatgatgatgatgtgtgttcacactctgttcataagacattctgttac tggaaagcggtcccgatccagaccccaagagagggttcttggatctcctgcaagaaggaa ttcagggcgagtccacagtgcaaagttggcattgcatctggaccagtgccaggaagccac aaggccaccagaggcagaggggaagagcaagggcggaagaatagaagtcaagcagcagca gatggaatgtgtgcacggtcagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaggaagaagaagaagaagaa gaagaagaagaagaagaggaggaggaggaggaggaggaggaggaggaggaggaggaggag gaggaggaggaggaggaggaggaaagaagaggaaggagaagagaagaaggaggagaagag gaaggagaaggaaaaggaggagaagaagaaggaggagaagaagaagaagaagaagaagaa gaacaagaagaacaagaagaagaagaagaagaagatggcagtggcacagtagctcatgcc ggctgccagccccgcagcgcccgtccggaaatggactctgggcaggcttcccgcacgctc cggccaccggaccttgcccgggcgggagggttcccagggcacagcccgagccacgactta ctacgagacggcgactcgcttcgcgggaacagcgcgcgctccggagtggcctgggcctca cccctcgggcctgggcctcgctgcgcgcccagccagggccgaggtgccctagctcacgac atcaggcagcgggcaggcgacccaggacttcgcacccgctcctctccgcctcctccttcc tcccccgcagccgcccttctgcagcctcaggcgatctga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_4|486_aa MASEGTNIPSPVVRQIDKQFLICSICLERYKNPKVLPCLHTFCERCLQNYIPAHSLTLSC PVCRQTSILPEKGVAALQNNFFITNLMDVLQRTPGSNAEESSILETVTAVAAGKPLSCPN HDGNVMEFYCQSCETAMCRECTEGEHAEHPTVPLKDVVEQHKASLQVQLDAVNKRLPEID SALQFISEIIHQLTNQKASIVDDIHSTFDELQKTLNVRKSVLLMELEVNYGLKHKVLQSQ LDTLLQGQESIKSCSNFTAQALNHGTETEVLLVKKQMSEKLNELADQDFPLHPRENDQLD FIVETEGLKKSIHNLGTILTTNAVASETVATGEGLRQTIIGQPMSVTITTKDKDGELCKT GNAYLTAELSTPDGSVADGEILDNKNGTYEFLYTVQKEGDFTLSLRLYDQHIRGSPFKLK VIRSADVSPTTEGVKRRVKSPGSGHVKQKAVKRPASMYSTGKRKENPIEDDLIFRVGKER ASVPDA >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_4|1461_bp atggccagtgaaggcaccaacatcccaagtcctgtggtgcgccagattgacaagcagttt ctgatttgcagtatatgcctggaacggtacaagaatcccaaggttctcccctgtctgcac actttctgcgagaggtgcctgcagaactacattcctgcccacagtttaaccctctcctgc ccagtgtgccgccagacctccatcctgcccgagaaaggggtggccgcgctccagaacaat ttcttcatcacaaacctgatggacgtgctgcagcgaactccaggcagcaacgctgaggag tcttccatcctggagacagtcactgctgtggctgcgggaaagcctctctcttgcccaaac cacgatgggaatgtgatggaattttactgccagtcctgtgagactgccatgtgtcgggag tgcacggagggggagcacgcagagcaccccacagttccactcaaggatgtggtggaacag cacaaggcctcgctccaggtccagctggatgctgtcaacaaaaggctcccagaaatagat tctgctcttcagttcatctctgaaatcattcatcagttaaccaaccaaaaggccagcatc gtggatgacattcattccacctttgatgagctccagaagactttaaatgtgcgcaagagt gtgctgcttatggaattggaggtcaactatggcctcaaacacaaagtcctccagtcgcag ctggatactctgctccaggggcaggagagcattaagagctgcagcaacttcacagcgcag gccctcaaccatggcacggagaccgaggtcctactggtgaagaagcagatgagcgagaag ctgaacgagctggccgaccaggacttccccttgcacccgcgggagaacgaccagctggat ttcatcgtggaaaccgaggggctgaagaagtccatccacaacctcgggacgatcttaacc accaacgccgttgcctcagagacagtggccacgggcgaggggctgcggcagaccatcatc gggcagcccatgtccgtcaccatcaccaccaaggacaaagacggtgagctgtgcaaaacc ggcaacgcctacctcaccgccgaactgagcacccccgacgggagcgtggcagacggggag atcctggacaacaagaacggcacctatgagtttttgtacactgtccagaaggaaggggac tttaccctgtctctgagactctatgaccagcacatccgaggcagcccgtttaagctgaaa gtgatccgatccgctgatgtgtctcccaccacagaaggcgtgaagaggcgcgttaagtcc ccggggagcggccacgtcaagcagaaagctgtgaaaagacccgcaagcatgtacagcact ggaaaacgaaaagagaatcccatcgaagacgatttgatctttcgagtgggtaaggagagg gcttctgtgcccgacgcctga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_5|363_aa MQQDETEPSVRSNTGSSINPASFKISAVHEILCKLSLEGEHSTLPSAFGSVKAYTKFDAE QDALNIEMAIKTKGVDEITTVNILTNHSNAQRQDIAFTYQRRTKKELVSALKSALSGHLD SGFGPIKDTCSGLGTNKDSLIEIICSRTNQELQEINRVYKEMYKTDLEKDIISDTSGDFC KLMVALAKGRRAEDGSVIDYELIDQDARDLYDAGVKRKGTEVPKWISVMTERSMSHFQKV FDRYKSYSPYDMLESIKKEVKGDLENSFLNLVQCIQNKPLYFADRLYDSIMGMGTQDKVL IRIMVSHNEVDMLKIRSEFKRKYSKSLYYYIQQDTKGAVPVWWRWLKSDTARASRNGAPH ASS >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_5|1092_bp atgcaacaagatgaaactgagccaagtgtaagatctaacacaggatcttcaatcaaccca gcttccttcaaaatatctgctgttcatgaaatcctgtgcaagctcagcttggagggtgaa cactctacactcccaagtgcatttgggtcagtcaaagcctacaccaaatttgatgctgag caggatgctttgaacattgaaatggccatcaagaccaaaggtgtggatgagatcaccact gtcaacattttgactaaccacagcaatgcacagagacaggatattgccttcacctaccag agaaggaccaaaaaggaacttgtatcagcacttaagtcagccttatctggccacctagat agtggctttgggcctattaaagacacctgctcagggctgggaaccaacaaggactccctc attgagatcatctgctcaagaaccaaccaagagctgcaggaaattaacagagtctacaag gaaatgtacaagactgatctggagaaggacattatttcggacacatctggtgacttctgc aagctgatggttgccctggcaaagggtagaagagcagaggatggctctgtcattgattat gaactgattgaccaagatgcccgggatctctatgacgctggggtgaagaggaaaggaact gaagttcccaagtggatcagcgtcatgaccgagcggagcatgtcccacttccagaaagta tttgataggtacaagagctacagtccttatgacatgttggagagcatcaagaaagaggtt aaaggagacctggaaaattctttcctgaacctggtccagtgtattcagaacaagcccctg tatttcgctgaccggctgtacgactccataatgggcatggggactcaagataaggtcctg atcagaatcatggtctcccacaatgaagtggacatgttgaaaattaggtctgaattcaag agaaagtatagcaagtccctgtactattacatccagcaagacactaagggtgctgtacct gtgtggtggagatggctgaagtccgacacagcacgagcgtccagaaatggtgctccccat gcttccagctaa >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_6|284_aa MGFHHVGQAGLELLTSGTKGRNKGEFTNLQGVAASTNGKILIADSNNQCVQIFSNDGQFK SRFGIRGRSPGQLQRPTGVAVHPSGDIIIADYDNKWVSIFSSDGKFKTKIGSGKLMGPKG VSVDRNGHIIVVDNKACCVFIFQPNGKIVTRFGSRGNGDRQFAGPHFAAVNSNNEIIITD FHNHSVKVFNQEGEFMLKFGSNGEGNGQFNAPTGVAVDSNGNIIVADWGNSRIQVFDGSG SFLSYINTSADPLYGPQGLALTSDGHVVVADSGNHCFKVYRYLQ >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_6|855_bp atggggtttcaccatgttggccaggctggtctcgaactcctgacctcaggtaccaaagga agaaataaaggagagtttacaaatcttcagggggtagctgcatctacaaatggaaagata ttaattgcagacagtaacaaccaatgtgtgcagatattttccaatgatggccagttcaaa agtcgttttggcatacggggacgctctccggggcagctgcagcggcccacaggagtggct gtacatcccagtggggacataatcattgccgattatgataataaatgggtcagcattttc tcctccgatgggaaatttaagacaaaaattggatcaggaaagctgatgggacccaaagga gtttctgtggaccgcaatgggcacattattgttgtggacaacaaggcgtgctgcgtgttt atcttccagccaaacgggaaaatagtcaccaggtttggtagccgaggaaatggggacagg cagtttgcaggtccccattttgcagctgtaaatagcaataatgagattattattacagat ttccataatcattctgtcaaggtgtttaatcaggaaggagaattcatgttgaagtttggc tcaaatggagaaggaaatgggcagtttaatgctccaacaggtgtagcagtggattcaaat ggaaacatcattgtggccgactggggaaacagcaggatccaggtttttgatgggagtgga tcatttttgtcctacattaacacatctgctgacccactctatggcccccaaggcctggcc ctaacttcagatggtcatgttgtggttgcagactctggaaatcactgtttcaaagtctat cgatacttacagtaa >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_7|65_aa MARAQGLPLAGPALGERGGTGQDAFDFGADYAAGAEQRDGGGFKKEGNNGMNFKSTWSKI RYFMA >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_7|198_bp atggcgcgggcgcaggggcttccgctggccgggcccgcgcttggggagaggggcgggaca ggccaggacgcgtttgattttggcgccgactacgccgcgggggcggagcagcgagacggt ggaggttttaaaaaagaaggaaacaacggaatgaatttcaaatcaacatggtcaaaaata cgctacttcatggcttga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_8|83_aa MSVKEVLQSLVDDGMVDCERIGTSNYYWAFPSKALHARKHKLEVLESQSCFCHVKCLLAL SKGSLRPPQKQKPPCFQVEPAEA >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_8|252_bp atgtcagtaaaagaagtccttcaaagcttagttgatgatggtatggttgactgtgagagg atcggaacttctaattattattgggcttttccaagtaaagctcttcatgcaaggaaacat aagttggaggttctggaatctcagtcctgcttctgccatgtaaaatgcctgctcgccctg agtaaaggatccctgaggcctccccagaagcagaagccaccatgcttccaagtagagcct gcagaagcatga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_9|137_aa MERHRKNCERTQGEGRPGETKPANILILKFDPPGCEILLCPSLPFNCQYILVLLGSRIRA REPPMAVQAVTQQSRKQCDQEAEIRVMQQQVKECWQPTEIERGKEDYPREPLEEQQLCQH FDFGLVTLIFNFWPPEL >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_9|414_bp atggagagacacaggaaaaactgtgagaggacacagggagaaggaagacctggagaaacc aaacctgccaacatcttgatcttgaaatttgatcctccaggatgtgaaattcttctctgt ccatccttacccttcaactgtcagtatattcttgttcttcttggaagcagaataagagct cgggaaccgccaatggcagtacaagctgtaacacagcagagcagaaagcaatgtgaccaa gaggcagaaattagagtgatgcagcaacaagtcaaagaatgctggcagcccacagaaatt gaaagaggcaaggaagattatcccagagagcctctggaggagcaacagctgtgccaacac tttgattttggcctagtaacactgatttttaacttctggcctccagagctgtga >gi568815594f:153170332_153434963|GENSCAN_predicted_peptide_10|146_aa CRRIQPKKGNSDTCYIMGEHYALLNNHKGQILYDSNHSRVLWRSLATHRGISQRRLCLPL FLWSYGSPFHLNGGFSKSKNEIELYCYPLKIFITVLIDMHYQFDFCFCTKTDYLSDSDLA IPLCEWYKVGISKVTGEALVFQESLL >gi568815594f:153170332_153434963|GENSCAN_predicted_CDS_10|441_bp tgtaggcgcattcagcctaaaaaaggaaattctgacacctgctacatcatgggtgaacat tatgcattattgaataatcacaagggccaaattctgtatgattccaatcattcgagagtt ctctggcgctctctggctactcacagaggaatttcacagaggaggttgtgcctcccgctg tttctctggagctacgggtccccatttcacctcaacggtggcttttcgaaaagtaaaaat gaaatagagctttattgctatcctctgaaaatctttattacagtgcttattgacatgcac taccaattcgatttctgtttctgcaccaagaccgattatctttcggactcagaccttgcc attcctctttgtgaatggtataaagtaggcatttcaaaagtgactggagaagctctggtg ttccaagaatctcttttgtag