GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:28:56 Sequence gi568815594f:153250063_153514854 : 264792 bp : 41.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 853 848 6 1.05 1.03 Term - 4020 3903 118 2 1 106 42 86 0.139 2.83 1.02 Intr - 7316 7106 211 0 1 13 92 95 0.022 -0.55 1.01 Init - 10747 10660 88 1 1 58 3 196 0.405 8.95 1.00 Prom - 10974 10935 40 -6.95 2.00 Prom + 17094 17133 40 -4.65 2.01 Init + 20324 20457 134 1 2 64 99 93 0.932 7.66 2.02 Intr + 25831 26068 238 1 1 93 101 233 0.931 21.79 2.03 Intr + 42920 43071 152 1 2 93 81 178 0.774 15.64 2.04 Intr + 44243 44423 181 2 1 69 69 196 0.977 14.75 2.05 Term + 45251 46006 756 1 0 114 49 1086 0.973 99.74 2.06 PlyA + 56273 56278 6 1.05 3.04 PlyA - 57155 57150 6 1.05 3.03 Term - 58345 57647 699 1 0 13 42 736 0.482 54.05 3.02 Intr - 58716 58380 337 2 1 16 -1 305 0.242 9.00 3.01 Init - 59530 59475 56 1 2 72 58 48 0.496 1.01 3.00 Prom - 61550 61511 40 -8.25 4.00 Prom + 62673 62712 40 -3.65 4.01 Init + 63762 63810 49 2 1 86 91 36 0.990 2.86 4.02 Intr + 65423 65526 104 0 2 88 80 35 0.995 1.67 4.03 Intr + 65770 65937 168 0 0 53 116 191 0.993 17.62 4.04 Intr + 72586 72754 169 0 1 62 86 195 0.692 15.30 4.05 Intr + 74016 74086 71 1 2 100 74 4 0.708 -1.82 4.06 Intr + 78468 78608 141 2 0 40 76 164 0.812 10.03 4.07 Term + 84752 84904 153 1 0 119 39 136 0.999 8.74 4.08 PlyA + 86341 86346 6 1.05 5.03 PlyA - 86616 86611 6 1.05 5.02 Term - 94107 94034 74 1 2 107 54 39 0.695 -0.51 5.01 Init - 94677 94554 124 0 1 88 59 181 0.897 13.58 5.00 Prom - 102916 102877 40 -4.55 6.00 Prom + 103350 103389 40 -8.35 6.01 Init + 108417 108560 144 2 0 40 115 148 0.859 12.87 6.02 Term + 112373 112480 108 1 0 70 46 65 0.429 -1.97 6.03 PlyA + 114016 114021 6 1.05 7.04 PlyA - 114657 114652 6 1.05 7.03 Term - 116107 115910 198 1 0 -21 54 212 0.018 3.32 7.02 Intr - 138817 138706 112 0 1 80 80 92 0.040 7.06 7.01 Init - 162408 162305 104 0 2 77 81 96 0.240 7.56 7.00 Prom - 172581 172542 40 -4.05 8.03 PlyA - 172603 172598 6 1.05 8.02 Term - 180502 180345 158 2 2 75 43 135 0.283 4.81 8.01 Init - 197119 196936 184 1 1 80 14 188 0.509 9.93 8.00 Prom - 199461 199422 40 -0.75 9.02 PlyA - 200869 200864 6 1.05 9.01 Sngl - 202507 202088 420 1 0 67 49 319 0.610 21.95 9.00 Prom - 205596 205557 40 -5.05 10.00 Prom + 210900 210939 40 -5.65 10.01 Init + 210954 211577 624 2 0 71 60 228 0.256 13.69 10.02 Intr + 215043 215156 114 2 0 66 92 49 0.022 2.82 10.03 Term + 216521 216619 99 1 0 120 47 46 0.212 0.85 10.04 PlyA + 217340 217345 6 1.05 11.05 PlyA - 217794 217789 6 1.05 11.04 Term - 221373 221268 106 2 1 75 48 107 0.298 2.30 11.03 Intr - 223390 223104 287 1 2 93 45 221 0.351 13.52 11.02 Intr - 224107 224055 53 2 2 25 97 56 0.657 -2.09 11.01 Init - 230931 230727 205 0 1 45 86 218 0.396 16.36 11.00 Prom - 232433 232394 40 -3.65 12.03 PlyA - 232534 232529 6 1.05 12.02 Term - 238884 238714 171 0 0 64 48 230 0.635 13.44 12.01 Init - 257495 257427 69 2 0 75 87 46 0.041 4.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 116065 115910 156 1 0 63 54 182 0.913 6.45 S.002 Intr + 144200 144274 75 1 0 49 127 96 0.803 8.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_1|138_aa MGHFVNQKARCIIFPDDDDDDHDDDDDDVYCSKPGKGLWDRDAAASCNEIRGWLAFPLSQ QSAVSMLSCWAAELHHQKVTAGFTRGEATSAVSVQRSRGSHTLFIRHSVTGKRSRSRPQE RVLGSPARRNSGRVHSAK >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_1|417_bp atgggccattttgtaaatcagaaggctagatgcattatctttcctgatgatgatgatgat gatcatgatgatgatgatgatgatgtgtactgcagcaaaccaggcaagggattatgggat agggatgcagcagcgagctgcaatgagattcgtgggtggcttgctttccctctgagtcag cagtcagctgtttccatgctctcgtgctgggccgctgagctccatcaccagaaagtgact gcaggattcaccaggggagaggcgaccagtgcagtctccgtgcaacgcagccggggcagt cacactctgttcataagacattctgttactggaaagcggtcccgatccagaccccaagag agggttcttggatctcctgcaagaaggaattcagggcgagtccacagtgcaaagtaa >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_2|486_aa MASEGTNIPSPVVRQIDKQFLICSICLERYKNPKVLPCLHTFCERCLQNYIPAHSLTLSC PVCRQTSILPEKGVAALQNNFFITNLMDVLQRTPGSNAEESSILETVTAVAAGKPLSCPN HDGNVMEFYCQSCETAMCRECTEGEHAEHPTVPLKDVVEQHKASLQVQLDAVNKRLPEID SALQFISEIIHQLTNQKASIVDDIHSTFDELQKTLNVRKSVLLMELEVNYGLKHKVLQSQ LDTLLQGQESIKSCSNFTAQALNHGTETEVLLVKKQMSEKLNELADQDFPLHPRENDQLD FIVETEGLKKSIHNLGTILTTNAVASETVATGEGLRQTIIGQPMSVTITTKDKDGELCKT GNAYLTAELSTPDGSVADGEILDNKNGTYEFLYTVQKEGDFTLSLRLYDQHIRGSPFKLK VIRSADVSPTTEGVKRRVKSPGSGHVKQKAVKRPASMYSTGKRKENPIEDDLIFRVGKER ASVPDA >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_2|1461_bp atggccagtgaaggcaccaacatcccaagtcctgtggtgcgccagattgacaagcagttt ctgatttgcagtatatgcctggaacggtacaagaatcccaaggttctcccctgtctgcac actttctgcgagaggtgcctgcagaactacattcctgcccacagtttaaccctctcctgc ccagtgtgccgccagacctccatcctgcccgagaaaggggtggccgcgctccagaacaat ttcttcatcacaaacctgatggacgtgctgcagcgaactccaggcagcaacgctgaggag tcttccatcctggagacagtcactgctgtggctgcgggaaagcctctctcttgcccaaac cacgatgggaatgtgatggaattttactgccagtcctgtgagactgccatgtgtcgggag tgcacggagggggagcacgcagagcaccccacagttccactcaaggatgtggtggaacag cacaaggcctcgctccaggtccagctggatgctgtcaacaaaaggctcccagaaatagat tctgctcttcagttcatctctgaaatcattcatcagttaaccaaccaaaaggccagcatc gtggatgacattcattccacctttgatgagctccagaagactttaaatgtgcgcaagagt gtgctgcttatggaattggaggtcaactatggcctcaaacacaaagtcctccagtcgcag ctggatactctgctccaggggcaggagagcattaagagctgcagcaacttcacagcgcag gccctcaaccatggcacggagaccgaggtcctactggtgaagaagcagatgagcgagaag ctgaacgagctggccgaccaggacttccccttgcacccgcgggagaacgaccagctggat ttcatcgtggaaaccgaggggctgaagaagtccatccacaacctcgggacgatcttaacc accaacgccgttgcctcagagacagtggccacgggcgaggggctgcggcagaccatcatc gggcagcccatgtccgtcaccatcaccaccaaggacaaagacggtgagctgtgcaaaacc ggcaacgcctacctcaccgccgaactgagcacccccgacgggagcgtggcagacggggag atcctggacaacaagaacggcacctatgagtttttgtacactgtccagaaggaaggggac tttaccctgtctctgagactctatgaccagcacatccgaggcagcccgtttaagctgaaa gtgatccgatccgctgatgtgtctcccaccacagaaggcgtgaagaggcgcgttaagtcc ccggggagcggccacgtcaagcagaaagctgtgaaaagacccgcaagcatgtacagcact ggaaaacgaaaagagaatcccatcgaagacgatttgatctttcgagtgggtaaggagagg gcttctgtgcccgacgcctga >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_3|363_aa MQQDETEPSVRSNTGSSINPASFKISAVHEILCKLSLEGEHSTLPSAFGSVKAYTKFDAE QDALNIEMAIKTKGVDEITTVNILTNHSNAQRQDIAFTYQRRTKKELVSALKSALSGHLD SGFGPIKDTCSGLGTNKDSLIEIICSRTNQELQEINRVYKEMYKTDLEKDIISDTSGDFC KLMVALAKGRRAEDGSVIDYELIDQDARDLYDAGVKRKGTEVPKWISVMTERSMSHFQKV FDRYKSYSPYDMLESIKKEVKGDLENSFLNLVQCIQNKPLYFADRLYDSIMGMGTQDKVL IRIMVSHNEVDMLKIRSEFKRKYSKSLYYYIQQDTKGAVPVWWRWLKSDTARASRNGAPH ASS >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_3|1092_bp atgcaacaagatgaaactgagccaagtgtaagatctaacacaggatcttcaatcaaccca gcttccttcaaaatatctgctgttcatgaaatcctgtgcaagctcagcttggagggtgaa cactctacactcccaagtgcatttgggtcagtcaaagcctacaccaaatttgatgctgag caggatgctttgaacattgaaatggccatcaagaccaaaggtgtggatgagatcaccact gtcaacattttgactaaccacagcaatgcacagagacaggatattgccttcacctaccag agaaggaccaaaaaggaacttgtatcagcacttaagtcagccttatctggccacctagat agtggctttgggcctattaaagacacctgctcagggctgggaaccaacaaggactccctc attgagatcatctgctcaagaaccaaccaagagctgcaggaaattaacagagtctacaag gaaatgtacaagactgatctggagaaggacattatttcggacacatctggtgacttctgc aagctgatggttgccctggcaaagggtagaagagcagaggatggctctgtcattgattat gaactgattgaccaagatgcccgggatctctatgacgctggggtgaagaggaaaggaact gaagttcccaagtggatcagcgtcatgaccgagcggagcatgtcccacttccagaaagta tttgataggtacaagagctacagtccttatgacatgttggagagcatcaagaaagaggtt aaaggagacctggaaaattctttcctgaacctggtccagtgtattcagaacaagcccctg tatttcgctgaccggctgtacgactccataatgggcatggggactcaagataaggtcctg atcagaatcatggtctcccacaatgaagtggacatgttgaaaattaggtctgaattcaag agaaagtatagcaagtccctgtactattacatccagcaagacactaagggtgctgtacct gtgtggtggagatggctgaagtccgacacagcacgagcgtccagaaatggtgctccccat gcttccagctaa >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_4|284_aa MGFHHVGQAGLELLTSGTKGRNKGEFTNLQGVAASTNGKILIADSNNQCVQIFSNDGQFK SRFGIRGRSPGQLQRPTGVAVHPSGDIIIADYDNKWVSIFSSDGKFKTKIGSGKLMGPKG VSVDRNGHIIVVDNKACCVFIFQPNGKIVTRFGSRGNGDRQFAGPHFAAVNSNNEIIITD FHNHSVKVFNQEGEFMLKFGSNGEGNGQFNAPTGVAVDSNGNIIVADWGNSRIQVFDGSG SFLSYINTSADPLYGPQGLALTSDGHVVVADSGNHCFKVYRYLQ >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_4|855_bp atggggtttcaccatgttggccaggctggtctcgaactcctgacctcaggtaccaaagga agaaataaaggagagtttacaaatcttcagggggtagctgcatctacaaatggaaagata ttaattgcagacagtaacaaccaatgtgtgcagatattttccaatgatggccagttcaaa agtcgttttggcatacggggacgctctccggggcagctgcagcggcccacaggagtggct gtacatcccagtggggacataatcattgccgattatgataataaatgggtcagcattttc tcctccgatgggaaatttaagacaaaaattggatcaggaaagctgatgggacccaaagga gtttctgtggaccgcaatgggcacattattgttgtggacaacaaggcgtgctgcgtgttt atcttccagccaaacgggaaaatagtcaccaggtttggtagccgaggaaatggggacagg cagtttgcaggtccccattttgcagctgtaaatagcaataatgagattattattacagat ttccataatcattctgtcaaggtgtttaatcaggaaggagaattcatgttgaagtttggc tcaaatggagaaggaaatgggcagtttaatgctccaacaggtgtagcagtggattcaaat ggaaacatcattgtggccgactggggaaacagcaggatccaggtttttgatgggagtgga tcatttttgtcctacattaacacatctgctgacccactctatggcccccaaggcctggcc ctaacttcagatggtcatgttgtggttgcagactctggaaatcactgtttcaaagtctat cgatacttacagtaa >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_5|65_aa MARAQGLPLAGPALGERGGTGQDAFDFGADYAAGAEQRDGGGFKKEGNNGMNFKSTWSKI RYFMA >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_5|198_bp atggcgcgggcgcaggggcttccgctggccgggcccgcgcttggggagaggggcgggaca ggccaggacgcgtttgattttggcgccgactacgccgcgggggcggagcagcgagacggt ggaggttttaaaaaagaaggaaacaacggaatgaatttcaaatcaacatggtcaaaaata cgctacttcatggcttga >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_6|83_aa MSVKEVLQSLVDDGMVDCERIGTSNYYWAFPSKALHARKHKLEVLESQSCFCHVKCLLAL SKGSLRPPQKQKPPCFQVEPAEA >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_6|252_bp atgtcagtaaaagaagtccttcaaagcttagttgatgatggtatggttgactgtgagagg atcggaacttctaattattattgggcttttccaagtaaagctcttcatgcaaggaaacat aagttggaggttctggaatctcagtcctgcttctgccatgtaaaatgcctgctcgccctg agtaaaggatccctgaggcctccccagaagcagaagccaccatgcttccaagtagagcct gcagaagcatga >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_7|137_aa MERHRKNCERTQGEGRPGETKPANILILKFDPPGCEILLCPSLPFNCQYILVLLGSRIRA REPPMAVQAVTQQSRKQCDQEAEIRVMQQQVKECWQPTEIERGKEDYPREPLEEQQLCQH FDFGLVTLIFNFWPPEL >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_7|414_bp atggagagacacaggaaaaactgtgagaggacacagggagaaggaagacctggagaaacc aaacctgccaacatcttgatcttgaaatttgatcctccaggatgtgaaattcttctctgt ccatccttacccttcaactgtcagtatattcttgttcttcttggaagcagaataagagct cgggaaccgccaatggcagtacaagctgtaacacagcagagcagaaagcaatgtgaccaa gaggcagaaattagagtgatgcagcaacaagtcaaagaatgctggcagcccacagaaatt gaaagaggcaaggaagattatcccagagagcctctggaggagcaacagctgtgccaacac tttgattttggcctagtaacactgatttttaacttctggcctccagagctgtga >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_8|113_aa MAKTCLHVCDLRTGSGKGEERDQMSLATCHREYVKEKQEVDGIFSEVPGPTVSEEPQFGA HVLIDMHYQFDFCFCTKTDYLSDSDLAIPLCEWYKVGISKVTGEALVFQESLL >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_8|342_bp atggctaaaacttgtcttcatgtttgtgacttgagaactggaagtggaaagggggaggag agggaccaaatgtcactagcaacctgccatcgagaatatgttaaggagaagcaggaggtt gatggaatcttctcagaggtccctggtcccacagtgtctgaggagcctcaatttggtgcc catgtgcttattgacatgcactaccaattcgatttctgtttctgcaccaagaccgattat ctttcggactcagaccttgccattcctctttgtgaatggtataaagtaggcatttcaaaa gtgactggagaagctctggtgttccaagaatctcttttgtag >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_9|139_aa MIRGGLHQQRKLVPCESQSSHPGWPWQTHREQGLSAQEEKPGRRQTGRAVENTTQSKSTA NSSFSETAGRSPDERMQLSEARGLQAARAKGQNHNQRVCEPHAVALAAAARGLQRKATAS PSSPAETSELPESSEKGKM >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_9|420_bp atgatcagagggggcctccaccagcaaagaaaattggttccatgcgaatcacagtcctca caccctggctggccgtggcagacacacagagagcagggtctgtcagctcaggaagagaaa ccaggcaggaggcaaacgggccgggccgtggagaacacgacccaaagcaagtcaactgcc aacagtagcttctcggaaacagcaggacgcagcccagacgagcgaatgcagctctcagag gcgcgaggactccaggccgccagagccaaggggcagaatcacaatcagcgggtgtgtgaa cctcatgcagtggccctggcggctgctgccagagggctgcagagaaaggctacggcatcg cccagcagccctgctgagacgtcggaactcccagaaagtagtgagaagggcaagatgtag >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_10|278_aa MGKGFMTKTPKAMAAKAKIDKGDLIKLKSFCTAKETTIRVNRQPTEWEKIFTINPSDKGL ISRIYKELKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSSLAVREMQIKTT MRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLLQPLWKTVWRFLKDLELEIP VSRNTQPSHYWVYTERIINHAGIKTHAHERGSRFHFVLLRIVSAMQALVFYSLACKAQGA IKQRIKGTAEIYKRARGGKGKGQEEEGKLLRFSADNIF >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_10|837_bp atgggcaagggcttcatgactaaaacaccaaaagcaatggcagcaaaagccaaaattgac aaaggggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttacaatcaacccatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatc aaaaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac agacacatgaaaaaatgctcatcatcactggccgtcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacactgttggtgggactgtaaacta cttcaaccattgtggaagacagtgtggcgattcctcaaggatctagaactagaaatacca gtttctagaaatacccagccatcccattactgggtatataccgaaaggattataaatcat gctggtataaagacacatgcacacgaaagaggatctagatttcattttgtccttctcagg attgtgtctgctatgcaagcactggttttctatagtctggcttgtaaggcacaaggagca attaaacagaggattaagggaactgctgaaatctataagagggcgaggggaggaaaggga aaggggcaggaggaggaaggaaagctgttgcgattctccgctgataacattttctga >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_11|216_aa MTGKDVGQMDVRRTGWGGYRMETEDHMKTQPHLVTAGTGTGDSERPRSSVQDTVILWHLQ AKGELWGEDVYKQEGGHKHIMEKVKLKDLPTHGKPSSYLSFLPTQQPTPSVAPAVLEDKR SFLLCLQTAACNHPVHHRTLCSKDQSPHIIHSLSPLSHSKDPALNVITSVPAQEPTMAPS NWTTSWAVLNVYCMQDTVATAGHQPLAAQPLKGEPS >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_11|651_bp atgacaggaaaggatgtaggacaaatggacgtgaggagaacaggatggggtggctacaga atggagacagaagaccacatgaagactcaaccacacctcgtgacggcaggcacaggaact ggggacagcgagaggcccagaagctcagttcaggacacagtaattttgtggcacctgcaa gctaaaggagaactctggggagaagatgtttataaacaagagggagggcacaagcacatc atggaaaaagtgaaattgaaagacctacccactcacgggaaacccagttcctacctgtcc tttcttccaacacagcagcccacaccatcagtagccccggctgtattagaagacaaacgc agctttcttctctgcctgcagactgctgcctgcaaccatccggtgcaccatcgcactctg tgcagcaaagaccagtctcctcatatcattcactccttatcacccttatcacacagtaag gacccagcactcaatgtcatcacttcagtccctgctcaagaacctactatggctccctcc aactggacgacttcatgggcagtactgaatgtctactgtatgcaggacactgtggccaca gcagggcaccaacccttagcagcccagcccctcaaaggagaacccagttaa >gi568815594f:153250063_153514854|GENSCAN_predicted_peptide_12|79_aa MRTHNVENNTNVNTMFQTLGSHKPRESTETPRESAVRQETGVPQLQAKKPHRLPASPQKL GKGEGGLPEGFRGSVALPA >gi568815594f:153250063_153514854|GENSCAN_predicted_CDS_12|240_bp atgaggactcacaatgtggagaataatacaaatgtgaacacgatgttccagactcttgga agccataagccacgtgaaagcacggagacacccagggagagcgctgtgcgacaagagact ggagtgccgcagctgcaagccaagaaaccccacaggctgccggcgagcccgcagaagcta ggaaaaggcgagggcggactccccgagggtttcagaggcagcgtggccctgccggcttga