GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:18:01 Sequence gi568815581r:4182838_4466316 : 283479 bp : 45.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 714 561 154 1 1 36 91 227 0.894 16.93 1.10 Intr - 1073 975 99 0 0 17 95 75 0.404 1.18 1.09 Intr - 2209 1981 229 1 1 68 115 203 0.667 18.44 1.08 Intr - 6642 6545 98 1 2 36 106 65 0.703 2.83 1.07 Intr - 12340 12141 200 0 2 84 103 202 0.999 20.29 1.06 Intr - 12634 12566 69 0 0 130 60 80 0.896 7.70 1.05 Intr - 14740 14536 205 2 1 97 99 250 0.998 25.26 1.04 Intr - 23649 23484 166 0 1 33 75 155 0.994 8.23 1.03 Intr - 25245 25096 150 0 0 74 84 175 0.999 16.06 1.02 Intr - 27110 26987 124 1 1 92 92 176 0.942 18.99 1.01 Init - 29387 29329 59 2 2 68 51 1 0.188 -4.82 1.00 Prom - 31937 31898 40 -7.46 2.00 Prom + 32533 32572 40 -3.46 2.01 Init + 38967 39015 49 2 1 77 58 57 0.431 0.71 2.02 Intr + 39258 39880 623 1 2 -35 49 384 0.264 14.20 2.03 Intr + 40021 40149 129 0 0 1 67 123 0.787 2.39 2.04 Intr + 40461 40946 486 2 0 60 52 370 0.871 23.81 2.05 Term + 42761 42835 75 1 0 35 54 65 0.304 -4.36 2.06 PlyA + 43650 43655 6 1.05 3.05 PlyA - 47799 47794 6 1.05 3.04 Term - 50134 49989 146 2 2 59 42 100 0.867 0.57 3.03 Intr - 53053 52935 119 0 2 85 99 52 0.929 6.11 3.02 Intr - 59611 59419 193 2 1 114 75 384 0.953 38.25 3.01 Init - 80893 80758 136 1 1 95 74 172 0.250 14.81 3.00 Prom - 87947 87908 40 -5.46 4.06 PlyA - 88014 88009 6 1.05 4.05 Term - 89679 89668 12 0 0 114 54 9 0.024 -1.70 4.04 Intr - 106571 106393 179 0 2 57 91 119 0.893 8.74 4.03 Intr - 113977 113880 98 0 2 63 115 -15 0.028 -1.65 4.02 Intr - 124286 124184 103 0 1 114 83 94 0.191 10.73 4.01 Init - 141281 141206 76 2 1 32 105 20 0.005 -0.65 4.00 Prom - 152132 152093 40 -2.16 5.00 Prom + 175225 175264 40 -2.96 5.01 Init + 182605 183643 1039 0 1 91 70 217 0.412 12.55 5.02 Intr + 213081 213271 191 1 2 58 47 172 0.469 9.40 5.03 Term + 230422 230598 177 0 0 70 44 156 0.866 7.09 5.04 PlyA + 234452 234457 6 1.05 6.00 Prom + 235943 235982 40 -5.16 6.01 Init + 251131 251329 199 0 1 83 94 110 0.696 8.98 6.02 Intr + 253172 253320 149 0 2 57 64 85 0.703 2.95 6.03 Intr + 253419 253518 100 2 1 8 55 110 0.481 -0.62 6.04 Intr + 256821 256886 66 1 0 111 95 42 0.725 6.08 6.05 Intr + 262195 262331 137 2 2 76 87 113 0.922 10.29 6.06 Intr + 263211 263362 152 2 2 41 81 264 0.999 19.96 6.07 Intr + 264059 264125 67 2 1 101 72 91 0.793 7.71 6.08 Intr + 265318 265466 149 0 2 118 87 113 0.998 13.23 6.09 Intr + 266398 266550 153 1 0 77 94 172 0.504 15.89 6.10 Intr + 270179 270390 212 2 2 98 59 231 0.508 19.76 6.11 Term + 275209 275231 23 2 2 95 50 34 0.279 -1.23 6.12 PlyA + 276878 276883 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_1|518_aa MVDVRKSRETILVVCDCPWRCEKGVMSLVNVRNCIRFYQTAEELNASTLMNYCAEIIASH WDDLRKEDFSSMSAQLLYKMIKSKTEYPLHKAIKVEREDVVFLYLIEMDSQLPGKLNEAD HNGDLALDLALSRRLESIATTLVSHKADVDMVDKSGWSLLHKGIQRGDLFAATFLIKNGA FVNAATLGAQETPLHLVALYSSKKHSADVMSEMAQIAEALLQAGANPNMQDSKGRTPLHV SIMAGNEYVFSQLLQCKQLDLELKDHEGSTALWLAVQHITVSSDQSVNPFEDVPVVNGTS FDENSFAARLIQRGSHTDAPDTATGNCLLQRAAGAGNEAAALFLATNGAHVNHRNKWGET PLHTACRHGLANLTAELLQQGANPNLQTEEALPLPKEAASLTSLADSVHLQTPLHMAIAY NHPDVVSVILEQKANALHATNNLQIIPDFSLKDSRDQTVLGLALWTGMHTIAAQLLGSGA AINDTMSDGQTLLHMAIQRQDSKSALFLLEHQADINVS >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_1|1554_bp atggtggatgtcaggaaatctcgtgaaaccatactagtcgtctgtgactgcccttggaga tgtgagaagggtgttatgtctctagtgaatgtcaggaactgtattcgcttctaccagacg gcagaggagctgaatgccagcacactgatgaactactgtgcagaaattattgcaagtcat tgggacgacctgaggaaggaggatttcagcagcatgagcgctcagttgttatacaaaatg atcaaatccaagacagagtacccgctacataaagccatcaaagtggagagagaagacgtg gtcttcctgtatctgattgaaatggattcccagctccctgggaagctgaatgaagcggat cataacggagatctggcattagatctagccctctcacgacgactggagagtattgccacc acgctggttagtcacaaagctgatgtggacatggtggacaagagtggctggagcttgtta cacaaaggaatccaaagaggagatctctttgctgccactttcctcattaagaatggggcc tttgtcaacgctgctacactgggtgcccaggagacaccactgcaccttgtggccttgtac agttcaaagaaacactcagcagatgtgatgtctgagatggcgcagattgcagaggccctt ctgcaggctggtgccaaccccaacatgcaggacagcaaggggaggactcctttacatgtg tccatcatggccgggaatgaatatgtgttcagtcagctgctgcagtgcaaacaactagat ttagaactcaaagaccacgagggcagcacggctctgtggctggcagtgcagcatatcaca gtgtcttctgaccagtctgtgaaccccttcgaagatgtccccgtggtaaatgggacttca tttgatgagaacagctttgcagccagactcatccagcgcggcagccacacagacgcacct gacacggcgacaggaaactgtttactacagcgggcagctggagcaggaaacgaggcagca gctcttttcctggcaaccaacggtgcccatgtcaaccacagaaacaagtggggagaaacc ccgttgcacacagcgtgtcggcatggcctggccaacctcacggcagagctcctgcagcaa ggcgccaacccaaacctgcagacggaggaagctctgcctctgccaaaggaggccgcatcc ctgaccagcttggcggacagcgtccatctgcagacgccactgcacatggcgatcgcctat aaccatccggatgtggtgtctgtcatcctggagcagaaagccaatgctcttcatgccacc aacaacttgcagatcattccggacttcagcctcaaagattcccgagaccagactgtgctg ggcctggcattatggactggcatgcacacgatcgcagcccagctgctgggctctggagcc gccatcaatgacaccatgtcggatgggcagacgctactgcacatggccatacagcggcag gacagcaagagcgcactcttcctgctggagcaccaggcagatataaatgtcagn >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_2|453_aa MRFRHVGQAGLELLTSGVGGGAAGGQAPGGVAGVPRGQKAPPPPLLLPLLPAPGAAAPAA PRTPELQSAAAGPSVSLYLSEDEVHRLIGLDAELYYVTNDLIIHYALSFNLLVPSETNFL HFTWHSKSKVECKLGFQADNVLAMDLPQGNISVQGEVLRTLSVFRVELSCTGKVDSEVMI LMQLNLTVNSSKNFTVLNFKPRKMCTKNGPDTQLQPLLRVFYIRTTQYPRADTPNNPTPI TSSSGYPTLRIEKNDLRSVTLLEAKAKCKLVEANNPQAISQQDLVHMAIQIACGMSYLAR TEVIHKDLAARNCITDDTLQVKITDNAVSWTITVWGTMKTGQFVGWLLKVWLITSFLALV MCGAFGVMLWELMTLGQTPYMDTDPFEMAAYLKDGYRIAQPINCPDEPFAVMACCWALDP GERPKFQQLLKILDRVQNNNDLKIWKSQQKQLN >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_2|1362_bp atgaggtttcgccatgttggtcaggctggtctggaactcctgacctcaggggtaggcggc ggtgcggcgggcggccaggcgccgggaggcgtggctggggtgccccgcggccagaaggcc ccgccgccgccgctgctgctacccctgctgcccgcgcctggcgccgccgcccccgccgcc ccgcggaccccggagctgcagtcggcggccgcggggcccagcgtgagcctctacctgagt gaggatgaggtgcaccggctgatcggtcttgatgcagaactttattatgtgacaaatgac cttattattcactacgctctgtcctttaatctgttagtacccagtgagacaaatttcctg cacttcacctggcattcgaagtccaaggttgaatgtaagctgggattccaagcggacaat gttttggcaatggacctgccccagggcaacatttctgttcagggagaagttctacgcact ttatcagtatttcgggtagagctttcctgtactggcaaagtagattctgaagttatgata ctaatgcagctcaacttgacagtcaattcttcaaagaattttaccgtcttaaattttaaa ccaaggaaaatgtgcaccaaaaatggccctgacacgcagctccaaccacttctacgcgtg ttttatattaggacgactcagtatccgagagctgacacacccaacaatccaactcctatc accagctcctcaggttatcctaccttgcggatagagaagaacgacttgagaagtgtcact cttttggaggccaaagccaagtgcaagttagtagaggccaataatccacaggcaatttct caacaagacctggtacacatggctattcagattgcctgtggaatgagctacctggccaga acggaagtcatccacaaagacctggctgctaggaactgtatcactgatgacacacttcaa gttaagatcacagacaatgccgtctcatggactatcactgtctgggggacaatgaaaaca ggccagtttgttggatggctcttgaaagtctggttaataacgagttttctagcactagtg atgtgtggggcctttggagtgatgctgtgggaactcatgactctgggccagacgccctac atggacactgaccccttcgagatggccgcatacctgaaagatggttaccgaatagcccag ccaatcaactgtcctgatgaaccatttgctgtgatggcctgttgctgggccttagatccg ggggagaggcccaagtttcagcagctgctaaaaattctggacagagtacaaaacaacaac gacctaaagatttggaaaagtcaacaaaagcagctgaactga >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_3|197_aa MPTPRDCGRLRSRAGRSRAGAACSRGAPRAAREALDCRRCRDAGGKEVAKLEKHLMLLRQ EYVKLQKKLAETEKRCALLAAQANKESSSESFISRLLAIVADLYEQEQYSDLKIKVGDRH ISAHKFVLAARSDSWSLANLSSTKELDLSGHSDSVNVPQFYFNCEISTATKMNESGKNET SLLSETKRHNTENYIKE >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_3|594_bp atgccgaccccgcgggactgcggccggctgcggagccgggctggcaggtcccgcgccggt gctgcgtgcagccgcggggccccgagggcagcacgggaggctcttgattgccggcggtgc cgggacgccgggggaaaggaggtggccaagttggagaagcacttgatgcttctgcggcag gagtatgtcaagctgcagaagaagctggcggagacagagaagcgctgcgctctcttggct gcgcaggcaaacaaggagagcagcagcgagtccttcatcagccgtctgctggccatcgtg gcagacctctacgagcaggagcagtacagcgatctgaagataaaggttggggacaggcac atcagtgctcacaagtttgtcctggcagcccgcagtgacagctggagtctggctaacttg tcttccactaaagagttggacctgtcaggtcactcagacagtgttaatgttcctcagttt tattttaattgtgaaatatcgactgccaccaaaatgaatgagagtggcaaaaacgaaaca tcgctgctctctgaaacaaaacgacacaacacagaaaattacatcaaggaataa >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_4|155_aa MVLNSNNYLVIARFLRCEDALSLFTELNKNPVEGFSAGLIDDNDLYRWEVLIIGPPDTLY EGGVFKAHLTFPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPGEDKYGYEKPEER WLPIHTVETIMISVISMLADPNGDSPANVDAAVSN >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_4|468_bp atggttctaaattcgaacaattatttggtgattgccagatttctccgttgcgaagatgca ttatctctttttactgaactcaacaaaaatccagtggaaggcttttctgcaggtttaata gatgacaatgatctctaccgatgggaagtccttattattggccctccagatacactttat gaaggtggtgtttttaaggctcatcttactttcccaaaagattatcccctccgacctcct aaaatgaaattcattacagaaatctggcacccaaatgttgataaaaatggtgatgtgtgc atttctattcttcatgagcctggggaagataagtatggttatgaaaagccagaggaacgc tggctccctatccacactgtggaaaccatcatgattagtgtcatttctatgctggcagac cctaatggagactcacctgctaatgttgatgctgcggtctccaattga >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_5|468_aa MGEGGRRLLGEADPAAAPGQSLRAQLRGLQACAAAGIAASTKPGAGQRAALPAARSYPPP AGRRSRSPCQAEAGEAAIRTPPGSPAPSRPRGGRRGQRFCCRPQLRGPSAARGEVLCASP IPPGPVPSARPQTPSRGLAAALGPDAPAGPEGPAAPAAPEPGTRPSKLAAPAWGPAHSPS PATSYRTVGEGTSSSPTPRAPGPVPGPELGRVPAPSALQEPGENGWARPGPEPRGPHPRR PGPLRQYGRWPGRRRGTGLRLSAIAAGPGAPPPACSPASCLRSSSADCSSVILPAEGPGW RRGFRRAGDRLWGRLERGVPRNPGPATGAPEPRKAGLRRRERRLAAAVPDSHGHLVLVLG STDTTATLKGTRNSTSISSQMVTVAAVSKEGVHSSSTDDKTETPRSQVTCVAPRRQTLKP SVLGASLGGRFGPATERPLSEGSQLRLQRHLVATGHQTWDLPRGLPET >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_5|1407_bp atgggggaggggggccgcaggctcctcggggaggccgacccggccgccgccccgggccag agcctgcgggcgcagctccggggtttgcaggcctgcgcagcagccggaatcgcggcgagc acaaagcccggagccggccagcgagcggctctccccgcggcccgctcctaccctccccca gccgggcggcgcagccggagcccgtgccaggcggaggcaggcgaggcggcgatccgcact ccgccagggtctccggcaccctcgcgcccgcgtggaggccgccgaggccagcgcttttgt tgtcggccccaactccgcggacccagcgccgcccggggagaagttttgtgcgcctcacct atcccgccagggccggtcccttccgcccgccctcagacaccgagccggggactcgcagcc gcactgggtccggacgctccagccggccccgaaggtccggcggccccggcagcacctgaa cccgggacccgcccctccaagctggcggccccggcctggggacccgcacacagcccctcc cccgcaacgtcctacagaacggtgggggaggggacctcctcctctccgaccccccgggcg ccaggcccagtccctgggccggagctgggtcgcgtccccgcccccagtgccctccaagag ccgggcgagaacggctgggcccggccgggaccggagcctcgaggtccccaccctcgcagg cccgggccccttcggcagtacggccgctggcccgggaggagaagagggactgggctgcgg ctgtccgcgatcgcggccgggcccggcgccccgccgcccgcctgctcacctgccagctgt cttcgcagtagcagtgccgactgcagctccgtcatcctccctgccgagggcccgggctgg cgccggggcttccgaagggctggggacaggctctgggggcggctggagcggggtgtgccg aggaacccgggccccgcgaccggagcgccggagccgaggaaggccgggctgaggcggcgg gagcggcgcctcgctgccgcagttccagacagccacggccacctggtcctggtcctcggg agtaccgacacaactgccaccctaaagggcaccaggaactctacctccatcagctcgcag atggtcaccgtggcagctgtctctaaggagggtgttcactcctcttccacagatgacaaa actgagactccaagaagccaagtgacttgtgtcgcccctagaaggcaaacgctgaagcca agtgttcttggagcgtctcttggtggacgcttcggccctgccacggaacggccccttagt gaaggttctcagcttagacttcagcgccacctggtggccaccgggcatcagacctgggac ttgcctcgagggctgccagaaacttga >gi568815581r:4182838_4466316|GENSCAN_predicted_peptide_6|468_aa MAGGMSAECPEPGPGGLQGQSPGPGRQCPPPITPTSWSLPPWRAYVAAAVLCYINLLNYM NWFIIADRLSTNCGQDPLLAAVDPGRKSDLEFLNSGMTISWGRDRGWLMAVLRGGELPAQ HAREWLWNHETLGAAEPSPDLVIVFEELRGVLLDIQEVFQISDNHAGLLQTVFVSCLLLS APVFGYLGDRHSRKATMSFGILLWSGAGLSSSFISPRYSWLFFLSRGIVGTGSASYSTIA PTVLGDLFVRDQRTRVLAVFYIFIPVGSGLGYVLGSAVTMLTGNWRWALRVMPCLEAVAL ILLILLVPDPPRGAAETQGEGAVGGFRSSWCEDVRYLGKNWSFVWSTLGVTAMAFVTGAL GFWAPKFLLEARVVHGLQPPCFQEPCSNPDSLIFGALTIMTGVIGVILGAEAARRYKKVI PGAEPLICASSLLATAPCLYLALVLAPTTLLASYVSESLYGECSDVLT >gi568815581r:4182838_4466316|GENSCAN_predicted_CDS_6|1407_bp atggctggggggatgtcagcggagtgccctgagcctgggccaggaggtctgcagggccag tccccagggccaggcaggcagtgtccccctcccatcacgcccacctcctggagcctgccc ccgtggagggcctacgtggctgccgccgtcctctgctacatcaacctcctgaattacatg aactggttcatcattgcagaccggctgagcactaactgtgggcaggaccccctgctggct gctgtggacccagggaggaaatcggacttggagttcctgaactcaggaatgaccatctca tggggcagagatagaggctggttgatggcagtcctgcgaggcggagagttaccagctcag catgccagagagtggctatggaatcatgaaactctaggggctgcagagccatccccagac ctggtcatcgtctttgaggagctcagaggagtgctgctggatatacaggaggttttccag atcagtgacaaccatgctggtttgcttcagactgtcttcgttagctgcctgctgctgtct gcacctgtgtttggctacctgggcgaccgacatagccgcaaggctaccatgagcttcggt atcttgctgtggtcaggagctggcctctctagctccttcatctccccccggtattcttgg ctcttcttcctgtcccggggcatcgtgggcactggctcggccagctactccaccatcgcg cccaccgtcctgggcgacctcttcgtgagggaccagcgcacccgcgtgctggctgtcttc tacatctttatccccgttggaagtggtctgggctacgtgctggggtcggctgtgacgatg ctgactgggaactggcgctgggccctccgagtcatgccctgcctggaggccgtggccttg atcctgcttatcctgctggttccagacccaccccggggagctgccgagacacagggggag ggggccgtgggaggcttcaggagcagctggtgtgaggacgtcagatacctggggaaaaac tggagtttcgtgtggtcgaccctcggagtgaccgccatggcctttgtgactggagccctg gggttctgggcccccaagtttctgctcgaggcacgcgtggttcacgggctgcagcctccc tgcttccaggagccgtgcagcaaccccgacagcctgatttttggggcactgaccatcatg accggcgtcattggggtcatcttgggggcagaagctgcgaggaggtacaagaaagtcatt ccaggagctgagcccctcatctgcgcctccagcctgcttgccacagccccctgcctctac ctggctctcgtcctggccccgaccaccctgctggcctcctatgtaagtgagagcctctat ggagaatgctcggatgttctgacctga