GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:44:10 Sequence gi568815581f:34256527_34458128 : 201602 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 196 301 106 1 1 105 47 127 0.629 8.18 1.02 PlyA + 655 660 6 1.05 2.00 Prom + 13659 13698 40 -1.46 2.01 Init + 13765 13840 76 0 1 79 110 113 0.972 11.87 2.02 Intr + 14620 14737 118 2 1 109 48 63 0.981 3.92 2.03 Term + 15171 15276 106 0 1 120 47 102 0.980 7.18 2.04 PlyA + 15750 15755 6 1.05 3.00 Prom + 25535 25574 40 -4.46 3.01 Init + 29283 29358 76 2 1 94 115 102 0.771 12.86 3.02 Intr + 30570 30681 112 1 1 44 95 57 0.761 1.44 3.03 Term + 31059 31164 106 0 1 64 36 145 0.958 4.78 3.04 PlyA + 31302 31307 6 -0.45 4.05 PlyA - 31331 31326 6 1.05 4.04 Term - 34298 34133 166 1 1 78 42 83 0.137 0.09 4.03 Intr - 34865 34785 81 1 0 80 81 40 0.160 1.25 4.02 Intr - 38754 36585 2170 1 1 43 53 710 0.050 50.34 4.01 Init - 42082 42034 49 1 1 62 53 44 0.048 -0.59 4.00 Prom - 44565 44526 40 -4.36 5.00 Prom + 47095 47134 40 -3.46 5.01 Init + 62976 63051 76 2 1 84 110 109 0.977 12.00 5.02 Intr + 63743 63860 118 0 1 77 76 63 0.931 3.52 5.03 Term + 64276 64381 106 1 1 118 52 36 0.846 0.88 5.04 PlyA + 65843 65848 6 1.05 6.03 PlyA - 66942 66937 6 1.05 6.02 Term - 68755 68321 435 1 0 68 42 118 0.382 0.49 6.01 Init - 80656 80252 405 1 0 45 8 286 0.163 12.89 6.00 Prom - 81868 81829 40 -1.46 7.00 Prom + 90016 90055 40 -2.36 7.01 Init + 100001 100076 76 1 1 69 111 98 0.956 9.81 7.02 Intr + 100949 101063 115 0 1 73 34 86 0.569 1.21 7.03 Term + 101500 101605 106 1 1 107 43 80 0.921 3.28 7.04 PlyA + 102060 102065 6 -3.74 8.04 PlyA - 102072 102067 6 1.05 8.03 Term - 104135 104033 103 1 1 122 53 26 0.484 0.25 8.02 Intr - 105370 105259 112 2 1 96 101 -19 0.687 -0.36 8.01 Init - 106635 106560 76 0 1 83 73 127 0.941 10.29 8.00 Prom - 109334 109295 40 -4.46 9.00 Prom + 109914 109953 40 -7.66 9.01 Init + 110822 110888 67 1 1 46 103 67 0.493 5.23 9.02 Intr + 119172 119292 121 1 1 33 91 96 0.127 3.95 9.03 Intr + 129185 129360 176 2 2 38 37 101 0.045 -0.42 9.04 Intr + 132511 132659 149 2 2 49 94 91 0.075 5.75 9.05 Term + 132825 132881 57 2 0 72 40 90 0.714 0.29 9.06 PlyA + 136038 136043 6 1.05 10.10 PlyA - 136755 136750 6 1.05 10.09 Term - 144500 144208 293 0 2 71 43 166 0.407 5.91 10.08 Intr - 151682 151632 51 0 0 42 95 52 0.074 0.18 10.07 Intr - 166652 166524 129 0 0 86 70 14 0.333 0.07 10.06 Intr - 168219 168097 123 1 0 87 109 19 0.585 4.46 10.05 Intr - 178475 178313 163 2 1 88 49 25 0.013 -1.85 10.04 Intr - 192357 192289 69 0 0 77 111 67 0.637 7.28 10.03 Intr - 193084 192927 158 2 2 65 60 18 0.406 -3.57 10.02 Intr - 197713 197426 288 2 0 94 92 73 0.579 5.52 10.01 Init - 200865 200826 40 0 1 97 39 70 0.747 3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 26299 26427 129 0 0 78 99 105 0.831 10.75 S.002 Term + 26787 26891 105 2 0 38 50 83 0.868 -2.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_1|35_aa XFKTIVAKEICADPKQKWVQDSMDHLDKQTQTPKT >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_1|108_bp nncttcaagaccattgtggccaaggagatctgtgctgaccccaagcagaagtgggttcag gattccatggaccacctggacaagcaaacccaaactccgaagacttga >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_2|99_aa MKASAALLCLLLTAAAFSPQGLAQPVGINTSTTCCYRFINKKIPKQRLESYRRTTSSHCP REAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQTPKL >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_2|300_bp atgaaagcctctgcagcacttctgtgtctgctgctcacagcagctgctttcagcccccag gggcttgctcagccagttgggattaatacttcaactacctgctgctacagatttatcaat aagaaaatccctaagcagaggctggagagctacagaaggaccaccagtagccactgtccc cgggaagctgtaatcttcaagaccaaactggacaaggagatctgtgctgaccccacacag aagtgggtccaggactttatgaagcacctggacaagaaaacccaaactccaaagctttga >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_3|97_aa MKVSAALLWLLLIAAAFSPQGLAGPASVPTTCCFNLANRKIPLQRLESYRRITSGKCPQK AVIFKTKLAKDICADPKKKWVQDSMKYLDQKSPTPKP >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_3|294_bp atgaaggtctccgcagcacttctgtggctgctgctcatagcagctgccttcagcccccag gggctcgctgggccagcttctgtcccaaccacctgctgctttaacctggccaataggaag ataccccttcagcgactagagagctacaggagaatcaccagtggcaaatgtccccagaaa gctgtgatcttcaagaccaaactggccaaggatatctgtgccgaccccaagaagaagtgg gtgcaggattccatgaagtatctggaccaaaaatctccaactccaaagccataa >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_4|821_aa MTIALQINSGMALKGRKIQTTIGEYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESL NRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSF DEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPG MQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKI IRAIYDKPIANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQ LGKEEVKLSLFADDMIIYLENPIVSAQNLLKLISNFSKVSEYKINVQKSQAFLYTNNRQT ESQIMSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGR INIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGG ITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHTYNYLIFDKPEKNKQWGKD SLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRRKTIKTLEENLGITIQDIG MGKDFMSKTPKAMATKDKIDKWDLIKIKSFCTAKETTVRVNRQPTKWEKIFATYSSDKGL ISRIYNELQQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIKKSGNNRLAGQETNRQGEEEVTIRARVMGPGHQKSKVLLPQLLEVLA ANASQRNFLYEARRQLKTAALPRVVSSSRSILQPGLVQMGV >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_4|2466_bp atgacaattgctctacagatcaactcagggatggccttgaaagggagaaaaatacaaact accatcggagaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatg gataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctg aatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaaaag agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggta ccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt gatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagagaat tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacagagcc aaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaata ataagagctatctatgacaaacccatagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattatatatctagaa aaccccattgtctctgcccaaaatctccttaagctgataagcaacttcagcaaagtctca gaatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaacaacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggacatgaaggacctcttcaaggagaactacaaaccactgctc aaagaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccg catacctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgtt agacgtaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggc atgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaaataaagagcttctgcacagcaaaagaaactaccgtcagagtg aacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggcta atatccagaatctacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggtgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccact atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaga cttgcaggccaagaaactaacaggcaaggagaggaagaggtcactatccgggccagggta atgggccctggtcatcagaagtctaaagtgcttcttccccagctgctagaagtgttggca gccaatgcttctcagcggaatttcctctatgaggcccgccgtcagctaaaaacagctgcc ttgcccagagttgtgtcctcttccaggagcatcctgcagccagggctggttcagatgggg gtataa >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_5|99_aa MKVSAALLCLLLMAATFSPQGLAQPDSVSIPITCCFNVINRKIPIQRLESYTRITNIQCP KEAVIFKTKRGKEVCADPKERWVRDSMKHLDQIFQNLKP >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_5|300_bp atgaaggtttctgcagcgcttctgtgcctgctgctcatggcagccactttcagccctcag ggacttgctcagccagattcagtttccattccaatcacctgctgctttaacgtgatcaat aggaaaattcctatccagaggctggagagctacacaagaatcaccaacatccaatgtccc aaggaagctgtgatcttcaagaccaaacggggcaaggaggtctgtgctgaccccaaggag agatgggtcagggattccatgaagcatctggaccaaatatttcaaaatctgaagccatga >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_6|279_aa MCKFLDTYTLPRLNQEDVECLNRPITSSEIEAVIKSLPTKKSPRTNGFTAKFYQMYKEEL VTFLLKLFQTIEKGGFLPNSFYEASIILIPKCVKDTPKEENLRAISLMNVNVKILHKIPA NQIQQHIKKLIHHDQPTQMRRNQKNSSGNMTKQGSSTPPKDHTSSPAMDPNQDEISELPE KEFIRSIIKLIKEAPEKGEVLLKEIKKKIQHMNGKISSEIDSINKSQSQLLEMKDTVREM QNVLESLSSRIKQAEERTSELEDKAFELNQSNKGEEKRI >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_6|840_bp atgtgtaaattcctggacacatacaccctcccaagactaaaccaggaagatgttgaatgc ctgaatagaccaataacaagttctgaaattgaggcagtaattaagagcctaccaaccaaa aaaagcccaagaacaaatggattcacagccaaattctaccagatgtacaaggaggagctg gtaacattccttctgaaactattccaaacaatagaaaaagggggattcctccctaactca ttttatgaggccagcatcatccttataccaaaatgcgtcaaagacacaccaaaagaagaa aatttaagggcaatatccctgatgaatgtcaacgtgaaaatcctccataaaataccggca aaccaaatccagcagcacatcaaaaagcttatccaccacgatcaacctacccaaatgaga aggaaccagaaaaacagttctggtaatatgacaaaacaaggctcttcaacacccccaaaa gatcacactagctcaccagcaatggatccaaaccaagatgaaatctctgaattgccagaa aaagaattcataaggtcaattattaagctaatcaaggaggcaccagagaaaggtgaagtc ctacttaaagaaataaaaaaaaagatacagcatatgaatggaaaaatctccagtgaaata gatagcataaataaatcacaatcacaacttctggaaatgaaggacacagttagagaaatg caaaatgtgctggaaagtctcagcagtagaatcaaacaagcagaagaaagaacttcagag cttgaagacaaggcttttgaattaaaccaatccaacaaaggtgaagaaaaaagaatttaa >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_7|98_aa MKVSAVLLCLLLMTAAFNPQGLAQPDALNVPSTCCFTFSSKKISLQRLKSYVITTSRCPQ KAVIFRTKLGKEICADPKEKWVQNYMKHLGRKAHTLKT >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_7|297_bp atgaaagtctctgcagtgcttctgtgcctgctgctcatgacagcagctttcaacccccag ggacttgctcagccagatgcactcaacgtcccatctacttgctgcttcacatttagcagt aagaagatctccttgcagaggctgaagagctatgtgatcaccaccagcaggtgtccccag aaggctgtcatcttcagaaccaaactgggcaaggagatctgtgctgacccaaaggagaag tgggtccagaattatatgaaacacctgggccggaaagctcacaccctgaagacttga >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_8|96_aa MQIITTALVCLLLAGMWPEDVDSKSMQVPFSRCCFSFAEQEIPLRAILCYRNTSSICSNE GLIFKLKRGKEACALDTVGWVQRHRKMLRHCPSKRK >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_8|291_bp atgcagatcatcaccacagccctggtgtgcttgctgctagctgggatgtggccggaagat gtggacagcaagagcatgcaggtacccttctccagatgttgcttctcatttgcggagcaa gagattcccctgagggcaatcctgtgttacagaaataccagctccatctgctccaatgag ggcttaatattcaagctgaagagaggcaaagaggcctgcgccttggacacagttggatgg gttcagaggcacagaaaaatgctgaggcactgcccgtcaaaaagaaaatga >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_9|189_aa MYQTLGIPQRDKTNMGPGIGNCAVWRTDFQDNKNQSRGNSKKPGKGGQWLRLARGQRKVD GSSSLSAEGAQKPRPPIGKGAFLKSLKETMKAHNLKGCQKSLQNSALTSAKASHTIAEYV QWTDPLVASDRERGRSMGEDADTQKDPNLYRKVKVDISPAPGSEPDMHGAKETVILAKLL CGQDENEEE >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_9|570_bp atgtaccagacactggggataccgcagcgggacaagacaaacatgggccctggtattggg aattgtgctgtatggagaacagattttcaggacaataagaaccagtcaagaggcaatagc aagaagccaggcaagggagggcaatggctccggctggcccggggacagcgcaaagtggat ggatccagctccttgtccgcagaaggtgctcagaagccacggcctcctataggaaaggga gcctttcttaagtccctgaaagagacaatgaaggctcataatcttaagggttgccaaaag tctttgcaaaattctgccttgaccagcgctaaagcctcacacaccattgctgagtacgtg cagtggactgatccactggtggccagtgacagggagaggggaaggagcatgggagaagat gcagatacccagaaagatccaaacctctaccgcaaagtcaaggtagacatttccccagcc ccggggagtgagcctgatatgcatggagcaaaggagacagtgattctcgccaagctcctg tgtgggcaggatgagaacgaggaggaataa >gi568815581f:34256527_34458128|GENSCAN_predicted_peptide_10|437_aa MGATDHMLIDDRICNCLELSIPIAPSSPNLLLCLGGSRSSSAECGTWKTGSNPAPSFLRP GMKVQRQRASSPESELAQQVMGSTRHKGPGSCMCLHGNGQPAQNLCALTEPFIEHLLYAR SLNNLHSGETDTVISCKSVSVSYIQLHIKCCGHTEDFIGALGMHGGSGRGPDPNLLEPFA TQMNNVVVTQEVHGIRTTLVLNVESLRCQLVTLGQFQTSLLLSVCSVDDGNICPAGAPVS HLCRIRGVAALGLGAWWDEQQRECGLRCRKGLGSGATEKPVQGLECQSWPLTLLHVQFWL CGVCEVQGDMTRVTVCVSSPPSKVLAIKKLSTAVERDEASGPGTGKSLLLQKRTRFAMFI SHIFSGTIMFLYSKFPLTSVIRYRSQTTEFVATVTSPPQMCDGQHVEGDLCASSGSPCCR GAVRGNLEILVSLPIFP >gi568815581f:34256527_34458128|GENSCAN_predicted_CDS_10|1314_bp atgggggccaccgaccacatgctgattgatgacagaatctgcaactgcctggagctgagt ataccaattgctccttcttccccaaatctcttgctttgtctaggggggtctagaagcagc tcagcagaatgtggaacctggaagacaggctctaatccagccccctcgtttttgagacca ggaatgaaggttcagaggcagagggcaagttccccagagtcagagcttgcacagcaggtc atgggcagcaccaggcacaagggtccaggcagctgcatgtgcctccatggcaatgggcag cctgctcagaacctctgtgcactcacagaacccttcattgagcatctgctgtatgctagg tccttaaataacttacattctggggagacagacacagtaattagttgtaaaagtgtatca gtgtcatacatacagttacatatcaaatgctgtggacatacagaagactttattggtgct ttggggatgcatgggggtagtggaagaggtcctgaccccaacttgctggagccttttgct acccagatgaacaacgtggtggtgacacaggaagtgcatgggataaggacaaccctggtt ttgaatgtggagtctctcaggtgtcagctggtgactttgggccaattccagacctctctg ctcctctctgtatgcagtgtggatgatggtaacatctgccctgcaggagcccctgtgtct cacttgtgtagaattcgaggggttgcagctttgggtttgggggcttggtgggacgagcag cagagggaatgtggcctgcggtgtaggaagggcttaggctcaggtgctactgagaagcca gtgcaggggctggagtgtcagagctggcccctcaccctactgcatgtgcagttctggtta tgtggggtctgtgaggtccagggagacatgaccagagtgacggtgtgtgtcagctccccc ccctccaaagtacttgccattaagaagctgagtacagctgtggagagggatgaggcatca ggtcctggcacaggcaagagcctgctcctgcagaagaggacccgctttgccatgtttatt tctcatatcttctctggcacgatcatgttcctttattcaaaatttccactcacatcagtc atccgttatcgatcacagacgacggagtttgtggccacagtgacatctcccccacagatg tgtgatggccagcatgtggaaggggacctttgtgctagctcaggctctccgtgttgcagg ggagcagtcagaggaaacttggaaattcttgtctctttgcccatctttccctag