GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:48:44 Sequence gi568815581r:34260562_34463161 : 202600 bp : 43.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9624 9663 40 -1.46 1.01 Init + 9730 9805 76 0 1 79 110 113 0.972 11.87 1.02 Intr + 10585 10702 118 2 1 109 48 63 0.981 3.92 1.03 Term + 11136 11241 106 0 1 120 47 102 0.980 7.18 1.04 PlyA + 11715 11720 6 1.05 2.00 Prom + 21500 21539 40 -4.46 2.01 Init + 25248 25323 76 2 1 94 115 102 0.771 12.86 2.02 Intr + 26535 26646 112 1 1 44 95 57 0.761 1.44 2.03 Term + 27024 27129 106 0 1 64 36 145 0.958 4.78 2.04 PlyA + 27267 27272 6 -0.45 3.05 PlyA - 27296 27291 6 1.05 3.04 Term - 30263 30098 166 1 1 78 42 83 0.137 0.09 3.03 Intr - 30830 30750 81 1 0 80 81 40 0.160 1.25 3.02 Intr - 34719 32550 2170 1 1 43 53 710 0.050 50.34 3.01 Init - 38047 37999 49 1 1 62 53 44 0.048 -0.59 3.00 Prom - 40530 40491 40 -4.36 4.00 Prom + 43060 43099 40 -3.46 4.01 Init + 58941 59016 76 2 1 84 110 109 0.977 12.00 4.02 Intr + 59708 59825 118 0 1 77 76 63 0.931 3.52 4.03 Term + 60241 60346 106 1 1 118 52 36 0.846 0.88 4.04 PlyA + 61808 61813 6 1.05 5.03 PlyA - 62907 62902 6 1.05 5.02 Term - 64720 64286 435 1 0 68 42 118 0.382 0.49 5.01 Init - 76621 76217 405 1 0 45 8 286 0.163 12.89 5.00 Prom - 77833 77794 40 -1.46 6.00 Prom + 85981 86020 40 -2.36 6.01 Init + 95966 96041 76 1 1 69 111 98 0.956 9.81 6.02 Intr + 96914 97028 115 0 1 73 34 86 0.569 1.21 6.03 Term + 97465 97570 106 1 1 107 43 80 0.921 3.28 6.04 PlyA + 98025 98030 6 -3.74 7.04 PlyA - 98037 98032 6 1.05 7.03 Term - 100100 99998 103 1 1 122 53 26 0.484 0.25 7.02 Intr - 101335 101224 112 2 1 96 101 -19 0.687 -0.36 7.01 Init - 102600 102525 76 0 1 83 73 127 0.941 10.29 7.00 Prom - 105299 105260 40 -4.46 8.00 Prom + 105879 105918 40 -7.66 8.01 Init + 106787 106853 67 1 1 46 103 67 0.493 5.23 8.02 Intr + 115137 115257 121 1 1 33 91 96 0.127 3.95 8.03 Intr + 125150 125325 176 2 2 38 37 101 0.045 -0.42 8.04 Intr + 128476 128624 149 2 2 49 94 91 0.075 5.75 8.05 Term + 128790 128846 57 2 0 72 40 90 0.714 0.29 8.06 PlyA + 132003 132008 6 1.05 9.10 PlyA - 132720 132715 6 1.05 9.09 Term - 140465 140173 293 0 2 71 43 166 0.407 5.91 9.08 Intr - 147647 147597 51 0 0 42 95 52 0.074 0.18 9.07 Intr - 162617 162489 129 0 0 86 70 14 0.333 0.07 9.06 Intr - 164184 164062 123 1 0 87 109 19 0.585 4.46 9.05 Intr - 174440 174278 163 2 1 88 49 25 0.013 -1.85 9.04 Intr - 188322 188254 69 0 0 77 111 67 0.622 7.28 9.03 Intr - 189049 188892 158 2 2 65 60 18 0.394 -3.57 9.02 Intr - 193678 193391 288 2 0 94 92 73 0.554 5.52 9.01 Init - 196830 196791 40 0 1 97 39 70 0.696 3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 22264 22392 129 0 0 78 99 105 0.831 10.75 S.002 Term + 22752 22856 105 2 0 38 50 83 0.868 -2.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_1|99_aa MKASAALLCLLLTAAAFSPQGLAQPVGINTSTTCCYRFINKKIPKQRLESYRRTTSSHCP REAVIFKTKLDKEICADPTQKWVQDFMKHLDKKTQTPKL >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_1|300_bp atgaaagcctctgcagcacttctgtgtctgctgctcacagcagctgctttcagcccccag gggcttgctcagccagttgggattaatacttcaactacctgctgctacagatttatcaat aagaaaatccctaagcagaggctggagagctacagaaggaccaccagtagccactgtccc cgggaagctgtaatcttcaagaccaaactggacaaggagatctgtgctgaccccacacag aagtgggtccaggactttatgaagcacctggacaagaaaacccaaactccaaagctttga >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_2|97_aa MKVSAALLWLLLIAAAFSPQGLAGPASVPTTCCFNLANRKIPLQRLESYRRITSGKCPQK AVIFKTKLAKDICADPKKKWVQDSMKYLDQKSPTPKP >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_2|294_bp atgaaggtctccgcagcacttctgtggctgctgctcatagcagctgccttcagcccccag gggctcgctgggccagcttctgtcccaaccacctgctgctttaacctggccaataggaag ataccccttcagcgactagagagctacaggagaatcaccagtggcaaatgtccccagaaa gctgtgatcttcaagaccaaactggccaaggatatctgtgccgaccccaagaagaagtgg gtgcaggattccatgaagtatctggaccaaaaatctccaactccaaagccataa >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_3|821_aa MTIALQINSGMALKGRKIQTTIGEYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESL NRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSF DEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPG MQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKI IRAIYDKPIANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQ LGKEEVKLSLFADDMIIYLENPIVSAQNLLKLISNFSKVSEYKINVQKSQAFLYTNNRQT ESQIMSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGR INIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGG ITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHTYNYLIFDKPEKNKQWGKD SLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRRKTIKTLEENLGITIQDIG MGKDFMSKTPKAMATKDKIDKWDLIKIKSFCTAKETTVRVNRQPTKWEKIFATYSSDKGL ISRIYNELQQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIKKSGNNRLAGQETNRQGEEEVTIRARVMGPGHQKSKVLLPQLLEVLA ANASQRNFLYEARRQLKTAALPRVVSSSRSILQPGLVQMGV >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_3|2466_bp atgacaattgctctacagatcaactcagggatggccttgaaagggagaaaaatacaaact accatcggagaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatg gataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctg aatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaaaag agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactggta ccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattt gatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagagaat tttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaac cgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctggg atgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacagagcc aaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaa caacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaata ataagagctatctatgacaaacccatagccaatatcatactgaatgggcaaaaactggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattatatatctagaa aaccccattgtctctgcccaaaatctccttaagctgataagcaacttcagcaaagtctca gaatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaacaacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaataccta ggaatccaacttacaagggacatgaaggacctcttcaaggagaactacaaaccactgctc aaagaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatcccc atcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggc atcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccg catacctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggat tccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactg gatcccttccttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgtt agacgtaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggc atgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagacaaaattgac aaatgggatctaattaaaataaagagcttctgcacagcaaaagaaactaccgtcagagtg aacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggcta atatccagaatctacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggtgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaa aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccact atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaga cttgcaggccaagaaactaacaggcaaggagaggaagaggtcactatccgggccagggta atgggccctggtcatcagaagtctaaagtgcttcttccccagctgctagaagtgttggca gccaatgcttctcagcggaatttcctctatgaggcccgccgtcagctaaaaacagctgcc ttgcccagagttgtgtcctcttccaggagcatcctgcagccagggctggttcagatgggg gtataa >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_4|99_aa MKVSAALLCLLLMAATFSPQGLAQPDSVSIPITCCFNVINRKIPIQRLESYTRITNIQCP KEAVIFKTKRGKEVCADPKERWVRDSMKHLDQIFQNLKP >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_4|300_bp atgaaggtttctgcagcgcttctgtgcctgctgctcatggcagccactttcagccctcag ggacttgctcagccagattcagtttccattccaatcacctgctgctttaacgtgatcaat aggaaaattcctatccagaggctggagagctacacaagaatcaccaacatccaatgtccc aaggaagctgtgatcttcaagaccaaacggggcaaggaggtctgtgctgaccccaaggag agatgggtcagggattccatgaagcatctggaccaaatatttcaaaatctgaagccatga >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_5|279_aa MCKFLDTYTLPRLNQEDVECLNRPITSSEIEAVIKSLPTKKSPRTNGFTAKFYQMYKEEL VTFLLKLFQTIEKGGFLPNSFYEASIILIPKCVKDTPKEENLRAISLMNVNVKILHKIPA NQIQQHIKKLIHHDQPTQMRRNQKNSSGNMTKQGSSTPPKDHTSSPAMDPNQDEISELPE KEFIRSIIKLIKEAPEKGEVLLKEIKKKIQHMNGKISSEIDSINKSQSQLLEMKDTVREM QNVLESLSSRIKQAEERTSELEDKAFELNQSNKGEEKRI >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_5|840_bp atgtgtaaattcctggacacatacaccctcccaagactaaaccaggaagatgttgaatgc ctgaatagaccaataacaagttctgaaattgaggcagtaattaagagcctaccaaccaaa aaaagcccaagaacaaatggattcacagccaaattctaccagatgtacaaggaggagctg gtaacattccttctgaaactattccaaacaatagaaaaagggggattcctccctaactca ttttatgaggccagcatcatccttataccaaaatgcgtcaaagacacaccaaaagaagaa aatttaagggcaatatccctgatgaatgtcaacgtgaaaatcctccataaaataccggca aaccaaatccagcagcacatcaaaaagcttatccaccacgatcaacctacccaaatgaga aggaaccagaaaaacagttctggtaatatgacaaaacaaggctcttcaacacccccaaaa gatcacactagctcaccagcaatggatccaaaccaagatgaaatctctgaattgccagaa aaagaattcataaggtcaattattaagctaatcaaggaggcaccagagaaaggtgaagtc ctacttaaagaaataaaaaaaaagatacagcatatgaatggaaaaatctccagtgaaata gatagcataaataaatcacaatcacaacttctggaaatgaaggacacagttagagaaatg caaaatgtgctggaaagtctcagcagtagaatcaaacaagcagaagaaagaacttcagag cttgaagacaaggcttttgaattaaaccaatccaacaaaggtgaagaaaaaagaatttaa >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_6|98_aa MKVSAVLLCLLLMTAAFNPQGLAQPDALNVPSTCCFTFSSKKISLQRLKSYVITTSRCPQ KAVIFRTKLGKEICADPKEKWVQNYMKHLGRKAHTLKT >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_6|297_bp atgaaagtctctgcagtgcttctgtgcctgctgctcatgacagcagctttcaacccccag ggacttgctcagccagatgcactcaacgtcccatctacttgctgcttcacatttagcagt aagaagatctccttgcagaggctgaagagctatgtgatcaccaccagcaggtgtccccag aaggctgtcatcttcagaaccaaactgggcaaggagatctgtgctgacccaaaggagaag tgggtccagaattatatgaaacacctgggccggaaagctcacaccctgaagacttga >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_7|96_aa MQIITTALVCLLLAGMWPEDVDSKSMQVPFSRCCFSFAEQEIPLRAILCYRNTSSICSNE GLIFKLKRGKEACALDTVGWVQRHRKMLRHCPSKRK >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_7|291_bp atgcagatcatcaccacagccctggtgtgcttgctgctagctgggatgtggccggaagat gtggacagcaagagcatgcaggtacccttctccagatgttgcttctcatttgcggagcaa gagattcccctgagggcaatcctgtgttacagaaataccagctccatctgctccaatgag ggcttaatattcaagctgaagagaggcaaagaggcctgcgccttggacacagttggatgg gttcagaggcacagaaaaatgctgaggcactgcccgtcaaaaagaaaatga >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_8|189_aa MYQTLGIPQRDKTNMGPGIGNCAVWRTDFQDNKNQSRGNSKKPGKGGQWLRLARGQRKVD GSSSLSAEGAQKPRPPIGKGAFLKSLKETMKAHNLKGCQKSLQNSALTSAKASHTIAEYV QWTDPLVASDRERGRSMGEDADTQKDPNLYRKVKVDISPAPGSEPDMHGAKETVILAKLL CGQDENEEE >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_8|570_bp atgtaccagacactggggataccgcagcgggacaagacaaacatgggccctggtattggg aattgtgctgtatggagaacagattttcaggacaataagaaccagtcaagaggcaatagc aagaagccaggcaagggagggcaatggctccggctggcccggggacagcgcaaagtggat ggatccagctccttgtccgcagaaggtgctcagaagccacggcctcctataggaaaggga gcctttcttaagtccctgaaagagacaatgaaggctcataatcttaagggttgccaaaag tctttgcaaaattctgccttgaccagcgctaaagcctcacacaccattgctgagtacgtg cagtggactgatccactggtggccagtgacagggagaggggaaggagcatgggagaagat gcagatacccagaaagatccaaacctctaccgcaaagtcaaggtagacatttccccagcc ccggggagtgagcctgatatgcatggagcaaaggagacagtgattctcgccaagctcctg tgtgggcaggatgagaacgaggaggaataa >gi568815581r:34260562_34463161|GENSCAN_predicted_peptide_9|437_aa MGATDHMLIDDRICNCLELSIPIAPSSPNLLLCLGGSRSSSAECGTWKTGSNPAPSFLRP GMKVQRQRASSPESELAQQVMGSTRHKGPGSCMCLHGNGQPAQNLCALTEPFIEHLLYAR SLNNLHSGETDTVISCKSVSVSYIQLHIKCCGHTEDFIGALGMHGGSGRGPDPNLLEPFA TQMNNVVVTQEVHGIRTTLVLNVESLRCQLVTLGQFQTSLLLSVCSVDDGNICPAGAPVS HLCRIRGVAALGLGAWWDEQQRECGLRCRKGLGSGATEKPVQGLECQSWPLTLLHVQFWL CGVCEVQGDMTRVTVCVSSPPSKVLAIKKLSTAVERDEASGPGTGKSLLLQKRTRFAMFI SHIFSGTIMFLYSKFPLTSVIRYRSQTTEFVATVTSPPQMCDGQHVEGDLCASSGSPCCR GAVRGNLEILVSLPIFP >gi568815581r:34260562_34463161|GENSCAN_predicted_CDS_9|1314_bp atgggggccaccgaccacatgctgattgatgacagaatctgcaactgcctggagctgagt ataccaattgctccttcttccccaaatctcttgctttgtctaggggggtctagaagcagc tcagcagaatgtggaacctggaagacaggctctaatccagccccctcgtttttgagacca ggaatgaaggttcagaggcagagggcaagttccccagagtcagagcttgcacagcaggtc atgggcagcaccaggcacaagggtccaggcagctgcatgtgcctccatggcaatgggcag cctgctcagaacctctgtgcactcacagaacccttcattgagcatctgctgtatgctagg tccttaaataacttacattctggggagacagacacagtaattagttgtaaaagtgtatca gtgtcatacatacagttacatatcaaatgctgtggacatacagaagactttattggtgct ttggggatgcatgggggtagtggaagaggtcctgaccccaacttgctggagccttttgct acccagatgaacaacgtggtggtgacacaggaagtgcatgggataaggacaaccctggtt ttgaatgtggagtctctcaggtgtcagctggtgactttgggccaattccagacctctctg ctcctctctgtatgcagtgtggatgatggtaacatctgccctgcaggagcccctgtgtct cacttgtgtagaattcgaggggttgcagctttgggtttgggggcttggtgggacgagcag cagagggaatgtggcctgcggtgtaggaagggcttaggctcaggtgctactgagaagcca gtgcaggggctggagtgtcagagctggcccctcaccctactgcatgtgcagttctggtta tgtggggtctgtgaggtccagggagacatgaccagagtgacggtgtgtgtcagctccccc ccctccaaagtacttgccattaagaagctgagtacagctgtggagagggatgaggcatca ggtcctggcacaggcaagagcctgctcctgcagaagaggacccgctttgccatgtttatt tctcatatcttctctggcacgatcatgttcctttattcaaaatttccactcacatcagtc atccgttatcgatcacagacgacggagtttgtggccacagtgacatctcccccacagatg tgtgatggccagcatgtggaaggggacctttgtgctagctcaggctctccgtgttgcagg ggagcagtcagaggaaacttggaaattcttgtctctttgcccatctttccctag