GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:21:41 Sequence gi568815597f:168476638_168681217 : 204580 bp : 39.28% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2746 2785 40 -3.05 1.01 Init + 4310 4371 62 1 2 81 103 54 0.178 6.99 1.02 Intr + 10429 10508 80 1 2 73 99 50 0.035 2.88 1.03 Intr + 19636 19679 44 2 2 61 115 10 0.006 -1.96 1.04 Intr + 25612 25803 192 0 0 50 72 99 0.017 3.27 1.05 Intr + 31592 31764 173 1 2 61 39 91 0.020 -0.68 1.06 Intr + 35584 35722 139 1 1 73 84 41 0.023 1.75 1.07 Intr + 35882 35966 85 1 1 72 96 32 0.017 0.87 1.08 Intr + 41245 41439 195 2 0 68 110 76 0.154 6.26 1.09 Term + 55460 55605 146 0 2 64 50 91 0.113 -0.01 1.10 PlyA + 56461 56466 6 1.05 2.02 PlyA - 57180 57175 6 1.05 2.01 Sngl - 62540 61536 1005 2 0 43 42 356 0.881 23.21 2.00 Prom - 63878 63839 40 -10.35 3.06 PlyA - 64150 64145 6 1.05 3.05 Term - 64483 64315 169 0 1 105 39 135 0.990 6.67 3.04 Intr - 65470 65356 115 2 1 102 82 90 0.960 8.39 3.03 Intr - 66538 66501 38 0 2 88 60 23 0.350 -3.51 3.02 Intr - 67414 67267 148 2 1 108 115 65 0.434 9.57 3.01 Init - 67882 67690 193 1 1 53 40 119 0.432 2.78 3.00 Prom - 69278 69239 40 -5.15 4.00 Prom + 69932 69971 40 -7.15 4.01 Sngl + 70299 71045 747 2 0 44 49 271 0.921 14.73 4.02 PlyA + 71142 71147 6 1.05 5.00 Prom + 92147 92186 40 -4.15 5.01 Init + 99440 99580 141 1 0 53 59 117 0.387 5.48 5.02 Intr + 100168 100297 130 0 1 99 99 18 0.431 3.35 5.03 Intr + 103426 103540 115 2 1 107 82 128 0.994 12.69 5.04 Term + 104415 104583 169 0 1 106 39 142 0.983 7.47 5.05 PlyA + 104683 104688 6 1.05 6.00 Prom + 110652 110691 40 -5.15 6.01 Init + 111686 111692 7 1 1 90 96 0 0.787 2.09 6.02 Intr + 112193 112421 229 0 1 60 83 198 0.654 12.51 6.03 Intr + 117052 117082 31 1 1 65 93 -4 0.032 -4.99 6.04 Intr + 122942 123013 72 1 0 85 115 32 0.077 4.28 6.05 Term + 127108 127221 114 0 0 70 41 90 0.262 0.09 6.06 PlyA + 127841 127846 6 1.05 7.10 PlyA - 128335 128330 6 1.05 7.09 Term - 132642 132503 140 1 2 48 43 151 0.566 3.74 7.08 Intr - 132811 132751 61 1 1 69 71 74 0.715 1.19 7.07 Intr - 134098 134019 80 2 2 124 46 50 0.920 2.75 7.06 Intr - 135618 135430 189 1 0 97 93 63 0.964 6.34 7.05 Intr - 140019 139972 48 1 0 95 57 51 0.229 0.43 7.04 Intr - 141451 141404 48 2 0 63 71 65 0.239 0.03 7.03 Intr - 161327 161204 124 2 1 95 14 88 0.609 1.34 7.02 Intr - 164635 164512 124 0 1 42 106 96 0.477 6.47 7.01 Init - 164950 164865 86 1 2 44 49 89 0.812 0.94 7.00 Prom - 167474 167435 40 -3.35 8.08 PlyA - 168209 168204 6 1.05 8.07 Term - 168315 168293 23 1 2 96 41 25 0.053 -3.90 8.06 Intr - 173591 173455 137 1 2 84 61 123 0.384 8.49 8.05 Intr - 178738 178554 185 1 2 33 96 64 0.378 -0.64 8.04 Intr - 180498 180379 120 0 0 33 67 133 0.315 5.47 8.03 Intr - 184960 183888 1073 2 2 64 -2 428 0.024 20.41 8.02 Intr - 190875 190786 90 1 0 137 82 16 0.040 4.95 8.01 Init - 204159 203799 361 0 1 48 40 132 0.392 1.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 10433 10508 76 1 1 83 99 38 0.906 5.70 S.002 Term - 203663 203308 356 0 2 35 48 198 0.927 4.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_1|371_aa MGSSLWGSSKAAKLLFFMYSMDMDEAGNHHSQQTNTGTENQTPHILTQKFSNIEISLHLW EKMTDTKSGVPQATCYDELAMNLEISIIFSGLIFARMTCRTHGSVIITVIIIKDTIEDLL SEETNRAPIEDSNSTVYIGFGDHGAKGSEGMQSSCNAGWENTSHEDIIVIVCSLNLKRYK RVFRNSINLKDFYFMREYLNIIPSKANSLGCGFIGANGKPKVSFEHTEFEMRKGRHREVR TFAESYKASKWWIQDVNSEYSTTLSKNSWPIKWATARKSLKLKLHEFLVHPLWLGSHLGL ELWAESDVGWARLSFSEWNGPVTGFREFWHAALLSRANEAWITIPFTVALSDMDGIIVWQ LDEVYILEVNA >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_1|1116_bp atggggagcagcctgtggggcagcagcaaagctgccaaacttctgtttttcatgtactcc atggacatggatgaagctggaaaccatcattctcagcaaactaacacaggaacagaaaac caaacaccacatattctcactcaaaagttttccaacattgaaattagcctgcatctctgg gaaaagatgacagacacaaagtcaggggttccccaggccacctgctatgatgaactggct atgaatttggagatttccatcatcttctctggtctgatatttgctagaatgacttgcaga actcatggaagtgttataattacagttataattataaaggataccattgaggacctgcta agtgaagagactaacagggcacccatagaagactcaaacagcactgtgtacattggcttt ggggatcatggggctaaagggtctgaggggatgcagtcttcttgtaatgctggatgggaa aacacttcccatgaagatatcatagttattgtttgctccttaaatcttaagagatacaaa agagttttcaggaactctattaacctaaaggatttttatttcatgagagaatacttaaac ataattccaagtaaagccaattcactaggttgtggatttattggggctaacggcaaaccc aaggtgagttttgagcacactgagtttgagatgaggaaggggaggcacagagaggtgagg acatttgcggagagttacaaagccagcaaatggtggatccaagatgtgaactcagaatat tctacaactctcagcaaaaactcttggcccattaaatgggccactgcaagaaaaagcctg aagcttaagcttcatgagtttttggtacatcctctttggctgggctcccatctgggttta gaattgtgggcagagtcggatgttggatgggccagactctctttcagtgagtggaatggt ccagtcacaggtttcagagaattctggcatgctgcccttctttccagagctaatgaggca tggataaccattccttttacagttgctctgtctgacatggatggcatcattgtctggcaa cttgatgaagtatatattttggaggtgaatgcatga >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_2|334_aa MNIDAKILNKILAKQIQQHIKKLIHHDQVGFIPGMQGWFNICKSIEVVHHIKRTKEKNHM IISIGKAFDKIQQPFMLKTLNKLGIDVTYLKIMRAIYDKPTANIILNGQKLEAFPLKTGT KQGCPLSPLLFNIVLEVLARAIRLEKEIKGIQLGKEEVKLSLFADDMLVYLENPIVSAQN LLKLISNFSKVSGYKINGQKSQAFLYTNNRLTENRIMSELPFTIASKRIKYLGIQLTRDV KDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIMKMDILPKEIYRFNAIPIQLPMTLF TKLEKTTLKFIWNQKTARIAKTILSQKNKAKCSN >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_2|1005_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaacaaatccagcagcacatc aaaaagcttatccaccacgatcaggtgggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaatagaagtagtccatcatataaagagaaccaaagaaaaaaaccacatg attatctcaataggtaaggcctttgacaaaattcaacagcccttcatgctaaaaactctc aataaactaggtattgatgtgacatatctcaaaataatgagagctatttatgacaaaccc acagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcaca aaacagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagg gcaatcaggctggagaaagaaataaagggcattcaattaggaaaagaggaagtcaaattg tccctgtttgcagatgacatgcttgtatatctagaaaaccccattgtctcagcccaaaat ctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgggcaaaaa tcacaagcattcctctacaccaataacagactaacagagaaccgaatcatgagtgaactc ccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtg aaggacctcttcaaggagaactacaaaccgctgctcaatgaaataaaagaggacacaaac aaatggaagaacattccatgctcatggataggaagaatcaatatcatgaaaatggacata ctgcccaaggaaatttatagattcaatgccatccccatccagctaccaatgactctcttc acaaaattggaaaaaactactttaaagttcatatggaaccaaaaaacagcccgcattgcc aagacaatcctaagccaaaagaacaaagctaaatgctccaattaa >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_3|220_aa MRTKKAKDREAQCGSSLSSPPERVKGWHQGLMIHGCGSLMSHWMSHEVGWNTVTTPPHFL YSFRGSALTIGGIKEILKEPDPHSPCTAQRDLSHETSHPGPPWHLLSHCIHCGSKVVMML TPAQRTGVGSEVSHRRTCVSLTTQRLPVSRIKTYTITEGSLRAVIFITKRGLKVCADPQA TWVRDVVRSMDRKSNTRNNMIQTKPTGTQQSTNTAVTLTG >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_3|663_bp atgagaaccaagaaagcaaaggatcgagaagctcagtgtggcagcagcctctcttcccct cctgagagagtcaaagggtggcatcagggactcatgatccatggttgtggaagcctcatg tcacactggatgtcacatgaggtgggatggaacacagtgaccaccccacctcatttcctt tacagcttccgtggaagtgcactgaccattggaggcataaaagagatcctcaaagagccc gatcctcactctccctgcacagctcagcgggacctcagccatgagacttctcatcctggc cctccttggcatctgctctctcactgcatacattgtggaagcaaggtggtgatgatgtta acccctgctcaaagaacaggtgtagggagtgaagtctcacataggaggacctgtgtgagc ctcactacccagcgactgccagttagcagaatcaagacctacaccatcacggaaggctcc ttgagagcagtaatttttattaccaaacgtggcctaaaagtctgtgctgatccacaagcc acgtgggtgagagacgtggtcaggagcatggacaggaaatccaacaccagaaataacatg atccagaccaagccaacaggaacccagcaatcgaccaatacagctgtgaccctgactggc tag >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_4|248_aa MVKGSIQQEELTILNIYAPNPGAHRFKKQVLRDLQGDLDSHTIIMGDFNTPLSTLDRSTR QKVNKDIQELNSALHQVDLIDIYRTLHPKSAEYTFLSAPHRTYSKIDHIIGSKALLSKCK RIEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKETTYQNLWDTFKKVCIGKLITLNAHKRKQERSKIDPLTSQLKEIEKQEQTHSNVSRRQ EITRSGQN >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_4|747_bp atggtaaagggatcaattcaacaagaagagctaactatcctaaatatatatgcccccaat ccaggagcacacagattcaaaaagcaagtcctcagagacctacaaggagatttagactcc cacacaataataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagttaataaggatatccaggaattgaactcagctctacaccaagtggacctaata gatatttacagaaccctccaccctaaatcagcagaatatacattcctctcagcaccacat cgcacttattccaaaattgaccacataattggaagtaaagcactcctcagcaaatgtaaa agaatagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactcagg attaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgag aacaaagaaacaacataccagaatctctgggacacatttaaaaaagtgtgtatagggaaa ttgataacactaaatgcccacaagagaaagcaggaaagatctaaaattgaccccctaaca tcacaattaaaagaaatagagaagcaagagcaaacacattcaaatgttagcagaaggcaa gaaataacaagatcagggcagaactga >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_5|184_aa MRTKKAKDREAQCGSSLSSPPERVKGWHQGLMIHGCGSLMSHWMSHEGSLKLPMCKKGMM ILTVEGFVNFQNRENLISIWAPTFPNWVISGVGSEVSDKRTCVSLTTQRLPVSRIKTYTI TEGSLRAVIFITKRGLKVCADPQATWVRDVVRSMDRKSNTRNNMIQTKPTGTQQSTNTAV TLTG >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_5|555_bp atgagaaccaagaaagcaaaggatcgagaagctcagtgtggcagcagcctctcttcccct cctgagagagtcaaagggtggcatcagggactcatgatccatggttgtggaagcctcatg tcacactggatgtcacatgaggggagccttaaacttcccatgtgcaagaaaggaatgatg attttgactgtagagggcttcgtaaacttccaaaacagggagaatttgattagtatctgg gctcctacttttcctaattgggtaatttcaggtgtagggagtgaagtctcagataagagg acctgtgtgagcctcactacccagcgactgccggttagcagaatcaagacctacaccatc acggaaggctccttgagagcagtaatttttattaccaaacgtggcctaaaagtctgtgct gatccacaagccacatgggtgagagacgtggtcaggagcatggacaggaaatccaacacc agaaataacatgatccagaccaagccaacaggaacccagcaatcgaccaatacagctgtg actctgactggctag >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_6|150_aa MEVGNANHDKSWTSRQIAGAAQLRLSIFGASGDPCYRQAGLLPVGCRPKTHSHEQRNPEQ TVFLILNDTTRAAYTLGRRYLAVELLGLTIGNRNVKSMTGKIVHLVQEELRQMARSPYLP KAADPQVAKAHHPSVLNPTKILHQVKNALH >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_6|453_bp atggaggtaggaaatgcaaatcacgacaaatcatggacatccagacaaattgctggagct gctcagctgagactgagcatcttcggtgcctcaggggatccatgttacagacaagcaggc cttttgccagtgggctgcaggcccaagacacatagtcatgagcagagaaacccagagcaa acggtcttccttatccttaatgacaccaccagggcagcttatactttgggaagaagatac ctagcagtggaattgctgggtcttactattgggaacagaaatgtaaaaagcatgacagga aaaattgttcatctggttcaggaggagctgaggcaaatggctaggtctccttatcttccc aaagctgcagaccctcaggttgccaaggcacatcatccgtctgtgctgaacccaaccaag attcttcaccaagtgaagaatgctctgcactaa >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_7|299_aa MRANTGREETRKVPEKVSVWPAGHKQYSCGCKNSLTSVSQNLFLDISLFGKSHEDEDLSA GISSVWTRHKETWSYYVNKDMYILTSSYGVKLASIQKYTLSLRMGVYDPQTVLGELPSAF RVKLKLTRISGQWYTGKGVTSSLGPGDVWHDFSLKPATPLEEDLSGHGVSLRSQSAKSSL WHQPSLQWPLQISHILGMVVLNLGIQDAKTDGKWGYLPHITKHVSGSAKIYTQELSNSHA LKRQIQHNKYPVTAQRVTQPAATIASRVKPAKIIKSTLPREMKDILVCAKSSQLPHFID >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_7|900_bp atgagagcgaacacaggtcgggaagaaaccaggaaggttccagagaaggtatctgtatgg ccagctggccacaaacaatactcatgtggttgcaagaactccttgaccagtgtttcccaa aatctgttccttgatatatccctattcggcaagagccatgaggatgaagatttatctgct ggaatctccagtgtctggacccggcacaaagaaacctggtcatactatgtgaacaaagac atgtacattttgacaagttcctatggagtaaaactagcctcaattcagaaatacacttta tctctgagaatgggtgtctatgatcctcagacagttcttggagagctccccagcgctttc agagtaaagctcaagctcacaaggatttctgggcagtggtatactggtaaaggtgtgacc agttctctgggccccggagacgtgtggcatgatttctctctcaagcctgcaacaccattg gaagaggatttatcaggccatggagtctctctgagaagtcagtcagcaaaaagcagcctc tggcaccagccatcacttcagtggcctctacagatatctcacatcttagggatggtggtc ttgaatctaggaatccaagatgcaaaaacagatggcaagtggggttacttgccccacatc acaaagcatgttagtggcagtgccaagatttacacccaggagctgtcaaatagtcatgcc ctgaaacgacaaatccagcacaacaaataccctgtgacagcacagcgcgtcacccaacct gctgcgacaatcgccagccgcgtaaagccagccaaaatcatcaaatctactttgccaaga gaaatgaaggacatactggtttgtgctaaatcttctcaattacctcacttcatagactga >gi568815597f:168476638_168681217|GENSCAN_predicted_peptide_8|662_aa MKRGLLPNSFYEASIILIQKPGRDTKKKNLGPMSLMNIDAKIPNKILENQMQQHIKKLIH HDQVSFTPGMQGWFNIGKSINVIHHVNRTNDKNHIIISIDAEKAFDNIQHPFMSKTVHKL EKPQGNQIFKRLLEVYSGSQQKENMEHRALVLEVLARAIRQEKEIKGIQVGKEEVKLSLF ADDMIVYPENPTVSAQNLLKLIRNFSKVSGYKINVQKSQAFLYTNNTQTESQIMSELPFT IASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWRNIPCSWVGRINIVKMAILPK EIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRALIAKSILSQKNKAEGITLPDFKLYYK ATVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYNYLIFDKPEKNKKWGKDSLFNKWCWQNW LAICRKLKLDPFLTPYTKINLRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPK AMATKAKIDKWDLIKLKSFCTAKETTIRTSILKRDTQKDTDTGNRGEGYVDMEADIEVVQ PQVKEHLEHQSPIISRRTDCGLTQSTHSGTGADGTSSIFKIWFLRLLQEHTSREWTIKER EYEGLHGSFRKESPKESVLQEALSSNGMREDLEEEIRAWRGSVTSIEGNGQPEIPSLWTQ AK >gi568815597f:168476638_168681217|GENSCAN_predicted_CDS_8|1989_bp atgaaaaggggactcctccctaactcattttatgaggccagcatcatcctgatacaaaaa cctggcagagacacaaaaaagaaaaacctcgggccaatgtccctgatgaacattgatgca aaaatccccaataaaatactggaaaaccaaatgcagcagcacatcaaaaagcttatccac catgatcaagtcagcttcacccctgggatgcaaggctggttcaacataggcaaatcaata aatgtaatccatcatgtaaacagaaccaatgacaaaaaccacataattatctcaatagat gcagaaaaggccttcgataacattcaacatcccttcatgtcaaaaactgtccataagcta gaaaagcctcaaggaaaccaaatttttaagaggttacttgaggtctactctgggtcccaa cagaaggaaaatatggaacacagggctctagtgttggaagttctggccagggcaattagg caggagaaggaaataaagggtattcaagtaggaaaagaggaagtcaaattgtccctgttt gcagatgacatgattgtatatccagaaaaccccactgtctcagcccaaaatctccttaag ctgataagaaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttcttatacacgaataacacacaaacagagagccaaatcatgagtgaactcccattcaca attgcttcaaagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctc ttcaaggagaactacaaaccactgctcaatgaaattaaagaggatacaaacaaatggagg aacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaag gaaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactactttaaagttcatatggaaccaaaaaagggccctcattgccaagtcaatc ctaagccaaaagaacaaagctgaaggcatcacgctacctgacttcaaactatactacaag gctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaac agaacagagccctcagaaataatgctgcatatctacaactatctgatctttgacaaacct gagaaaaacaagaaatggggaaaggattccctatttaacaaatggtgctggcaaaactgg ctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaat ttaagatggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaac ctaggcattaccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaa gcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgc acagcaaaagaaactaccatcagaacaagtattcttaaaagagacacgcagaaggataca gacacaggcaacaggggagaaggatatgtggatatggaagcagatattgaagttgtgcag ccacaagtcaaggaacacttggagcaccaaagtccaatcatcagtaggagaacggattgt gggcttactcaaagcacccactcaggaactggtgctgatggaacctcttccatcttcaaa atatggtttctaaggttgctccaagagcatacatccagggaatggaccatcaaggaaaga gagtatgaaggattacatggaagcttcaggaaagagagcccaaaagagagtgttcttcaa gaagctttgagttcaaatgggatgagagaagatttagaggaagagatcagagcctggaga gggagcgtcactagcatagagggtaatggtcagccagagatcccaagcctttggactcag gctaaataa