GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:05:46 Sequence gi568815593f:52700395_52901837 : 201443 bp : 37.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 18459 18693 235 2 1 60 -16 256 0.182 8.74 1.02 Term + 18739 19409 671 2 2 -21 43 381 0.344 15.66 1.03 PlyA + 19636 19641 6 1.05 2.00 Prom + 19806 19845 40 -6.15 2.01 Init + 20269 20937 669 0 0 58 29 439 0.223 30.24 2.02 Intr + 25151 25267 117 1 0 105 68 71 0.407 6.54 2.03 Term + 30773 31126 354 1 0 -9 37 271 0.385 5.81 2.04 PlyA + 32643 32648 6 1.05 3.08 PlyA - 33061 33056 6 1.05 3.07 Term - 49848 49723 126 0 0 85 42 79 0.015 0.30 3.06 Intr - 61639 61535 105 1 0 102 90 77 0.973 8.79 3.05 Intr - 66416 66212 205 1 1 83 78 94 0.795 6.18 3.04 Intr - 71779 71671 109 2 1 124 22 -17 0.012 -6.28 3.03 Intr - 75963 75841 123 1 0 73 65 77 0.153 3.54 3.02 Intr - 79081 78927 155 0 2 74 6 193 0.119 8.49 3.01 Init - 80849 80803 47 2 2 103 30 44 0.795 0.31 3.00 Prom - 84340 84301 40 -5.75 4.00 Prom + 85757 85796 40 -8.85 4.01 Init + 87960 88020 61 2 1 103 93 124 0.991 13.86 4.02 Term + 91786 91805 20 2 2 118 43 0 0.344 -4.00 4.03 PlyA + 92224 92229 6 1.05 5.00 Prom + 95035 95074 40 -7.05 5.01 Init + 100001 100726 726 1 0 76 98 927 0.997 87.35 5.02 Term + 101015 101446 432 1 0 70 39 599 0.999 47.51 5.03 PlyA + 101619 101624 6 1.05 6.03 PlyA - 101855 101850 6 1.05 6.02 Term - 109789 109687 103 0 1 90 41 94 0.822 1.67 6.01 Init - 119330 118912 419 2 2 71 53 150 0.663 5.75 6.00 Prom - 120759 120720 40 -5.45 7.00 Prom + 126308 126347 40 -4.05 7.01 Init + 146586 146646 61 2 1 26 115 30 0.846 0.96 7.02 Intr + 148971 149091 121 1 1 65 103 159 0.786 13.73 7.03 Intr + 161053 161165 113 1 2 81 91 65 0.911 5.20 7.04 Intr + 164369 164457 89 0 2 78 99 83 0.905 7.17 7.05 Intr + 164577 164688 112 2 1 18 87 75 0.924 -0.57 7.06 Intr + 165296 165423 128 0 2 94 91 56 0.866 5.98 7.07 Intr + 181479 181627 149 2 2 62 71 147 0.667 8.51 7.08 Intr + 187421 187571 151 2 1 62 87 199 0.992 16.44 7.09 Intr + 193281 193446 166 2 1 79 95 164 0.998 14.81 7.10 Intr + 197061 197134 74 1 2 55 91 78 0.386 3.01 7.11 Intr + 197845 197989 145 0 1 27 82 79 0.113 0.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_1|301_aa ERSSSPATEQSWTENDFDELREEGFRRSNYSELKEEVRTNGKEDKNFEKKLNEWITRITD AEKSLKDLMELKTMARELQRVAAMENEMNEMNSEEKFREKRIKRNKQSLQEIWDYVKRPN LRLIGAPESDGESGTKLEKTLQYIIQENFPNLARQANIQIQEIQRIPQRYSSRRATPRHI IVRFSKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQ PRISYPAKISFISEGEIKYFTDKQMLRDFVTTRPALKQLLKEAPNMERYNRYQPLQKHAK L >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_1|906_bp gaacgcagctcctcaccagcaacagaacaaagctggacagagaatgactttgacgagttg agagaggaaggcttcagaagatcaaactactccgagctaaaggaggaagttcgaaccaat ggcaaagaagataaaaactttgaaaaaaaattaaacgaatggataactagaataactgat gcagagaagtccttaaaggacctgatggagctgaaaaccatggcacgagaactacaaagg gtagctgcgatggaaaatgaaatgaatgaaatgaacagtgaagagaagtttagagaaaaa agaataaaaagaaacaaacaaagcctccaagaaatatgggactatgtgaaaagaccaaat ctacgtctaatcggtgcacctgaaagtgacggggagagtggaaccaagttggaaaagact ctgcagtatattatccaggagaacttccccaatctagcaaggcaggccaacattcaaatt caggaaatacagagaataccacaaagatactcctcaagaagagcaactccaagacacata attgtcagattcagcaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaa ggtcgggttacccacaaagggaagcccatcagattaactgctgatctctcagcagaaact ctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaa cccagaatttcatatccagccaaaataagcttcataagtgaaggagaaataaagtacttc acagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaacagctcctg aaggaagcaccaaacatggaaaggtacaaccggtaccagccactgcaaaaacatgccaaa ttgtaa >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_2|379_aa MKAEIKMFFETNENKDTNIPESLEHIQSSVEGKFIALNAHKRKQERLKIDTLTSQLKELE NQEQTLSKASRRQEITKIGAELKEIEIQKTLQKISESRSWFFEKINKIDRPLARLIKKKR EKNQIDAIKTDKGDITTDPTEIQTTIREYYKHLYSNKLENLEEIDKFLDTYTLPRLNQEA VESLNRPITGSEIEAVINSLPTKKSRGPDGFTAKFYQRYKEELLSTASCCFMRQMQSDKP KVSRVLFLISDIWESIFIQDAVISKDVQKHLDVSQAEVCSGVEPSWRTSARAVQKGNVGS ESPHRVPTGALPSGVVRRGPPSSRPQNGRSTDSLHSACGKAADIQPQLMKAAERGLYPAN PQRQSSPRPWVSTSGISMT >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_2|1140_bp atgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacaaacatacca gaatctctggaacacattcaaagcagtgtagagggaaaatttatagcactaaatgcccac aaaagaaagcaggaaagactgaaaattgacaccctaacatcacaattaaaagaactagag aatcaagagcaaacactttcaaaagctagcagaaggcaagaaataactaagataggagca gaactgaaggaaatagagatacaaaaaacccttcaaaaaatcagtgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccgctagcaagactaataaagaagaaaaga gagaagaatcaaatagatgcaataaaaaccgacaaaggggatatcaccaccgatcccaca gaaatacaaaccaccatcagagaatactataaacacctctactcaaataaactagaaaat ctagaagaaatcgataaattcctcgacacgtacactctcccaagactaaaccaggaagca gttgaatctctgaatagaccaataacaggctctgaaattgaggcagtaattaatagctta ccaaccaaaaaaagtcgaggaccagatggattcacggccaaattctaccagaggtacaag gaggagctgctatcaacagcttcctgttgtttcatgcgtcagatgcagtctgataagccc aaggtgagccgtgtcctgtttctcatctctgatatatgggaatccatcttcatacaggat gctgtgatttcaaaggatgtgcagaaacacctggatgtctcccaggcagaagtttgctca ggggtggagccttcatggagaacctctgctagggcagtacagaaaggaaatgtggggtca gagtccccacacagagtccccactggggcactgcctagtggagttgtgagaagagggcca ccatcctccagaccccagaatggtagatccactgacagcttgcacagtgcatgtgggaaa gctgcagacattcaaccccagctcatgaaagcagctgagagggggctgtaccctgcaaat ccacagaggcaaagcagcccaaggccatgggtgtccacctctggcatcagcatgacctag >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_3|289_aa MAQTILKAAASFGSTCSVVALKVNGLGILKISSLAATGGDGTVVCLAAIGGDKAATFGAP VKPHPEKNLKVQRKELVLPVRIRVLKGSGCPIEAVLVEGASQWFEMGEDLKMQSISKCRT STKSMCYLRRERRKLLGDSGHFHPQTSDPKFFSFWTLGLTPAVCQGLSGLWPETKGCTVS FLTFEVLRLGLIHHWLPRSSTCRRHIVGVYLLIAQSPDTITMEIKISTYEFEGDTNIQSI RRSMLECRRLIRGCLPVKFLQQIFISESVSEESSLREAQTIAKLYEKIR >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_3|870_bp atggcccaaaccatacttaaagcggcagcatcatttggtagcacctgctctgtggtagct ctgaaggtgaatggccttggaatcctgaaaatcagcagtctagcagccactggaggtgat ggaacagtagtctgtctagcagccattggaggtgacaaagcagccacttttggagctcct gtaaaaccccatcctgagaaaaacctcaaagtgcaaaggaaggagctagtattaccagtg agaattagagtcttgaaagggagtgggtgtccaatagaagctgtccttgtggaaggcgcc agccagtggtttgagatgggggaggacctgaagatgcaaagtattagcaagtgcaggact tcaacaaagagcatgtgctatttacgtagggagaggaggaaactgttaggagattcaggt cactttcacccacaaacatcagaccccaagttcttcagcttttggactcttggacttaca ccagcggtttgccaggggctctcaggcctttggccagagactaaaggctgcactgtcagc ttccttacttttgaggttttgagactgggactgatccaccactggcttcctcgctcctca acttgcagacgacatatcgtgggagtttacctcttgatcgcccaatctccagataccatc acaatggaaattaagatttcaacatatgaatttgagggagacacaaacattcagtctata agaaggtctatgctagaatgtaggaggttaatccgtgggtgtttacctgtaaaattcctg cagcaaattttcatctcagagtctgtttctgaggaatcttccctaagagaggcccaaacc attgccaaattgtatgagaaaatacggtaa >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_4|26_aa MAPRPRARPGVAVACCWLLTETFWLA >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_4|81_bp atggcccctcggccccgcgcccgcccaggggtcgctgtcgcctgctgctggctcctcact gaaacattttggttggcctag >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_5|385_aa MKLVRKNIEKDNAGQVTLVPEEPEDMWHTYNLVQVGDSLRASTIRKVQTESSTGSVGSNR VRTTLTLCVEAIDFDSQACQLRVKGTNIQENEYVKMGAYHTIELEPNRQFTLAKKQWDSV VLERIEQACDPAWSADVAAVVMQEGLAHICLVTPSMTLTRAKVEVNIPRKRKGNCSQHDR ALERFYEQVVQAIQRHIHFDVVKCILVASPGFVREQFCDYLFQQAVKTDNKLLLENRSKF LQVHASSGHKYSLKEALCDPTVASRLSDTKAAGEVKALDDFYKMLQHEPDRAFYGLKQVE KANEAMAIDTLLISDELFRHQDVATRSRYVRLVDSVKENAGTVRIFSSLHVSGEQLSQLT GVAAILRFPVPELSDQEGDSSSEED >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_5|1158_bp atgaagctcgtgaggaagaacatcgagaaggacaatgcgggccaggtgaccctggtcccc gaggagcctgaggacatgtggcacacttacaacctcgtgcaggtgggcgacagcctgcgc gcctccaccatccgcaaggtacagacagagtcctccacgggcagcgtgggcagcaaccgg gtccgcactaccctcactctctgcgtggaggccatcgacttcgactctcaagcctgccag ctgcgggttaaggggaccaacatccaagagaatgagtatgtcaagatgggggcttaccac accatcgagctggagcccaaccgccagttcaccctggccaagaagcagtgggatagtgtg gtactggagcgcatcgagcaggcctgtgacccagcctggagcgctgatgtggcggctgtg gtcatgcaggaaggcctcgcccatatctgcttagtcactcccagcatgaccctcactcgg gccaaggtggaggtgaacatccctaggaaaaggaaaggcaattgctctcagcatgaccgg gccttggagcggttctatgaacaggtggtccaggctatccagcgccacatacactttgat gttgtaaagtgcatcctggtggccagcccaggatttgtgagggagcagttctgcgactac ctgtttcaacaagcagtgaagaccgacaacaaactgctcctggaaaaccggtccaaattt cttcaggtacatgcctcctccggacacaagtactccctgaaagaggccctttgtgaccct actgtggctagccgcctttcagacactaaagctgctggggaagtcaaagccttggatgac ttctataaaatgttacagcatgaaccggatcgagctttctatggactcaagcaggtggag aaggccaatgaagccatggcaattgacacattgctcatcagcgatgagctcttcaggcat caggatgtagccacacggagccggtatgtgaggctggtggacagtgtgaaagagaatgca ggcaccgttaggatattctctagtcttcacgtttctggggaacagctcagccagttgact ggggtagctgccattctccgcttccctgttcccgaactttctgaccaagagggtgattcc agttctgaagaggattaa >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_6|173_aa MGKDFMSKTPKAMATKANIDKWDLIKLKNFCTAKETTIRVNRQPTEWEKIFAIYSSDKGL ISRIYNEHKQIYKKKANNSIKKWAEDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIKKSGNNRNGYGKGQNVSNSPVTIPIFAVLKTKEAKLPFLV >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_6|522_bp atgggcaaggacttcatgtctaaaacaccgaaagcaatggcaacaaaagccaacattgac aaatgggatctaattaaactaaagaacttctgcacagcaaaagaaactaccatcagagtg aataggcaacctacagaatgggagaaaatatttgcaatctactcatctgacaaagggcta atatccagaatctataatgaacacaaacaaatttacaagaaaaaagcaaacaactccatc aaaaagtgggcagaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatctgacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg aatggctatggaaagggacagaatgtctcaaactcccctgtcaccatccccatctttgct gtcctaaaaacgaaagaagctaagcttccttttctggtttaa >gi568815593f:52700395_52901837|GENSCAN_predicted_peptide_7|437_aa MLLPICVQGVVLDFDKGLKKVVLRCCVSFNVDVKNSMTFSGPVEDMFGYTVQQYENEEGK WVLIGSPLVGQPKNRTGDVYKCPVGRGESLPCVKLDLPVNTSIPNVTEVKENMTFGSTLV TNPNGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVVNSIAPVQECSTQLDIVIVLDGS NSIYPWDSVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKI VQRGGRQTMTALGIDTARKEAFTEARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCEDE NIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIF ALEATADQSAASFEMEMSQTGFSAHYSQDWVMLGAVGAYDWNGTVVMQKASQIIIPRNTT FNVESTKKNEPLASYLX >gi568815593f:52700395_52901837|GENSCAN_predicted_CDS_7|1311_bp atgttactgcccatctgtgtacaaggagttgtgctagattttgataaaggattaaaaaaa gttgttctacgctgctgcgtatcattcaatgttgatgtgaaaaattcaatgactttcagc ggcccggtggaagacatgtttggatatactgttcaacaatatgaaaatgaagaaggaaaa tgggtgcttattggttctccgttagttggccaacccaaaaacagaactggagatgtctat aagtgtccagttgggagaggtgaatcattaccttgtgtaaagttggatctaccagttaat acatcaattcccaatgtcacagaagtaaaggagaacatgacatttggatcaactttagtc accaacccaaatggaggatttctggcttgtgggcccttatatgcctatagatgtggacat ttgcattacacaactggaatctgttctgacgtcagccccacatttcaagtcgtgaattcc attgcccctgtacaagaatgcagcactcaactggacatagtcatagtgctggatggttcc aacagtatttacccatgggacagtgttacagcttttttaaatgaccttcttgaaagaatg gatattggtcctaaacagacacaggttggaattgtacagtatggagaaaacgtgacccat gagttcaacctcaataagtattcttccaccgaagaggtacttgttgcagcaaagaaaata gtccagagaggtggccgccagactatgacagctcttggaatagacacagcaagaaaggag gcattcacggaagcccggggtgcccgaagaggagttaaaaaagtcatggttattgtgaca gatggagagtctcatgacaatcatcgactgaagaaggtcatccaagactgtgaagatgaa aacattcaacggttttccatagctattcttggcagctataaccgaggaaatttaagcact gaaaaatttgtggaggaaataaaatcaattgcaagtgaacccactgaaaagcatttcttc aatgtctctgatgaattggctctagtcaccattgttaaaactctgggagaaagaatattt gccctggaagccacagctgaccagtcagcagcttcatttgaaatggaaatgtctcagact ggcttcagtgctcattattcacaggactgggtcatgcttggagcagtaggagcctatgat tggaatggaacagttgtcatgcagaaggctagtcaaatcataatccctcgaaacacaacc tttaatgttgagtctaccaaaaagaatgaaccgcttgcttcttatttagnn