GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:04:26 Sequence gi568815592r:136901561_137117106 : 215546 bp : 41.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10127 10291 165 1 0 47 91 57 0.459 1.58 1.02 Term + 11898 11966 69 2 0 124 50 71 0.832 3.86 1.03 PlyA + 12023 12028 6 1.05 2.00 Prom + 12940 12979 40 -6.15 2.01 Init + 20869 21307 439 0 1 92 99 474 0.983 43.22 2.02 Intr + 22325 22927 603 0 0 90 81 862 0.021 77.22 2.03 Term + 32128 32414 287 2 2 81 43 252 0.484 14.58 2.04 PlyA + 32607 32612 6 1.05 3.00 Prom + 40015 40054 40 -6.85 3.01 Init + 42252 42354 103 2 1 49 91 78 0.729 4.65 3.02 Term + 45628 45701 74 2 2 108 53 77 0.749 3.29 3.03 PlyA + 46409 46414 6 1.05 4.06 PlyA - 47084 47079 6 1.05 4.05 Term - 53727 53568 160 2 1 122 49 67 0.464 2.63 4.04 Intr - 58236 58123 114 2 0 93 75 90 0.430 6.84 4.03 Intr - 66081 65946 136 1 1 54 78 45 0.103 -1.19 4.02 Intr - 67560 67311 250 0 1 90 75 82 0.319 3.19 4.01 Init - 72741 72439 303 0 0 20 86 139 0.560 4.24 4.00 Prom - 76063 76024 40 -6.45 5.00 Prom + 77737 77776 40 -5.45 5.01 Init + 82345 82481 137 0 2 82 78 191 0.449 17.16 5.02 Term + 84784 84862 79 1 1 142 32 16 0.336 -2.34 5.03 PlyA + 85106 85111 6 1.05 6.04 PlyA - 86863 86858 6 1.05 6.03 Term - 87258 87134 125 1 2 90 32 104 0.512 2.57 6.02 Intr - 88647 88498 150 1 0 22 55 155 0.555 4.81 6.01 Init - 91934 91904 31 2 1 78 29 71 0.325 0.15 6.00 Prom - 97916 97877 40 -6.35 7.11 PlyA - 98429 98424 6 1.05 7.10 Term - 100795 99998 798 1 0 77 44 498 0.998 36.57 7.09 Intr - 103200 103061 140 1 2 119 97 106 0.988 13.86 7.08 Intr - 107183 107039 145 2 1 71 121 151 0.992 15.63 7.07 Intr - 107932 107757 176 2 2 92 115 116 0.997 13.44 7.06 Intr - 109892 109714 179 1 2 78 106 150 0.920 14.44 7.05 Intr - 115543 115408 136 2 1 99 103 117 0.798 13.01 7.04 Intr - 129551 129476 76 2 1 99 103 -32 0.010 -2.43 7.03 Intr - 137691 137581 111 0 0 99 14 96 0.005 2.96 7.02 Intr - 142764 142577 188 1 2 43 44 116 0.080 1.19 7.01 Init - 143168 143081 88 2 1 62 96 280 0.973 25.05 7.00 Prom - 143440 143401 40 -4.55 8.14 PlyA - 144133 144128 6 1.05 8.13 Term - 148102 148036 67 0 1 88 48 79 0.116 0.33 8.12 Intr - 150733 150630 104 1 2 24 99 61 0.068 -1.15 8.11 Intr - 160598 160455 144 2 0 82 57 66 0.374 2.46 8.10 Intr - 162498 162377 122 1 2 96 92 34 0.726 3.89 8.09 Intr - 163035 162954 82 0 1 55 110 80 0.411 5.19 8.08 Intr - 165239 165002 238 1 1 97 102 55 0.899 4.49 8.07 Intr - 168736 168455 282 0 0 61 94 92 0.628 2.71 8.06 Intr - 169049 168864 186 1 0 89 7 108 0.225 0.68 8.05 Intr - 190656 190498 159 2 0 79 96 46 0.162 2.68 8.04 Intr - 190946 190803 144 1 0 65 107 35 0.118 1.58 8.03 Intr - 194841 194758 84 2 0 67 75 64 0.007 1.02 8.02 Intr - 201044 200942 103 0 1 38 60 90 0.003 -0.49 8.01 Init - 212699 212639 61 2 1 66 60 99 0.603 6.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 15419 15244 176 0 2 85 43 138 0.831 5.94 S.002 Init - 20558 20549 10 2 1 53 64 22 0.810 -4.32 S.003 Term + 22325 23136 812 0 2 90 54 919 0.970 81.05 S.004 Term - 142764 142556 209 1 2 43 42 154 0.809 2.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_1|77_aa MNHFAQPCFRFFAIVNEVAMNIRVYIVLQTYVFIFLGQISGSGITGSKDRYVFNLVADCS WDETIKIYDPACLTIPA >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_1|234_bp atgaaccactttgcccagccttgttttcgattttttgctatcgtgaatgaagttgctatg aacattcgtgtctacattgttttgcaaacatatgttttcatttttcttggacaaatatct gggagtggaattactgggtcaaaagataggtatgtgtttaacttggtggctgactgttct tgggatgaaacaataaagatctatgaccctgcttgtcttactattcctgcttga >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_2|442_aa MRQLCRGRVLGISVAIAHGVFSGSLNILLKFLISRYQFSFLTLVQCLTSSTAALSLELLR RLGLIAVPPFGLSLARSFAGVAVLSTLQSSLTLWSLRGLSLPMYVVFKRCLPLVTMLIGV LVLKNGAPSPGVLAAVLITTCGAALAGAGDLTGDPIGYVTGVLAVLVHAAYLVLIQKASA DTEHGPLTAQYVIAVSATPLLVICSFASTDSIHAWTFPGWKDPAMVCIFVACILIGCAMN FTTLHCTYINSAVTTSFVGVVKSIATITVGMVAFSDVEPTSLFIAGVVVNTLGSIIYCVA KFMETRKQSNYEDLEAQPRGEEAQLSGDQLPFVMEELPGEGGNGRSEECGKVIVIKTSPL FRPTMQNSYAEWRLTLENVTTNAKLVVCPFVTAPNPPPTNSSLHSELQIYSGTSSNGKRK GRNMDNIRITCLKLLVDEKFWK >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_2|1329_bp atgcggcagctgtgccggggccgcgtgctgggcatctcggtggccatcgcgcacggggtc ttctcgggctccctcaacatcttgctcaagttcctcatcagccgctaccagttctccttc ctgaccctggtgcagtgcctgaccagctccaccgcggcgctgagcctggagctgctgcgg cgcctcgggctcatcgccgtgccccccttcggtctgagcctggcgcgctccttcgcgggg gtcgcggtgctctccacgctgcagtccagcctcacgctctggtccctgcgcggcctcagc ctgcccatgtacgtggtcttcaagcgctgcctgcccctggtcaccatgctcatcggcgtc ctggtgctcaagaacggcgcgccctcgccaggggtgctggcggcggtgctcatcaccacc tgcggcgccgccctggcaggagccggcgacctgacgggcgaccccatcgggtacgtcacg ggagtgctggcggtgctggtgcacgctgcctacctggtgctcatccagaaggccagcgca gacaccgagcacgggccgctcaccgcgcagtacgtcatcgccgtctctgccaccccgctg ctggtcatctgctccttcgccagcaccgactccatccacgcctggaccttcccgggctgg aaggacccggccatggtctgcatcttcgtggcctgcatcctgatcggctgcgccatgaac ttcaccacgctgcactgcacctacatcaattcggccgtgaccaccagcttcgtgggtgtg gtgaagagcatcgccaccatcacggtgggcatggtggccttcagcgacgtggagcccacc tctctgttcattgccggcgtggtggtgaacaccctgggctctatcatttactgtgtggcc aagttcatggagaccagaaagcaaagcaactacgaggacctggaggcccagcctcgggga gaggaggcgcagctaagtggagaccagctgccgttcgtgatggaggagctgcccggggag ggaggaaatggccggtcagaagaatgtggaaaagtcattgttattaaaacatcacccctt ttccgtcccactatgcaaaactcatatgctgagtggagacttaccctggaaaatgttaca accaatgcaaagttggttgtctgcccatttgtcaccgcccccaaccctccacccaccaat tcttccttgcactcagaactccagatttattcaggtacaagcagcaatgggaagcggaag ggaaggaacatggacaacatccgtatcacttgtttaaaactgctcgttgatgaaaagttt tggaaatag >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_3|58_aa MGLRHAELATYGTKTTSIPPTRKLEFENNRVVCSVNTAINTSAAVTRPLATRCCGHRY >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_3|177_bp atgggccttcggcatgctgagctagccacttacggtactaaaaccacctcaattcctcct acacgtaaattagagtttgaaaataatagagttgtgtgctcagtaaacacagccattaat acttctgctgctgtgaccaggcccctagccacccgctgctgtggccatcgctattga >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_4|320_aa MIRNLSESPPVTLEPVEACWEQDFLKGNTSGRLWSKSFFAGYKRSLQNQREHTALLKIEG GYAQDETEFYLGKRCACVYKAKSNTMIPGSKPNKSRVIWGKIPLCSGHPPLDGGQLISSV QIRGTAELAWDFFQSTCSLPEPRLTQLEHLVTFPSPGASSSCGKTTGFTDIVRFKLLAIR KRQKVLWPQIAVLCDLHQELYLEISRLDGGSKEAQSWHSGSRGAHHSFLWRECTSKQGGD WSARGLELASRFGTSKSKLCAGPMATSSPLYTLLPSLPILSFKRVLTTFFINASIMPYIG DPQNRHMSYYGNHFPDPHRP >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_4|963_bp atgatcaggaacctcagtgagtcccctcctgtcaccctggagcctgtggaggcctgctgg gaacaggacttcttaaaaggaaatacgtctggaaggctgtggtctaagtccttttttgct ggctataagcggagcctccagaaccaaagggagcatacagctcttcttaaaattgaaggt ggttatgcccaagatgaaactgaattctatttgggcaagagatgtgcttgtgtatacaaa gcaaagagcaacacaatgattcctggcagcaaaccaaacaaaagcagagtaatctgggga aagatccctctctgctctggtcatcctcctctggacgggggccagctcatcagctcggtt cagataaggggcacagcagagcttgcctgggacttcttccagagcacctgctcattgcca gaaccaaggctgactcaattagaacatctggtcaccttcccgtcccctggcgcctcctca tcctgtggcaaaaccactggcttcacagatatagttagatttaaattattagccattagg aaaagacagaaagtattatggccacagattgctgtcctgtgtgatctccaccaagaactc tacctggagatctccaggctggatggtggctccaaggaggcacagtcatggcatagtggc tctaggggtgcacaccacagcttcctatggagggagtgcacgagcaaacaaggcggggac tggagtgcacgaggcctggaactggccagccgcttcggcaccagcaagagcaagctctgt gcgggccccatggcaacatccagccctctctacaccctactcccatccttgcccatccta tcgttcaaaagagttcttaccacctttttcattaatgcatccatcatgccctacattggt gatcctcaaaataggcacatgtcgtattatggaaatcacttcccagatcctcatcgtcct tag >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_5|71_aa MGPGPRLVVVIMVITAAEAATVETAAIALETGEAPHEGSARICVGRSAASTLTQVHNIHP LILSPFLRERI >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_5|216_bp atgggacctgggcccagattggtggtagtgatcatggtgataacagcagcagaagcagca actgtggagacagcagccatagccctggagacaggagaagctccacatgaaggcagtgct aggatttgtgttgggagatcagctgcttccaccctgactcaagtacacaacatccaccca ctaattctgtctccatttctgagagaaagaatctaa >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_6|101_aa MPFVYSEEEPRAAQQRGFPLLCSRVRNPERVSSERRPLSEWESIARAFPRARLRISAGEV GYDSPFPDLSACRVPVGTEGCSGALRCHNLPLSTPQSQSGS >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_6|306_bp atgccctttgtgtacagcgaagaggagccaagagctgctcagcaaaggggcttccctctg ctctgctctcgggttcggaatccggagcgagtttccagtgagcggcgcccgctgagcgag tgggaaagcatcgcgcgtgcctttccccgcgcgcgtctgcggatcagcgcaggcgaggtg gggtatgacagtccttttccagacctgtcagcatgccgagtgcctgtggggacagagggc tgctccggtgccctgaggtgccacaacttgcctctctccacaccccagagccagtcagga agctaa >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_7|678_aa MRAPGRPALRPLPLPPLLLLLLAAPWGRAAYDLRSGTVTMRGPSPASPRFPPPPPRPRGG APVDAASRPPGPAVLQAARREIRALKAKLPERNYSASYRMNSGKAVFATWRAAKEIGYPW KAAWNRNDIISNSWQCLLLIIMAVIHIITIVIVNVPCVSGGLPKPANITFLSINMKNVLQ WTPPEGLQGVKVTYTVQYFIYGQKKWLNKSECRNINRTYCDLSAETSDYEHQYYAKVKAI WGTKCSKWAESGRFYPFLETQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQI YSNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQ CARTLKDQSSEFKAKIIFWYVLPVSITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNE FDKRFFVPAEKIVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEE EVKHLGYASHLMEIFCDSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPE EQELSLQEEVSTQGTLLESQAALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEP STTLVDWDPQTGRLCIPSLSSFDQDSEGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENE TYLMQFMEEWGLYVQMEN >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_7|2037_bp atgcgggctcccggccgcccggccctgcggccgctgccgctgccgccgctgctgctgttg ctcctggcggcgccttggggacgggcagcctatgacttgcggtcggggacggttaccatg cggggcccctcgccggccagtcccaggttcccgccaccaccgccgcggccgcgcggaggg gcccctgtggatgccgcgtctcggccgccgggccccgcagtcctgcaagccgcgcgtcgc gagatccgcgctttaaaagccaaactccccgaaaggaactactcagcttcctatagaatg aattcaggaaaggcagtatttgcgacctggagagcagccaaggagattggttacccatgg aaagcggcatggaataggaacgacataataagtaattcgtggcaatgtttattactgata ataatggcagttattcatataataacaatagttattgttaacgttccctgtgtctctggt ggtttgcctaaacctgcaaacatcaccttcttatccatcaacatgaagaatgtcctacaa tggactccaccagagggtcttcaaggagttaaagttacttacactgtgcagtatttcata tatgggcaaaagaaatggctgaataaatcagaatgcagaaatatcaatagaacctactgt gatctttctgctgaaacttctgactacgaacaccagtattatgccaaagttaaggccatt tggggaacaaagtgttccaaatgggctgaaagtggacggttctatccttttttagaaaca caaattggcccaccagaggtggcactgactacagatgagaagtccatttctgttgtcctg acagctccagagaagtggaagagaaatccagaagaccttcctgtttccatgcaacaaata tactccaatctgaagtataacgtgtctgtgttgaatactaaatcaaacagaacgtggtcc cagtgtgtgaccaaccacacgctggtgctcacctggctggagccgaacactctttactgc gtacacgtggagtccttcgtcccagggccccctcgccgtgctcagccttctgagaagcag tgtgccaggactttgaaagatcaatcatcagagttcaaggctaaaatcatcttctggtat gttttgcccgtatctattaccgtgtttcttttttctgtgatgggctattccatctaccga tatatccacgttggcaaagagaaacacccagcaaatttgattttgatttatggaaatgaa tttgacaaaagattctttgtgcctgctgaaaaaatcgtgattaactttatcaccctcaat atctcggatgattctaaaatttctcatcaggatatgagtttactgggaaaaagcagtgat gtatccagccttaatgatcctcagcccagcgggaacctgaggccccctcaggaggaagag gaggtgaaacatttagggtatgcttcgcatttgatggaaattttttgtgactctgaagaa aacacggaaggtacttctctcacccagcaagagtccctcagcagaacaatacccccggat aaaacagtcattgaatatgaatatgatgtcagaaccactgacatttgtgcggggcctgaa gagcaggagctcagtttgcaggaggaggtgtccacacaaggaacattattggagtcgcag gcagcgttggcagtcttgggcccgcaaacgttacagtactcatacacccctcagctccaa gacttagaccccctggcgcaggagcacacagactcggaggaggggccggaggaagagcca tcgacgaccctggtcgactgggatccccaaactggcaggctgtgtattccttcgctgtcc agcttcgaccaggattcagagggctgcgagccttctgagggggatgggctcggagaggag ggtcttctatctagactctatgaggagccggctccagacaggccaccaggagaaaatgaa acctatctcatgcaattcatggaggaatgggggttatatgtgcagatggaaaactga >gi568815592r:136901561_137117106|GENSCAN_predicted_peptide_8|591_aa MDSAMEYARETASDINEDLRDSCDHETHPESWLNAQISRPHPKLSDSELQDEQPRVASRF SSDIYEEIIFSDHFIPKARADGSSPRKVQCLAKISEPPGGEAQKQTPAVWHESQCSSLHC TGWRETGGGRRIPGSHTDMILSNIPSAATTGARGMGAHASFDLSMSPVFQLSHTHSNKQG APKRDALPREKESKEAVWLQRLCRAVVGSAQFKLPGSFVYTVRGKPHTQASVMADTPPPT KLEHPRGVNGSVSLAFQVPLGYEKKFLQLAWCLPKWLRSFVLETQGPSGIGTKGISWSAG CEDHGKSVVSRLDSTIPHSTVSHGFPWLGEVVPRPLVLPRTQPTCTQVIKSFIAPNTKPV WWSLHTDASEIWCRDSDQGTSLGKSIPCPPALCSMKKIHLRPRVLRPTSPRNISPILNRQ LKTDTARLPWKPTEPSQMLWVTLTVEARFKKIKACYHSPATAWPFKAYKLSLQFPHFTCP RTREALQELFRPPPFPTHQARGFAPAQDWQIDFTHMPRVRKLKYLLVWVDAFTGWEADSE VDFSMYGVTGEGPCDKHLLEGGKGSKSGCRHHCKNEKASDEVGKNIRNAHI >gi568815592r:136901561_137117106|GENSCAN_predicted_CDS_8|1776_bp atggattctgctatggagtatgccagagagacagcttcagacatcaatgaggacttgcga gactcatgtgatcatgagactcacccagagtcttggttaaatgcacagatcagcaggccc caccccaaactttctgactcagaattacaagatgagcagcccagggtggcaagccgattt tcatcagacatttatgaggagatcattttctctgaccacttcattcccaaggccagggct gatgggagcagccccaggaaggtacagtgcttggctaagatctcagagccaccaggtggg gaagcacaaaagcagaccccagcagtatggcatgagagccagtgctcttccctgcactgc acaggctggagagaaacaggaggaggaaggaggattccgggtagtcatactgatatgata ttgagcaacataccatctgcagcaacaacaggggcaaggggaatgggtgcccatgcctcc tttgacctttccatgagccctgtcttccagctgtcacacacccacagcaataaacagggt gctccaaaaagagatgccctgcccagagagaaggaatctaaagaggcagtctggctacaa aggctttgccgagctgtggtgggctctgcccagttcaaacttcctggcagctttgtttac actgtgaggggaaaaccacatacacaagcctcagtaatggcagacactcctcccccgacc aagctggagcatcccaggggagtaaatggttctgtctcactggcgttccaggtgccactg gggtatgagaaaaaattcctgcagctagcttggtgtctgcccaaatggctgcgcagtttt gtgcttgaaacccagggccctagtggtataggcactaagggaatctcctggtctgcaggt tgtgaagaccacgggaaaagtgtagtatctaggctagatagcaccatccctcatagcaca gtttctcatggcttcccttggctaggggaggtagttccccgaccccttgtgcttcccagg actcagcccacctgcacccaggtgattaaaagctttattgcaccaaacacaaagcctgtt tggtggtctcttcacacggacgcaagtgaaatttggtgccgtgattcggatcaggggacc tcccttgggaaatcaatcccctgtcctcctgctctttgttccatgaaaaagatccaccta cgacctcgggtcctcagacccaccagtccaaggaacatctcaccaattttaaatcggcag ctgaagactgacactgcccgattgccttggaagcctacagaaccatcacagatgctctgg gtaacactcacagtggaggcacgctttaaaaagattaaagcctgttatcactcgcctgct acagcatggccttttaaagcctataaactctccttacaattcccccatttcacctgtcct agaaccagagaagccttacaggaattgttcaggcctcctccctttcctacacatcaagct cggggatttgcccctgcccaggactggcaaattgactttactcacatgccccgagtcagg aaactaaaatacctcttagtctgggtagacgctttcactggatgggaagcagactctgag gtagactttagtatgtacggtgttactggggagggcccttgtgataaacacctgctggag ggaggaaaaggaagcaagagtggatgcagacatcattgtaaaaatgaaaaggcaagtgat gaagtgggcaaaaatattcgcaacgcacatatctga