GENSCAN 1.0	Date run: 7-Nov-116	Time: 02:58:38

Sequence gi568815593f:159910529_160165062 : 254534 bp : 43.53% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6378   7326  949  2  1   55  111  1389 0.796 131.62
 1.02 Intr +   9194   9309  116  0  2   64  110    10 0.419   0.97
 1.03 Intr +  17766  17907  142  2  1   46   58    73 0.226   0.03
 1.04 Term +  21163  21356  194  2  2   61   46   122 0.682   2.88
 1.05 PlyA +  23291  23296    6                               1.05

 2.04 PlyA -  25619  25614    6                               1.05
 2.03 Term -  40109  39965  145  1  1   53   41   209 0.857  10.08
 2.02 Intr -  40317  40185  133  1  1  -20   55   208 0.820   6.40
 2.01 Init -  40661  40322  340  2  1   97   45   414 0.936  33.42
 2.00 Prom -  41270  41231   40                              -3.96

 3.00 Prom +  49174  49213   40                              -7.26
 3.01 Init +  54231  54385  155  2  2   53   80   175 0.946  12.69
 3.02 Intr +  60164  60252   89  2  2   44   91   -19 0.111  -6.39
 3.03 Term +  61351  61964  614  2  2  108   49  1084 0.556 101.15
 3.04 PlyA +  63342  63347    6                               1.05

 4.00 Prom +  64189  64228   40                              -3.16
 4.01 Init + 100001 100330  330  1  0  102  111   312 0.993  32.21
 4.02 Term + 101488 101526   39  0  0   70   52    40 0.402  -4.21
 4.03 PlyA + 102842 102847    6                               1.05

 5.04 PlyA - 103527 103522    6                               1.05
 5.03 Term - 113450 113170  281  0  2   30   44   287 0.958  14.11
 5.02 Intr - 115172 115107   66  0  0   93   61    41 0.615   0.78
 5.01 Init - 117087 117033   55  0  1   59  115    29 0.788   4.15
 5.00 Prom - 120578 120539   40                              -8.36

 6.00 Prom + 121171 121210   40                              -5.06
 6.01 Init + 121482 121484    3  2  0   62  115     0 0.314   0.10
 6.02 Intr + 123794 123841   48  1  0   90   73    50 0.398   2.58
 6.03 Intr + 124612 124672   61  0  1   61   94    25 0.256  -1.29
 6.04 Intr + 126163 126275  113  2  2  109   91    -8 0.324   1.70
 6.05 Intr + 132605 132641   37  1  1   81   87    57 0.655   2.74
 6.06 Intr + 138986 139134  149  0  2  100   53    47 0.909   2.35
 6.07 Intr + 140601 140655   55  2  1  117   87    53 0.910   6.65
 6.08 Term + 154404 154537  134  1  2   87   42   106 0.724   4.15
 6.09 PlyA + 154995 155000    6                               1.05

 7.03 PlyA - 156066 156061    6                               1.05
 7.02 Term - 161349 161203  147  0  0   81   54   138 0.938   7.60
 7.01 Init - 168764 168708   57  2  0   79   77    11 0.467   0.41
 7.00 Prom - 173627 173588   40                              -2.96

 8.03 PlyA - 173651 173646    6                               1.05
 8.02 Term - 183537 181854 1684  2  1   93   47   627 0.029  47.49
 8.01 Init - 208860 208277  584  0  2   75   81   862 0.818  76.86
 8.00 Prom - 228728 228689   40                              -2.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 183470 181854 1617  2  0   93   47   578 0.965  49.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_1|466_aa
MNPDLDTGHNTSAPAHWGELKNANFTGPNQTSSNSTLPQLDITRAISVGLVLGAFILFAI
VGNILVILSVACNRHLRTPTNYFIVNLAMADLLLSFTVLPFSAALEVLGYWVLGRIFCDI
WAAVDVLCCTASILSLCAISIDRYIGVRYSLQYPTLVTRRKAILALLSVWVLSTVISIGP
LLGWKEPAPNDDKECGVTEEPFYALFSSLGSFYIPLAVILVMYCRVYIVAKRTTKNLEAG
VMKEMSNSKELTLRIHSKNFHEDTLSSTKAKGHNPRSSIAVKLFKFSREKKAAKTLGIVV
GMFILCWLPFFIALPLACRGCDQSMLRKEKASAGCRNGSKVQALAFRGRGNSSKQHHESA
QDTVESVHIGRLCSYDSIDSTLLATLTRSSSDHPLNTQVPLSVLEARKSKIKEQEHSVSG
EDLPIDGDFYVSSHGRRGKQALECLFYKYTDPIYQSSALVTYLPFT
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_1|1401_bp
atgaatcccgacctggacaccggccacaacacatcagcacctgcccactggggagagttg
aaaaatgccaacttcactggccccaaccagacctcgagcaactccacactgccccagctg
gacatcaccagggccatctctgtgggcctggtgctgggcgccttcatcctctttgccatc
gtgggcaacatcctagtcatcttgtctgtggcctgcaaccggcacctgcggacgcccacc
aactacttcattgtcaacctggccatggccgacctgctgttgagcttcaccgtcctgccc
ttctcagcggccctagaggtgctcggctactgggtgctggggcggatcttctgtgacatc
tgggcagccgtggatgtcctgtgctgcacagcgtccattctgagcctgtgcgccatctcc
atcgatcgctacatcggggtgcgctactctctgcagtatcccacgctggtcacccggagg
aaggccatcttggcgctgctcagtgtctgggtcttgtccaccgtcatctccatcgggcct
ctccttgggtggaaggagccggcacccaacgatgacaaggagtgcggggtcaccgaagaa
cccttctatgccctcttctcctctctgggctccttctacatccctctggcggtcattcta
gtcatgtactgccgtgtctatatagtggccaagagaaccaccaagaacctagaggcagga
gtcatgaaggagatgtccaactccaaggagctgaccctgaggatccattccaagaacttt
cacgaggacacccttagcagtaccaaggccaagggccacaaccccaggagttccatagct
gtcaaactttttaagttctccagggaaaagaaagcagctaagacgttgggcattgtggtc
ggtatgttcatcttgtgctggctacccttcttcatcgctctaccgcttgcctgcagaggc
tgtgaccagagcatgctcaggaaggagaaggcctcagcaggatgcaggaatggatcaaag
gtgcaggctttggcttttaggggcaggggaaatagctccaaacagcaccatgagtcagct
caggatacagtagaatctgtccacattggcaggctctgctcttacgactctattgacagc
acactcctggccacactcacccgcagctcatctgaccaccctttgaacactcaagtgccc
ctctctgttctggaggctcggaagtccaagatcaaggagcaagaacattcagtgtctggt
gaggacctgcccatagatggtgacttctatgtgtcctcacatggcagaaggggcaaacag
gctctcgaatgcctcttttataagtacactgatcccatttatcaaagttctgccctggtg
acctacttgcctttcacatga
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_2|205_aa
MEKAGADLQGGATRVIISAPSADAPMFVMGMNHKKYDNRLNIVSYASCTTNCLVPLGEVI
HDNFGIVERFQNTVHAITATQKIVDGLSGKLWCDGCGALQNITPASTGTAKAVGVISKLN
KKLTDMGFCVHITNVSVMDLTCRLEKPAKYDDIKKVVKDTHSSTFDAGAGIALKDHFVKL
ISWYDNEFGYSNRVVDLMAHIASKE
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_2|618_bp
atggagaaggctggggctgacttgcagggaggagctacaagggtcatcatctctgccccc
tctgctgatgcccccatgttcgtgatgggcatgaaccacaagaagtatgacaacagactc
aatatcgtcagctatgcctcctgcaccaccaactgcttagtgcccctgggcgaggtcatc
catgacaactttggtattgtggaaagattccagaatacagtccacgccatcactgccacc
cagaagattgtggatggcctctctgggaaactgtggtgcgacggctgcggggctctccag
aacatcacccctgcctctactggcactgccaaggctgtgggggtcatctctaagctgaac
aagaagctcactgacatgggcttctgtgtccacatcaccaacgtgtcagtcatggacctc
acctgccgtctcgaaaaacctgccaaatatgatgacattaagaaggtggtgaaagacacc
cactcctccacctttgatgctggggccggcattgccctcaaagaccattttgtcaagctc
atttcctggtatgacaacgaatttggctacagcaacagggtggtggacctcatggcccac
atagcctccaaggagtaa
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_3|285_aa
MPTQHIDLILFTTFFILVAVVLVVTAVLAVVTVAALKAVIIAILIECFQCASHEQSSSCP
HTPHPCVNLELCSFLIFASLSGSLFSTLKPPDAVFKVVFWLGYFNSCLNPIIYPCSSKEF
KRAFVRILGCQCRGRGRRRRRRRRRLGGCAYTYRPWTRGGSLERSQSRKDSLDDSGSCLS
GSQRTLPSASPSPGYLGRGAPPPVELCAFPEWKAPGALLSLPAPEPPGRRGRHDSGPLFT
FKLLTEPESPGTDGGASNGGCEAAADVANGQPGFKSNMPLAPGQF
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_3|858_bp
atgcccacacagcacatagacctcatcctcttcaccaccttcttcatcttggtagcagtg
gtactggtggtaacagcggtgctggcagttgtaacagtagcagcgctcaaagcagtcatc
atagctatacttattgagtgctttcaatgtgccagccatgaacaaagctcctcgtgtccc
cacaccccacacccctgtgtgaacttggaactatgcagcttcctaatttttgccagccta
tcaggctccttgttctccaccctgaagccccccgacgccgtgttcaaggtggtgttctgg
ctgggctacttcaacagctgcctcaaccccatcatctacccatgctccagcaaggagttc
aagcgcgctttcgtgcgcatcctcgggtgccagtgccgcggccgcggccgccgccgacgc
cgccgccgccgtcgcctgggcggctgcgcctacacctaccggccgtggacgcgcggcggc
tcgctggagcgctcgcagtcgcgcaaggactcgctggacgacagcggcagctgcctgagc
ggcagccagcggaccctgccctcggcctcgccgagcccgggctacctgggccgcggcgcg
ccaccgccagtcgagctgtgcgccttccccgagtggaaggcgcccggcgccctcctgagc
ctgcccgcgcctgagccccccggccgccgcggccgccacgactcgggcccgctcttcacc
ttcaagctcctgaccgagcccgagagccccgggaccgacggcggcgccagcaacggaggc
tgcgaggccgcggccgacgtggccaacgggcagccgggcttcaaaagcaacatgcccctg
gcgcccgggcagttttag
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_4|122_aa
MGEKSENCGVPEDLLNGLKVTDTQEAECAGPPVPDPKNQHSQSKLLRDDEAHLQEDQGEE
ECFHDCSASFEEEPGADKVENKSNEDVNSSELDEEYLIELEKNMSDEEKQCAPPEQEEKG
QK
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_4|369_bp
atgggggagaagtcagagaactgtggggttccagaggatctgttaaatggtttgaaggtt
acagatactcaggaagccgagtgtgctggccctccagttcctgatcccaaaaatcagcat
tcccagagtaagctgctcagggatgatgaggcccatctccaggaggaccagggagaagag
gagtgttttcatgactgcagtgcctcatttgaggaggagccaggagcggacaaggttgag
aacaaatctaatgaagatgtgaattcctctgaactagatgaagaatacctaatagaactg
gaaaaaaacatgtcggatgaagagaaacagtgtgcaccaccagaacaggaggaaaaaggg
caaaagtga
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_5|133_aa
MTVDGHSKPQEEMKNIRNGVEIKNCLQEVELSNAQKLLQKGGTKYSLIPDEEEEKEEAKS
AEFEKPDPTRNPSRKRKEEKKKNWKSSDSDSSDSENDTGKRARHTSKDSKAAKKKKKKKH
KKKHIGVERLGNM
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_5|402_bp
atgacagtagatggtcactcaaaaccacaggaggaaatgaagaacatcagaaacggtgtg
gaaataaagaattgtttgcaggaagtagaactgagtaatgcacagaagctgctgcaaaaa
ggtgggaccaaatactctctgatacctgatgaggaagaggaaaaggaagaggcaaagtca
gcagagtttgagaagcctgaccccacaaggaatccttctagaaaaagaaaggaggagaag
aaaaagaattggaagtcatctgactctgacagctcagactctgagaatgatacaggcaag
agggcaaggcacacatcaaaagacagcaaggcagcaaagaagaagaaaaagaagaagcac
aagaagaagcacataggagtggaaagattaggcaacatgtga
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_6|199_aa
MDNILSAFVFDEFLAIYKRREESTRLKEEGNEQFKKGDYIEAESSYSRALEMCPSCFQKE
RSILFSNRAAARMKQDKKEMAINDCSKAIQLNPSYIRAILRRAELYEKTDKLDEALEDYK
SILEKDPSIHQAREACMRLPKQIEERNERLKEEMLGKLKDLGNLVLRPFGLSTENFQIKQ
DSSTGSYSINFVQNPNNNR
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_6|600_bp
atggataacattctctcagcctttgtgtttgatgagtttttggccatctataaaagaaga
gaagagagcactagactaaaggaggagggaaatgaacagtttaagaaaggagattatata
gaagctgaaagttcttatagtcgagccctcgaaatgtgcccatcctgcttccaaaaggag
aggtcgattctattttcaaatagagctgcagcaaggatgaaacaggacaagaaagaaatg
gccatcaatgactgcagcaaagcaattcaattaaaccccagctatatcagggcaatattg
aggagagcagagttgtatgagaagacggacaagctagatgaagccctggaagactataaa
tctatattagaaaaagatccatcaatacatcaagcaagagaagcttgtatgagattacct
aagcaaattgaagaacgtaatgaaagactaaaagaagagatgttaggtaaattaaaagat
cttgggaacttggttctccgaccttttgggctctccacggaaaatttccagatcaaacag
gattcctctaccggctcgtactccatcaatttcgttcaaaatccaaataataacagataa
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_7|67_aa
MGTPEGNTKSSQFCQIMGKGRDEHMEETGTVMKGSDHRKQQEGTYNPAGSDHQLQAHKAA
SEWGLRS
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_7|204_bp
atgggaacaccggaaggcaacacaaaatcaagtcagttctgtcaaatcatggggaagggg
agggacgagcatatggaggaaaccggcacagtaatgaagggaagtgatcacagaaaacaa
caggaaggaacatacaatccagctggcagcgaccatcagctgcaggctcataaggctgca
tctgaatgggggctgcgcagctga
>gi568815593f:159910529_160165062|GENSCAN_predicted_peptide_8|755_aa
MAAVAAEAAATAASPGEGGAGEAEPEMEPIPGSEAGTDPLPVTATEASVPDGETDGQQSA
PQADEPPLPPPPPPPGELARSPEAVGPELEAEEKLSVRVAESAAAAPQGGPELPPSPASP
PEQPPAPEEREEPPLPQPVAPALVPPAGGDSTVSQLIPGSEVRVTLDHIIEDALVVSFRF
GEKLFSGVLMDLSKRFGPHGIPVTVFPKREYKDKPEAMPLQSNTFQEGTEVKCEANGAVP
DDPSPVPHPELSLAESLWTSKPPPLFHEGAPYPPPLFIRDTYNQSIPQPPPRKIKRPKRK
MYREEPTSIMNAIKLRPRQVLCDKCKNSVVAEKKEIRKGSSATDSSKYEDKKRRNESVTT
VNKKLKTDHKVDGKNQNESQKRNAVVKVSNIAHSRGRVVKVSAQANTSKAQLSTKKVLQS
KNMDHAKAREVLKIAKEKAQKKQNETSTSKNAHSKVHFTRRYQNPSSGSLPPRVRLKPQR
YRNEENDSSLKTGLEKMRSGKMAPKPQSRCTSTRSAGEAPSENQSPSKGPEEASSEVQDT
NEVHVPGDQDEPQTLGKKGSKNNISVYMTLNQKKSDSSSASVCSIDSTDDLKSSNSECSS
SESFDFPPGSMHAPSTSSTSSSSKEEKKLSNSLKMKVFSKNVSKCVTPDGRTICVGDIVW
AKIYGFPWWPARILTITVSRKDNGLLVRQEARISWFGSPTTSFLALSQLSPFLENFQSRF
NKKRKGLYRKAITEAAKAAKQLTPEVRALLTQFET
>gi568815593f:159910529_160165062|GENSCAN_predicted_CDS_8|2268_bp
atggcggccgtggctgcagaggcggcagcgactgcagcgtcccccggggaggggggcgcc
ggcgaggccgagccggagatggagcccatccccggcagtgaggccggcactgaccccctc
ccggtcacggccactgaagcgtctgtgccggatggcgagactgacgggcagcaatccgct
cctcaggccgacgagccgccgctcccgccgccaccgccgccgccgggggagctcgcccgc
agcccagaggcggtggggccggagctggaggctgaggagaaactgtccgttcgggtggcg
gagtcggcggcagccgcgcctcagggagggccggaacttccaccttctcctgcatcgccg
ccggagcagcccccggctcccgaggagcgcgaggagccgccgctgcctcagcccgtagcc
ccggcgctcgtgccgccggcgggcggggactccacggtgtcgcaactgatcccgggctcg
gaggtgcgggtcacgctggaccacatcattgaggacgcgcttgtcgtgtcgttccgcttc
ggggagaagctcttctccggggtcctcatggatctgtccaaaaggtttgggccccatggt
atccctgtgacagtatttcccaaaagggaatataaggataaaccagaagccatgccgctc
caaagtaatacattccaagaagggacagaagtcaagtgtgaagcaaatggtgctgttccc
gatgacccttctcctgtcccgcatcccgagctgagcttggctgaaagcctgtggacttcc
aaaccaccacctctcttccatgaaggagcaccttatcctccccctttgtttatcagggac
acatataaccaatcaatacctcagccacctcctcggaaaattaagcgacccaaacgaaaa
atgtacagggaagaacccacttcaataatgaatgctattaaactacgacccaggcaagtt
ctgtgtgataaatgtaaaaacagtgttgttgctgaaaaaaaggaaattagaaaaggtagt
agtgcaactgactcttctaaatatgaagataaaaaacggagaaatgaaagtgtaactact
gtgaacaaaaaactgaaaactgaccataaagtggatgggaaaaaccaaaatgaaagccag
aaaagaaatgctgtggttaaggtttcaaatattgctcacagcagaggcagagtagtaaaa
gtttctgctcaggcaaatacatcaaaagctcagttaagtactaaaaaagttctccagagt
aagaacatggatcatgcgaaagctcgggaagtgttaaaaattgccaaagaaaaggcacaa
aagaagcaaaatgaaacctctacttccaaaaatgcacattcaaaagtccatttcacacgt
cgatatcagaatcctagctcaggttcccttccaccccgggttcgtttaaaaccacagagg
tacaggaatgaagaaaatgactcttctctgaagacaggacttgagaaaatgcggagtggc
aagatggcacccaagccccagtctcgctgcacctctacccgctcagcaggtgaggcccct
tcagaaaatcagagtccctcaaaaggccctgaagaggccagcagtgaggttcaggacaca
aatgaagtgcatgtgcctggtgatcaggatgaaccacagacattgggcaaaaagggcagc
aaaaacaatatctctgtttatatgaccctaaatcaaaagaaatctgactcttccagtgct
tcagtgtgtagcattgatagcacagatgatttgaaatcttccaactctgagtgtagttct
tctgaaagctttgattttcctccaggcagtatgcatgcaccttccacctcctccacttcc
tcctcttcaaaggaagagaaaaagctcagtaattccttgaaaatgaaagtcttttccaaa
aacgtctctaaatgcgtcacaccagatggcaggaccatatgtgtaggggacattgtttgg
gccaagatatatggcttcccttggtggccagcccgtattcttactataactgtgagccgg
aaagataacggccttttagtccgacaggaggcccgtatttcatggtttgggtctccaaca
acatctttccttgctctttcacaactctccccctttttagaaaacttccagtcacgcttt
aataagaagagaaagggcctgtatcgcaaggctatcacagaggcagctaaggctgccaag
cagctgacccccgaagtgcgggctttgttgacacagtttgaaacgtga