GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:05:55 Sequence gi568815593f:148726832_148928070 : 201239 bp : 40.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 1594 1435 160 0 1 72 79 104 0.158 6.97 1.01 Init - 7107 6980 128 0 2 82 80 60 0.215 4.28 1.00 Prom - 7888 7849 40 -4.85 2.00 Prom + 16807 16846 40 -3.95 2.01 Init + 46188 46556 369 2 0 46 85 199 0.921 12.44 2.02 Intr + 51325 51468 144 0 0 91 65 37 0.018 1.26 2.03 Intr + 67778 67882 105 1 0 18 61 136 0.029 3.39 2.04 Intr + 78936 79098 163 2 1 79 62 80 0.043 3.13 2.05 Term + 81156 81265 110 1 2 104 48 76 0.327 2.89 2.06 PlyA + 81827 81832 6 1.05 3.02 PlyA - 82357 82352 6 1.05 3.01 Sngl - 84745 84599 147 1 0 99 45 170 0.382 7.60 3.00 Prom - 86995 86956 40 -5.35 4.00 Prom + 93435 93474 40 -5.55 4.01 Init + 99104 99527 424 1 1 69 95 205 0.795 15.86 4.02 Intr + 99777 101174 1398 1 0 37 79 1431 0.907 125.33 4.03 Term + 105002 105132 131 0 2 71 48 86 0.564 0.26 4.04 PlyA + 105885 105890 6 1.05 5.00 Prom + 111441 111480 40 -3.55 5.01 Init + 121641 121732 92 2 2 65 100 61 0.364 5.01 5.02 Term + 131904 132084 181 0 1 104 48 184 0.846 12.10 5.03 PlyA + 132948 132953 6 -0.45 6.03 PlyA - 133169 133164 6 1.05 6.02 Term - 133537 133424 114 1 0 131 43 97 0.802 7.09 6.01 Init - 134130 134080 51 0 0 37 103 30 0.454 0.61 6.00 Prom - 137671 137632 40 -4.75 7.08 PlyA - 138607 138602 6 -0.45 7.07 Term - 139042 138879 164 2 2 34 55 224 0.042 10.82 7.06 Intr - 153092 153012 81 0 0 100 92 29 0.328 3.19 7.05 Intr - 155233 155141 93 2 0 69 93 94 0.911 7.02 7.04 Intr - 163268 163189 80 1 2 69 99 48 0.830 2.28 7.03 Intr - 164160 163903 258 2 0 9 91 168 0.001 4.86 7.02 Intr - 176808 176721 88 1 1 106 91 91 0.574 9.31 7.01 Init - 181508 181451 58 2 1 58 89 5 0.298 -0.88 7.00 Prom - 182616 182577 40 -4.15 8.04 PlyA - 183220 183215 6 1.05 8.03 Term - 183415 183264 152 2 2 88 43 142 0.884 6.79 8.02 Intr - 185893 185786 108 2 0 77 46 97 0.487 3.74 8.01 Init - 188461 188275 187 1 1 62 85 62 0.221 2.60 8.00 Prom - 191797 191758 40 -3.65 9.02 PlyA - 191993 191988 6 -1.95 9.01 Sngl - 192390 192004 387 0 0 71 39 216 0.877 10.96 9.00 Prom - 192444 192405 40 -7.35 10.04 PlyA - 192561 192556 6 1.05 10.03 Term - 193345 192693 653 2 2 74 43 328 0.990 20.11 10.02 Intr - 195074 195012 63 0 0 98 84 37 0.761 2.07 10.01 Intr - 199307 199164 144 0 0 104 71 30 0.537 2.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 164027 163903 125 2 2 81 91 134 0.947 12.69 S.002 Sngl + 167195 167665 471 1 0 81 36 146 0.817 4.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_1|96_aa MICSAVWGMGWQKKGVGDGETESYRRSLTGFHSHKFQTAVLVRWLSPKSGQLETVSRGCE WSNNKKKHSLSTCSGRDPCSHGAYILRSGGGDGSRT >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_1|288_bp atgatctgttctgcagtgtgggggatgggttggcagaagaaaggtgtgggagatggggaa actgagtcctacaggcgatcacttactggctttcattcacacaagttccaaacggcagta ctggtcaggtggctgtcgccgaagtcaggacaattggagacagtttcaaggggctgtgaa tggagcaacaacaagaaaaaacactcattgagcacctgctctggcagagatccctgttcc catggagcttacatcctgaggagtgggggtggagatgggtcgaggaca >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_2|296_aa MGTISEPIWRTNDAQQYHVLGTTEMTVMSIFSFHQDPSLPHRPYPLSSKEVNASSMTDFT YYQLSYMEAENFSMPKSWSRSVPVVFTYVYQCMGMKTFWENMQETAEGGCLKVEELRIYG EGEQIGNISYQIMYCSQAIQTLGTIRGYLEEKQLVYYFSKYYTYKTSNITKCKDYEIKAA ACGIRKYLAEDMGASGPEVQEDLTGQATPKTSIGQMWAVNASLQLLLMAKAVWGRGKFTE GLGLGSNRSTFVVQKIPGVAATLCFAPVAALFTQSLLHFPVTSKKYFRSLLLNRLP >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_2|891_bp atggggaccatctccgagcccatctggaggaccaatgatgcccagcagtatcacgttctg ggaacaacagaaatgacagtgatgagcatcttcagctttcatcaggaccctagccttccc catcgtccataccctttaagctccaaagaagtgaatgcttcaagcatgacagatttcact tactatcaactttcatatatggaggctgagaactttagtatgccaaaatcatggagcaga tctgtacctgttgtgtttacatatgtatatcagtgtatgggcatgaaaactttctgggag aacatgcaagaaactgctgaaggtggttgcctcaaggtagaagaattaaggatctatggg gaaggggagcaaattggcaacatttcctatcagattatgtattgttcacaggctatacaa actctaggaactatcaggggttatttggaagaaaaacaactggtgtactatttttctaaa tactacacctacaaaacatccaatattaccaagtgcaaagactatgaaataaaggcagca gcatgtgggatcagaaagtatttagcggaagacatgggagcttcaggtccagaggtccag gaagacctcactgggcaggccacaccaaagacatccataggccagatgtgggctgtgaat gccagtttgcaactgctgctaatggcaaaggcagtgtgggggcgggggaagttcactgaa ggtcttgggctgggaagtaacaggagcacgtttgtagttcagaaaattcctggggtagct gcaacactctgctttgctcccgtggctgctctgtttacccagtctctcctccattttcca gtaaccagtaagaagtactttaggtctctcctcctcaacaggctcccctga >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_3|48_aa MNTFLKRVTSQEEPQAGPSGGIPEEEGIVITGHDSSLGVIALKGLPVG >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_3|147_bp atgaacacatttctgaaaagagtgacatctcaagaagagcctcaggcaggtccttcagga ggtatcccagaagaagaaggcattgttatcacaggacatgacagctccctgggtgttatt gcccttaaaggccttccagtaggataa >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_4|650_aa METSVSVSLWMPPSQRVFTFCVCHHVFVLLGASVFVSGRVSVLDRGDFVPDGFCVRARAS VHVGELGGCVSVSMAVVRYKSEHVCQGVFVPVCACLGGHSRFLPNVGQCRCAALCLETSS RAGAQGRQVAATEEPKAPGLAERTAKRLLQSTGWNWQAPRAPSTRQAECAGRVPTTPTPQ PLNEASRRPLAARRAPPWVRPLRRPQPVRSPARLRAMGQPGNGSAFLLAPNGSHAPDHDV TQERDEVWVVGMGIVMSLIVLAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGL AVVPFGAAHILMKMWTFGNFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSL LTKNKARVIILMVWIVSGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASS IVSFYVPLVIMVFVYSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKF CLKEHKALKTLGIIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPL IYCRSPDFRIAFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGT EDFVGHQENQTKMERGYETLKRTQQSKAETQAALRPGTLSQLQSWEGLLN >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_4|1953_bp atggagacatccgtgtctgtgtcgctctggatgcctccaagccagcgtgtgtttactttc tgtgtgtgtcaccatgtctttgtgcttctgggtgcttctgtgtttgtttctggccgcgtt tctgtgttggacaggggtgactttgtgccggatggcttctgtgtgagagcgcgcgcgagt gtgcatgtcggtgagctgggagggtgtgtctcagtgtctatggctgtggttcggtataag tctgagcatgtctgccagggtgtatttgtgcctgtatgtgcgtgcctcggtgggcactct cgtttccttccgaatgtggggcagtgccggtgtgctgccctctgccttgagacctcaagc cgcgcaggcgcccagggcaggcaggtagcggccacagaagagccaaaagctcccgggttg gctgaacgcactgcgaagcggcttcttcagagcacgggctggaactggcaggcaccgcga gcccctagcacccgacaagctgagtgtgcaggacgagtccccaccacacccacaccacag ccgctgaatgaggcttccaggcgtccgctcgcggcccgcagagccccgccgtgggtccgc ccgctgaggcgcccccagccagtgcgctcacctgccagactgcgcgccatggggcaaccc gggaacggcagcgccttcttgctggcacccaatggaagccatgcgccggaccacgacgtc acgcaggaaagggacgaggtgtgggtggtgggcatgggcatcgtcatgtctctcatcgtc ctggccatcgtgtttggcaatgtgctggtcatcacagccattgccaagttcgagcgtctg cagacggtcaccaactacttcatcacttcactggcctgtgctgatctggtcatgggcctg gcagtggtgccctttggggccgcccatattcttatgaaaatgtggacttttggcaacttc tggtgcgagttttggacttccattgatgtgctgtgcgtcacggccagcattgagaccctg tgcgtgatcgcagtggatcgctactttgccattacttcacctttcaagtaccagagcctg ctgaccaagaataaggcccgggtgatcattctgatggtgtggattgtgtcaggccttacc tccttcttgcccattcagatgcactggtaccgggccacccaccaggaagccatcaactgc tatgccaatgagacctgctgtgacttcttcacgaaccaagcctatgccattgcctcttcc atcgtgtccttctacgttcccctggtgatcatggtcttcgtctactccagggtctttcag gaggccaaaaggcagctccagaagattgacaaatctgagggccgcttccatgtccagaac cttagccaggtggagcaggatgggcggacggggcatggactccgcagatcttccaagttc tgcttgaaggagcacaaagccctcaagacgttaggcatcatcatgggcactttcaccctc tgctggctgcccttcttcatcgttaacattgtgcatgtgatccaggataacctcatccgt aaggaagtttacatcctcctaaattggataggctatgtcaattctggtttcaatcccctt atctactgccggagcccagatttcaggattgccttccaggagcttctgtgcctgcgcagg tcttctttgaaggcctatgggaatggctactccagcaacggcaacacaggggagcagagt ggatatcacgtggaacaggagaaagaaaataaactgctgtgtgaagacctcccaggcacg gaagactttgtgggccatcaagagaatcagaccaaaatggaaagaggatatgaaactcta aaacgaacacagcagagtaaagcagaaacacaggctgccctacgtcctggcactctttct cagctccaaagttgggaaggcctcctaaattga >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_5|90_aa MTYSNYFIFEDEEMEVQIREGCHHQLVSQLRASHHKPACFGVLVGGAAEIHTGYAFGCVY QPQPRLPWLKVAEAEGFDYSGGPSLPAPYP >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_5|273_bp atgacctactccaactacttcatttttgaagatgaggaaatggaggtccagataagggaa gggtgtcaccatcagcttgtgtcacagctaagggcttcccaccacaaacctgcgtgcttt ggcgtcttagtaggaggggctgctgaaattcataccggatatgcctttggctgtgtttat cagccccagcccaggctcccttggctgaaggtggctgaagcagagggctttgactattca ggtggccccagcctgcctgcaccctatccctga >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_6|54_aa MHKAGPWQRIIWYKQSMMVPNLSKTRVEFLIEGFFSHALRFAEYHRTWKHQLGN >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_6|165_bp atgcacaaggcaggcccatggcaaagaattatctggtataaacagtcgatgatggtgccg aatcttagtaagacaagagtggaatttctcatagagggctttttctctcacgcactgcgc tttgcagaatatcacagaacttggaaacaccagcttgggaattaa >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_7|273_aa MEPPQASRIQVSSNSLSSSGLQTRLDDHSLRRYKGALDIEYKGQLMSARKLKLDPFLTPY AKINSRWIKDLNIRCKTIKTLEENLGNTIQDIDMGKDFMTKTPKAMATEVKIDKWDLIKL KSFHTAKETIIEVNRDMDEAGNHHSQQTNSETENQTPHVLTLKQEIGNDKPDANQSHKKN GEEKECFKKRGEVFHMLVLLSFHFPESVHRPLSATLEVSGKLAASWVLHSSSPSLWNVLS TLLDNDDWYDVDCYDGDDDDDDDDDNVTTRLNY >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_7|822_bp atggagcctccacaggcatccaggatccaggtttcttccaattctctatcatcctcaggt ctccagacaaggctagatgaccactcactgagaaggtataaaggggctttggatattgaa tataaggggcagctgatgtcagccagaaaactgaaactggaccccttccttacaccttat gcaaaaattaactcaagatggattaaagacttaaacataagatgtaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacatagacatgggcaaagacttcatgact aaaacaccaaaagcaatggcaacagaagtcaaaattgacaaatgggatctaattaaacta aagagcttccacacagcaaaagaaactatcatcgaggtgaacagggacatggatgaagct ggaaaccatcattctcagcaaactaactcagaaacagaaaaccaaacaccgcatgttctc actctaaaacaagaaatagggaatgataagccagatgccaaccaatcacataagaaaaat ggagaggagaaagaatgtttcaaaaagagaggagaagtattccacatgctagtcctactg agctttcattttccagaatcagtccacaggcccttgtctgcaacccttgaggtctcaggc aagctggctgcctcctgggtgctccactcctcaagtccctccctgtggaatgtcctgtcc actttgcttgacaatgatgattggtatgatgtcgattgctatgatggtgatgatgatgat gatgatgatgacgacaatgtgaccacaaggctcaattactga >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_8|148_aa MSEPLPSTDNLGLQNLSSSQASDLGGKGSENVATVWNSQACLVGQLLIDLIGDQSNLFLR TSAQGKERGKLSGSGFGESLLLLKETVEAAREQVKELGECTPSSSRNPQGSVAESGSVHQ RQSVHVAMICDDISTYMENKLNFHFCDL >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_8|447_bp atgtctgagcctctgcccagcactgacaatctggggctccagaatctgtcgtccagccag gccagtgacctaggtgggaagggcagtgagaacgtagccactgtttggaattcacaggct tgtttggtgggacaactcctaatagatttgattggggatcagagcaacctatttctccga acctctgcccaaggaaaagagagaggcaaactctctggttcaggcttcggtgagagcctt cttttactcaaagagactgtggaggctgccagagagcaagtcaaggagttgggagagtgc accccaagtagttccaggaacccccagggcagtgttgctgaaagtggctctgtgcaccag cgccaatctgtgcacgttgctatgatctgcgatgacattagcacatacatggagaataag ttgaactttcatttctgtgacctctag >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_9|128_aa MGKDFMTNTPKAMETKAKIDKWDLTKLKSFCTAKESTIRVNRQPTEWEKIFTIYPSDKEL ISRIYKELKQIYKKKVKQPHRKVGEGYEQKLHKRRHLCSQQTHEKMLIITGHQRNANQND NETLSHTS >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_9|387_bp atgggcaaggacttcatgactaacacaccaaaagcaatggaaacaaaagccaaaattgac aaatgggatctaactaaactaaagagcttctgcacagcaaaagaaagtaccatcagagtg aacaggcaacctacagaatgggagaaaatttttacaatctacccatctgacaaagagcta atatccagaatctacaaggaacttaaacaaatttacaagaaaaaagtcaaacaaccccat cgaaaagtgggcgaaggatatgaacagaaacttcacaaaagaagacatttatgcagccaa caaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaatgac aatgagacactatctcacaccagttag >gi568815593f:148726832_148928070|GENSCAN_predicted_peptide_10|286_aa XDIVMLLKAWALEKGKSGFESWLFHLLTESLWASFLTSQILSYLICKMAGPEPSESQHSS REGLTLPSPGLEVLARAIRQEKEIKGIQLGKEEVKLPLFADDTIVYLENPVVSAQNLLKL ISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLF KENYKPLLNEIKEDTNKWKNIPCSWIGGIDIVKMAILPKGIYRFNAIPIKLPMTFFTELE KTTLKFMWNQKRACIAKTIISQKNKAGGITLPDFKLYYKATVTTTA >gi568815593f:148726832_148928070|GENSCAN_predicted_CDS_10|861_bp ngagatattgtgatgcttttaaaagcatgggctttagagaaaggcaagtctgggtttgaa tcatggcttttccacttactaactgagtcactttgggcaagtttcttaacctctcagatt ctcagttacctcatctgtaaaatggctggcccagagccttctgaatctcaacacagcagt agagaaggtttaaccttgccatcaccagggttggaagttctggccagggcaatcaggcag gagaaagaaataaagggtattcaattaggaaaagaggaagtcaaattgcccctgtttgca gatgacacgattgtatatttagaaaaccccgtcgtctcagcccaaaatctccttaagctg ataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattc ttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt gcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc aaggagaactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaac attccatgctcatggataggaggaattgatatcgtgaaaatggccatactgcccaaggga atttacagattcaatgccatccccatcaagctaccaatgactttcttcacagagttggaa aaaactactttaaagttcatgtggaaccaaaaaagagcctgcattgccaagacaatcata agccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggct acagtaaccacaacagcatga