GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:02:02 Sequence gi568815581f:55165485_55420876 : 255392 bp : 41.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 609 604 6 1.05 1.02 Term - 16309 16282 28 0 1 122 44 20 0.658 -2.43 1.01 Init - 19929 19679 251 0 2 62 41 574 0.973 47.08 1.00 Prom - 29308 29269 40 -6.25 2.00 Prom + 33160 33199 40 -6.95 2.01 Init + 36124 36228 105 0 0 76 115 111 0.954 12.87 2.02 Term + 44148 44420 273 2 0 48 48 168 0.522 3.29 2.03 PlyA + 45223 45228 6 1.05 3.06 PlyA - 45749 45744 6 1.05 3.05 Term - 55068 54900 169 2 1 36 41 100 0.205 -3.53 3.04 Intr - 58178 58012 167 2 2 38 110 107 0.724 5.74 3.03 Intr - 59512 59424 89 2 2 77 71 50 0.769 0.97 3.02 Intr - 59649 59589 61 0 1 106 68 47 0.708 1.89 3.01 Init - 60966 60907 60 0 0 69 94 31 0.848 3.10 3.00 Prom - 61392 61353 40 -3.65 4.06 PlyA - 62748 62743 6 1.05 4.05 Term - 64328 63945 384 2 0 11 48 252 0.398 7.40 4.04 Intr - 73600 73490 111 1 0 73 64 161 0.121 11.86 4.03 Intr - 76853 76669 185 0 2 87 36 98 0.760 3.09 4.02 Intr - 79385 79257 129 0 0 61 110 39 0.464 3.15 4.01 Init - 85064 85001 64 2 1 59 53 31 0.306 -1.94 4.00 Prom - 86385 86346 40 -5.75 5.00 Prom + 89226 89265 40 -7.95 5.01 Init + 90803 90910 108 1 0 65 68 84 0.666 4.37 5.02 Intr + 99977 100115 139 1 1 90 101 115 0.646 12.12 5.03 Intr + 102267 102602 336 1 0 98 111 313 0.708 29.17 5.04 Term + 105523 105635 113 2 2 26 42 74 0.069 -5.66 5.05 PlyA + 105801 105806 6 1.05 6.06 PlyA - 106032 106027 6 1.05 6.05 Term - 107540 107434 107 0 2 45 47 154 0.866 4.59 6.04 Intr - 114560 114437 124 2 1 77 99 108 0.980 10.04 6.03 Intr - 118863 118768 96 0 0 73 98 61 0.966 4.89 6.02 Intr - 119125 119029 97 0 1 67 99 76 0.816 5.69 6.01 Init - 119951 119785 167 2 2 73 76 66 0.670 3.05 6.00 Prom - 124304 124265 40 -7.85 7.05 PlyA - 124829 124824 6 1.05 7.04 Term - 127879 127770 110 2 2 73 42 155 0.038 7.09 7.03 Intr - 134907 134820 88 0 1 51 75 106 0.045 4.22 7.02 Intr - 138062 138008 55 1 1 68 52 31 0.147 -4.44 7.01 Init - 138917 138775 143 2 2 79 100 69 0.567 6.89 7.00 Prom - 142864 142825 40 -4.25 8.00 Prom + 146887 146926 40 -6.35 8.01 Init + 147487 147616 130 0 1 49 91 23 0.451 -0.79 8.02 Intr + 149743 149963 221 2 2 85 94 333 0.585 30.70 8.03 Intr + 155180 155317 138 1 0 12 24 185 0.001 4.24 8.04 Intr + 160713 160864 152 2 2 53 21 107 0.000 -1.46 8.05 Intr + 169109 169369 261 2 0 92 36 166 0.209 7.48 8.06 Intr + 169792 169934 143 1 2 94 -15 112 0.124 0.58 8.07 Intr + 174005 174193 189 0 0 98 44 76 0.048 2.84 8.08 Intr + 182099 182192 94 0 1 67 1 114 0.030 -1.30 8.09 Intr + 183776 183857 82 2 1 101 44 103 0.518 5.92 8.10 Intr + 200729 200813 85 1 1 45 76 70 0.016 -0.03 8.11 Intr + 205573 205633 61 2 1 34 115 87 0.029 2.87 8.12 Intr + 207581 207775 195 2 0 59 76 66 0.070 0.11 8.13 Term + 207882 208080 199 0 1 -2 48 209 0.175 3.79 8.14 PlyA + 209940 209945 6 1.05 9.00 Prom + 210570 210609 40 -4.05 9.01 Init + 216929 216980 52 1 1 82 92 39 0.356 5.07 9.02 Intr + 217463 217629 167 0 2 73 19 108 0.329 1.16 9.03 Intr + 220867 220972 106 0 1 92 89 45 0.265 3.87 9.04 Term + 221860 222062 203 2 2 19 38 154 0.176 0.07 9.05 PlyA + 222164 222169 6 1.05 10.07 PlyA - 223848 223843 6 1.05 10.06 Term - 229050 228850 201 0 0 106 47 190 0.992 13.11 10.05 Intr - 233744 233643 102 2 0 27 88 85 0.119 1.85 10.04 Intr - 236054 235985 70 1 1 104 95 5 0.167 1.07 10.03 Intr - 238384 238283 102 0 0 77 69 81 0.638 3.47 10.02 Intr - 245933 245773 161 2 2 93 87 123 0.984 10.56 10.01 Intr - 248748 248667 82 2 1 109 73 61 0.883 5.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 155037 154895 143 2 2 97 39 72 0.981 2.28 S.002 Init - 155271 155081 191 0 2 76 35 200 0.900 10.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_1|92_aa MPHGFVILQYEKKKKKEKEEEKEEEEEEEKEKEKEKEKEKEKEKEKEKEKEKEKEKKKKK KKKKKKKKKKKKKKKKKKKKIVARISRYPKGK >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_1|279_bp atgccccatggctttgtaatacttcagtatgagaagaagaagaagaaggagaaggaggag gagaaggaggaggaggaggaggaggagaaggagaaggagaaggagaaggagaaggagaag gagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaaagaagaagaagaaa atagtggccagaatcagcagataccctaagggtaaatag >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_2|125_aa MPRKFEYIPGKKSNQDVTAKVVAKEMFTAKTGTTKYSPPVLDKGEQLRTRAQLVNAVSYL MLMLWTTGDFMSLLSPRSHHGLKDSFGFLRTWLCIPPFTSLFRGDFPLGIRLFILWNLEL HQEAG >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_2|378_bp atgcctaggaaatttgaatacattccaggaaagaagtccaatcaagacgttactgctaag gtcgtagcaaaggaaatgttcacagcaaagacagggacgaccaagtacagtccacctgtt ctggataagggcgagcagctgaggacacgtgctcaactagtcaatgctgtttcatatctg atgctgatgctgtggacaacaggggacttcatgtcattgttgtctcccaggtcacatcat gggctgaaagatagcttcggctttctgagaacatggctctgcatcccaccatttacttcc ctcttccgaggggattttcctcttggaatcaggctcttcattctgtggaatctggagctt caccaagaggcaggctga >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_3|181_aa MKFFGLYPKSSKGLSSGQLEREKVRADDLWGPVSYGVWEGAWKPAGGQGKLAASPNQDEN VVIVSLALPQIGQCFKRSVLELSVAREGASMGPNEWAHRAGSVSQKSRQEQEDRSGRGDP MVRAKGHSATRDVGGSATSSQWKSKEEGRLHMGSFYGSGLERLNNTSARIPLARNKSHSH P >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_3|546_bp atgaagttttttggactttatcccaaaagttccaaggggctctcatcagggcagctggag agggagaaagtaagagctgatgacctatgggggcctgtgagctatggggtctgggaaggt gcttggaaaccagctggaggtcaagggaaattagcagcctccccaaaccaggatgaaaac gtagttattgtttctttggcactcccccagattggtcaatgcttcaagcgttctgtcctt gagctatctgtggccagagaaggtgccagtatgggtcccaatgaatgggctcacagggca gggagtgtaagccagaagtcaaggcaagaacaagaggatcgcagtggcagaggagacccc atggtcagagccaaaggtcattcagctaccagggatgttggaggttctgcaacatctagc cagtggaaaagtaaagaggaaggaagattgcatatgggaagtttttatgggtcgggcctg gaaaggttgaacaacacttctgctcgtattccactggctagaaataagtcacatagccac ccttaa >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_4|290_aa MVKGKGGASISHGKKGSKREAENNKNDACDYLRVPITCQAFHMYHLYSDPHSHLPYECDD HLWAAPKQKLGEQHLYQGKDLVSNNPYDIINAFLDVIDQIVSPPLPHSYVEVLAPSTQNV AIFGNKPPAAAAAAAAAAAPLKSLVEGEKSTDKLSSRSVFGEGCTESPPPTKTYPALMSV ELRERNWIRVVANGDGEKRSASGCILKAVPVRFADELQMECKRKKGAKGDLKFVAGAMEK VEFPLTQKGKRLGGADLGSTARNLVLDMSNSIRALLPSPVLNRGLIIYCQ >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_4|873_bp atggtgaaaggcaaagggggagccagtatatctcatggaaagaaaggaagcaagagagag gcggaaaataataaaaatgatgcctgtgactatttgagggtgcccattacgtgccaagca tttcacatgtatcatctctactctgatcctcatagccacctgccatatgagtgtgatgac cacctttgggcagcccctaaacaaaaacttggagaacagcatttgtatcaaggtaaagat ctggtatcaaataatccatatgatattattaatgcctttttggatgttatcgaccaaatt gtgtcccctcccctgccccattcatatgttgaagtcctagctcccagtactcagaatgtg gctatatttggaaataagcctccagcagcagcagcagcagcagcagcagcagcagcgccc ctgaagtctcttgtggaaggtgagaaatctaccgataagctttctagtaggagtgtcttt ggggagggatgcacagagtcacctcctccaacgaagacttatccagctctaatgtcagta gagctgagggagagaaactggattagagtggtagcaaatggtgatggtgagaaacgatca gcttctggatgtattctgaaggcagtgccagtgcgatttgctgatgaactgcagatggag tgtaagaggaagaaaggagccaagggtgacttaaagtttgttgctggagcaatggagaaa gtggagtttccattaacccagaaggggaagaggctgggaggagcagatcttggaagcaca gccaggaatttggttttggacatgtcaaattccatcagggcgttgctcccatccccagtg ttaaacaggggtctcattatatattgccagtga >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_5|231_aa MGYHDACKEFSTVPSPVSSQQIIVLTVTIISGILPLGKKFECIAMEKMSRPLPLNPTFIP PPYGVLRSLLENPLKLPLHHEDAFSKDKDKEKKLDDESNSPTVPQSAFLGPTLWDKTLPY DGDTFQLEYMDLEEFLSENGIPPSPSQHDHSPHPPGLQPASSAAPSVMDLSSRASAPLHP GIPSPNCMQSPIRPVASRTPSAPDVTSKNVFSHYQMSPGHNTVTPGGEPLV >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_5|696_bp atgggttaccatgatgcatgcaaagaatttagcacagtgcctagcccagtcagtagtcag caaataatagttcttactgttacaataatcagtggcattcttcctctagggaaaaaattt gagtgcatcgcgatggagaaaatgtcccgaccgctccccctgaatcccacctttatcccg cctccctacggcgtgctcaggtccctgctggagaacccgctgaagctcccccttcaccac gaagacgcatttagtaaagataaagacaaggaaaagaagctggatgatgagagtaacagc ccgacggtcccccagtcggcattcctggggcctaccttatgggacaaaacccttccctat gacggagatactttccagttggaatacatggacctggaggagtttttgtcagaaaatggc attccccccagcccatctcagcatgaccacagccctcaccctcctgggctgcagccagct tcctcggctgccccctcggtcatggacctcagcagccgggcctctgcaccccttcaccct ggcatcccatctccgaactgtatgcagagccccatcagaccagttgcaagtaggaccccc agtgccccagatgtgacaagtaaaaatgtcttcagtcactaccaaatgtcccctgggcat aatacagtcactcccggtggagaaccactggtctag >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_6|196_aa MERHVKGSGECCLRNENNPWNPIKHDPAQMFTLSWWLQTKVLNGHSQDFGYTWDPSRPVP RLPNHGAGNQPSPANYGAAELMSDSQAQPMFNLEPTSPYCPTHSWIKSLSHSYLDPREHT ESSEAGISTLTYNGGNKCRRVIWLVDPTTFAGWPMTSSLVAGIRGEDKLYKKQKNVYLSP EPCRLPPLQDLLAPDE >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_6|591_bp atggaaagacatgtcaaaggctcaggggaatgctgcttaagaaatgaaaacaatccatgg aatcccatcaaacatgatcctgcacaaatgtttactctttcttggtggctacagaccaaa gtcctcaatggacactctcaagactttggatatacctgggatccaagtcgcccagtgccc aggcttcccaaccatggggcaggcaatcagccatcaccagctaattatggagcagctgag ttgatgagcgactcccaggcccagccaatgtttaacctggagcccacttcaccctattgc ccaacacactcttggatcaagtccttgtcacacagttacctggatcccagggagcacaca gaatcctctgaagcaggtatcagcaccctcacttacaatggagggaacaaatgcagaaga gttatatggctggtagatcccaccacctttgcgggttggccgatgacatcctctctggtg gcagggatccgtggagaagataaactctacaagaaacagaaaaatgtctacctgtctcca gaaccatgcagactccctccccttcaggatctcctagcccctgatgaatga >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_7|131_aa MTTNGRGTWPMSLLAIAPCDGYDSAPSTSSSLGREPLPRHVGSPCGRRPSAIILASSALD NEPYHLESTQNKKTKEKVLGNTDSLGTAEEEDSVKEHSGDRPLPSPAATCNGRSTGIPYN SSGTYSTSNLN >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_7|396_bp atgacaactaatggccgaggcacgtggcccatgagccttctcgccattgccccctgtgat ggttatgacagtgcaccttccacctcctcctccctgggccgggagcctctccccagacat gtggggtcaccgtgtggcagaagacccagtgccataatactggcttctagtgcactggac aatgagccctatcacctggagagcacacagaacaagaaaaccaaagaaaaagtcctgggg aacactgacagtttaggcacagctgaagaggaggactctgtaaaagagcacagtggcgat cggccgttgcccagtccagctgccacatgtaatggtcgttccaccggcatcccctataac tcctcaggcacatatagcacctcaaatcttaattaa >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_8|649_aa MALRYLWGETGPGFVSQVLCHLLFDTGEATYLVCGPVRFLHSAGQLLPANRNTPSPIDPD TIQVPVGYEPDPADLALSSIPGQEMFDPRKRKFSEEELKPQPMIKKARKVFIPDDLKDDK YWARRRKNNMAAKRSRDARRLKENQIAIRASFLEKENSALRQETIRSFREDGKEFPCSYA RAGWLMVWEWVVETLFHSVTSPTVHRLAPALCTSILWPGKGVRGRKEKGGSLMNFACEDD ALREGNMIPAHGIRIPPRLNTVVQLNSPSSVFHFGPFSNPVRDSGYNKQGDRFGDGKPYP GPSASSLGSIAETPSPFSQAPSPLQPPQNYRGKLTQERLHEKEPLKDQTSASPWGPQMCS LHQAEISKSRPVGPPAHSRGPSSALVACCHCAVPDAVLANHGLFPRPFYFPASSYDKVPV PFSQRETSSPQCETPFKAALLSMIGITDAPRFIGTVLRQTGNMSAVGLNRVEKGFVREKT AGKSKKVLRLPKKMRTEEDKDSQQADIDGISTMAKERDRSGGPGGNTGFLGQAQGPSAVR SLGTWCPASQLLQPWLKKANIELRPWLQRVQASSIGSFHMVLSLLSWRTSARTVQNENVG SEPPHRVPTGAPPSGAVRREPCPSDPGMVDPLTAYTLHLEKPQTLNASL >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_8|1950_bp atggcactaaggtatctttggggagaaacgggacctggttttgtttcgcaggtcctttgc catttgctgtttgatactggggaagccacgtaccttgtctgtggtccagttaggtttctc cactctgcaggtcagctgttgccagcaaaccgcaatacaccaagtcccattgatcctgac accatccaggtcccagtgggttatgagccagacccagcagatcttgccctttccagcatc cctggccaggaaatgtttgaccctcgcaaacgcaagttctctgaggaagaactgaagcca cagcccatgatcaagaaagctcgcaaagtcttcatccctgatgacctgaaggatgacaag tactgggcaaggcgcagaaagaacaacatggcagccaagcgctcccgcgacgcccggagg ctgaaagagaaccagatcgccatccgggcctcgttcctggagaaggagaactcggccctc cgccaggagactattcgttccttcagagaggacggaaaagaattcccctgctcatatgcc agagcaggatggctgatggtatgggaatgggtggtagagacgttgtttcatagtgtgact tcaccaacagttcatagactagcgccagctttgtgcacaagcatcctgtggcctgggaag ggggtgagaggaaggaaagaaaaaggagggtccctcatgaactttgcctgtgaagacgat gcactcagggaaggaaacatgattcctgcccatgggatcaggatcccgccacgtctgaac accgttgtacagcttaacagtccctcatctgtttttcattttggccctttcagcaaccct gtgagggactcagggtacaataaacaaggagacagatttggtgatggcaaaccttaccca ggcccatccgcctcttccctgggcagcattgcagaaactccttcacccttctctcaggcc ccatcccccctgcagcctccccagaactacagagggaagctgacacaagaaaggcttcat gaaaaggagcccttaaaagaccaaacctctgcatctccctgggggcctcagatgtgttcc ctgcaccaagctgaaatcagcaaatctcggccagttggacctccagcccactcacgtgga ccttcctcagccctggtagcctgctgccattgtgctgtgccagatgctgtgctggctaac catgggctgtttccaagacctttctatttcccagcctcctcatacgacaaagttcctgtg cctttcagccagagggaaacatcttcaccccagtgtgaaaccccatttaaagctgccttg ctgtcaatgattggaatcacagatgctccacgcttcattggaaccgtcctgagacagaca ggaaacatgtcagcagtgggtctcaatagggtggagaagggttttgttcgggaaaaaaca gcaggcaaatcaaagaaagtactgaggctcccaaagaaaatgagaacagaagaggacaaa gacagccagcaggcagatattgatggcatcagcaccatggcaaaggagagagacagatct ggaggcccagggggaaacacaggtttcttgggccaggcccagggtcccagtgctgtgcgc agtctagggacttggtgccctgcatcccagctgctccagccatggctgaaaaaggccaac atagagctcaggccatggcttcagagggtgcaagcgtcaagcattggcagcttccacatg gtgttgagcctgctgtcatggagaacctctgctaggacagtgcaaaatgaaaatgtgggg tcagagcccccacacagagtacctactggggcaccccctagtggagctgtgagaagagag ccatgtccttcagatcccggaatggtagatccactgacagcttacaccttgcacctggaa aagccgcagacactcaatgccagtctgtga >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_9|175_aa MGKWVESQGTMDTPRFKDEEIKAQHGQVNCLSLSSPLIARSHSPKRICEEPFISHWLELL MPISENKDRLGPDPCEVLAHPSPSPMIVSFLRAPQKQKLLGFLYSLQNADAQMNNTKYGT LVEKTGKSASLSTWSCTLSHSLATVIYLCDANKQIDKSSPKRQIQIKEVGGQRII >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_9|528_bp atgggaaagtgggtggagagccaagggaccatggatacaccaaggtttaaagatgaggaa atcaaggctcagcatggccaagtcaactgcttgtctctttcctctccactgattgccagg tcacattccccaaaacggatctgtgaagagccttttatttctcactggctcgagctccta atgccgataagtgaaaataaagatagacttgggccagatccatgtgaagtgctggctcac ccttcaccttcccccatgattgtaagtttcctgagggctccccagaagcagaagctctta ggcttcctgtacagcctgcagaatgcagatgcacaaatgaataacacaaagtacgggacc cttgttgaaaagacaggtaaatcagcatcactttctacttggtcatgcacactgtcacat tcactggccacagtcatctatctgtgtgatgccaacaaacagattgacaaaagctcccca aaacgtcaaatccagattaaggaagttggggggcagagaatcatctag >gi568815581f:55165485_55420876|GENSCAN_predicted_peptide_10|239_aa XFMNHRAPANGRYKPTCYEHAANCYTHAFLIVPAIVGSALLHRLSDDCWEKITAWIYGMG LCALFIVSTVFHIVSWKKSHLRLNLRELGPLASHMRWFIWLMAAGGTIYVFLYHEKYKVV ELFFYLTMGFSPALVVTSMYRTVILEGRETKKMSFEFTPVVSLETLSKQQQKENNTDGLQ ELACGGLIYCLGVVFFKSDGIIPFAHAIWHLFVATAAAVHYYAIWKYLYRSPTDFMRHL >gi568815581f:55165485_55420876|GENSCAN_predicted_CDS_10|720_bp nngttcatgaaccatcgagctccagccaatggccgctacaagccaacttgctatgaacat gctgctaactgttacacacacgcattcctcattgttccggccatcgtgggcagtgccctc ctccatcggctgtctgatgactgctgggaaaagataacagcatggatttatggaatggga ctctgtgccctcttcatcgtttctacagtatttcacattgtatcatggaaaaagagccac ttaaggttaaatcttcgtgaacttggacccctggcatctcatatgcgttggtttatctgg ctcatggcagctggaggaaccatttatgtatttctctaccatgaaaaatataaggtggtt gaactctttttctatctcacaatgggattctctccagccttggtggtgacatcaatgtac aggactgtgatccttgaaggaagagaaaccaagaaaatgagctttgaattcaccccagta gtttccctggagacactttccaaacagcaacaaaaggagaacaacaccgatggacttcag gaacttgcctgtgggggcttaatttattgcttgggagttgtgttcttcaagagtgatggc atcattccatttgcccacgccatctggcacctgtttgtggccacggcagctgcagtgcat tactacgccatttggaaatacctttaccgaagtcctacggactttatgcggcatttatga