GENSCAN 1.0    Date run: 5-Nov-116    Time: 13:38:09

Sequence gi568815581r:31763362_32001627 : 238266 bp : 44.18% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2094   2247  154  2  1   70   12   110 0.188   1.74
 1.02 Intr +   4696   4853  158  2  2   66   66    56 0.148   0.93
 1.03 Term +  17353  17529  177  0  0   94   33   108 0.498   3.59
 1.04 PlyA +  18307  18312    6                               1.05

 2.00 Prom +  19480  19519   40                              -4.96
 2.01 Sngl +  20751  21563  813  2  0   43   43   243 0.947  11.08
 2.02 PlyA +  23194  23199    6                               1.05

 3.03 PlyA -  23601  23596    6                               1.05
 3.02 Term -  24009  23730  280  2  1   78   37   140 0.229   2.82
 3.01 Init -  30934  30918   17  1  2   75  115    -1 0.207   0.99
 3.00 Prom -  35721  35682   40                              -3.96

 4.03 PlyA -  35834  35829    6                               1.05
 4.02 Term -  39472  39220  253  0  1  116   44   199 0.970  13.41
 4.01 Init -  49601  49594    8  2  2  114   91     0 0.843   3.40
 4.00 Prom -  75522  75483   40                              -3.66

 5.19 PlyA -  75738  75733    6                               1.05
 5.18 Term -  88947  88778  170  1  2   83   48   194 0.999  12.94
 5.17 Intr -  89669  89451  219  0  0   70   78   199 0.914  15.37
 5.16 Intr -  95150  95119   32  1  2   86   97    45 0.753   2.97
 5.15 Intr -  95937  95453  485  0  2   73   51   118 0.050  -1.58
 5.14 Intr - 104751 104685   67  2  1   90  108    34 0.855   4.51
 5.13 Intr - 110126 110017  110  2  2   72   73    82 0.948   4.18
 5.12 Intr - 112052 111873  180  2  0   62   82   123 0.660   9.16
 5.11 Intr - 114966 114889   78  0  0   89   91    39 0.511   4.05
 5.10 Intr - 117393 117192  202  2  1   57   46    73 0.176  -0.71
 5.09 Intr - 121144 121063   82  2  1   82   80    71 0.429   4.40
 5.08 Intr - 122700 122619   82  0  1   38  100    61 0.931   1.51
 5.07 Intr - 123944 123875   70  1  1   12   79   111 0.382   1.78
 5.06 Intr - 126042 125924  119  0  2   75   81    84 0.382   5.66
 5.05 Intr - 129433 129386   48  1  0   91   92    12 0.613   0.78
 5.04 Intr - 131376 131284   93  0  0   96   87    51 0.965   5.96
 5.03 Intr - 131650 131609   42  1  0   75   98    35 0.756   1.64
 5.02 Intr - 136369 136285   85  0  1   66  108    59 0.998   5.52
 5.01 Init - 138266 138175   92  2  2   59   41   136 0.975   5.96
 5.00 Prom - 141625 141586   40                              -3.06

 6.00 Prom + 145413 145452   40                              -5.06
 6.01 Init + 173886 174159  274  2  1  105   70   248 0.160  19.87
 6.02 Intr + 213160 213253   94  2  1   30   95    50 0.031  -1.08
 6.03 Intr + 219638 219743  106  2  1   83   86    48 0.159   4.32
 6.04 Intr + 231203 231360  158  1  2   76   96   122 0.918  10.61
 6.05 Term + 235297 235642  346  1  1   79   48   119 0.567   0.97
 6.06 PlyA + 237418 237423    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_1|162_aa
MEQANHPVGLDISVVYKDTLKKIVQQETSCPFTHVHYAEGITGRHTAPEDEGWLMTEEGK
ILIPEASQWKILKTLHQIFHMGIENTHQMAKSLFTGPNLQTIRQKEHQEAHYPGLPPLFV
SGLKAVNEKYSNKFWKAKEKPFFSKSCGLTYGQMAGMKFAET

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_1|489_bp
atggagcaggccaatcacccagtggggcttgatatcagtgtggtttacaaggacacctta
aaaaagattgtccaacaagaaacaagctgccccttcacccatgtccactatgctgaggga
atcactggaaggcacactgccccagaggatgaagggtggttaatgacagaagaaggaaag
atacttatacctgaagccagccagtggaaaatacttaaaacgctccaccaaatttttcat
atgggtattgaaaacactcatcaaatggccaaatccctgtttacagggccaaatctccag
accatccgacagaaggaacaccaagaagcacattacccaggacttcctcctttgtttgtt
tctgggttaaaggctgttaatgagaagtattcaaacaagttttggaaggcaaaagagaag
ccatttttctccaaaagttgtgggctgacttatgggcagatggcgggcatgaagtttgca
gaaacttag

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_2|270_aa
MNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHM
IISVDAEKAFDKVQQHFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGLKLEAFPLKT
GTRQGCPLSLLLFNIVLEVLARAIRQEKEINCIQLGKEEVKLPLFADDMIVYLENPVVSA
PNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR
DVKDLFKENYKPLLNEIKEDTNKCKNIPCS

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_2|813_bp
atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac
atacacaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg
attatctcagtagatgcagaaaaggcctttgacaaagttcaacaacacttcatgctaaaa
actctcaataaattaggtattgatgggacgtatctcaaaataataagagctatctatgac
aaacccacagccaatatcatactgaatgggctaaaactggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcacttctcctattcaacatagtgttggaagttctg
gccagggcaatcaggcaggagaaggaaataaattgtattcaattaggaaaagaggaagtt
aaattgcccctgtttgcagatgacatgattgtatatctggaaaaccccgtcgtctcagcc
ccaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaacgtg
caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg
gatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggat
acaaacaaatgcaagaacattccatgctcatag

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_3|98_aa
MGACGRQLGGSKELKQVKHCETGTLNEYWLLSSSSSSSSSSSSSSSLLLLLLQDTYMTWD
NCYIYRSSPRIKCFFVLEALGQFDNNGLCIFKYEKYLE

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_3|297_bp
atgggtgcatgtggaaggcagttaggaggatcaaaggagttgaaacaagtaaaacactgt
gagaccggcacactcaatgaatattggctcttatcatcatcatcatcatcatcatcatca
tcatcatcatcatcatcattattactattacttctccaagacacttacatgacttgggat
aattgttacatttatagaagcagccccaggattaaatgcttcttcgtcttggaagccttg
ggccaatttgataacaacgggctttgtattttcaaatatgagaaatatctagaataa

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_4|86_aa
MPREVPLMSYEEVRTQVPSLPWVWDIDLGQAPPIRCVHTDTGSSGSPREKQAVAGAAMFA
GVDGNKGAQLLVVTAGGSKFMFLVLT

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_4|261_bp
atgcccagggaagttcctcttatgtcttatgaggaagtcagaactcaagttcccagcctg
ccctgggtctgggacatagacctgggccaggctccaccaatcagatgcgtccatactgac
actggttcaagtgggagccccagggagaaacaggctgtggctggagcagccatgtttgct
ggtgtggatggcaacaagggtgcccagcttttggtggtgactgccggtggtagcaagttc
atgttcctggtgctgacatga

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_5|751_aa
MAEIIQERIEDRLPELEQLERIGLFSHAEIKAIIKKASDLEYKIQRRTLFKEDFINYVQY
EINLLELIQRRRTRIGYSFKKDEIENSIVHRVQGVFQRASAKWKDDVQLWLSYVAFCKKW
LCGLWQPNGKWKIDCLQKAQGNYFFAHCAFIQSAQNFIKKMELMHAEKLRKEKEEFEKAS
MDVENPDYSEEILKGELAWIIYKNSVSIIKGAEFHVSLLSIAQLFDFAKDLQKEIYDDLQ
ALHTDDPLTWDYVARRELEIESQTEEQPTTKQAKAVEVGRKEERCCAVYEEAVKTLPTGE
LHQTMRLERTMTVFRKAHELKLLSECQYKQLSVSLLCYNFLREALEVAVAGTELFRDSGT
MWQLKLQVLIESKSPDIAMLFEEAFVHLKPQKALLAVIGADSVTLKNKYLDWAYRSGGYK
KARAVFKSLQESRPFSVDFFRKMIQFEKEQADWRFCGGSSGGQPGRRVTWPTASRATSRG
PQGMDLQAAGAQAQGAAEPSRGPPLPSARGAPPSPEVSAAWGPGATASRSLGREVAGPGW
AGAAGPGKRLWPASGRPATRSAVWARARAENGQAAGPRGCGAGKAEGVGGRSGSAETAVR
PLGAAADLVNGSLYRLGAERLEARGTQSIPNDSPARGEGTHSEEEGFAMDEEDSDGELNT
WELSEGTNCPPKEQPGDLFNEDWDSELKADQGNPYDADDIQESISQELKPWVCCAPQGDM
IYDPSWHHPPPLIPYYSKMVFETGQFDDAED

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_5|2256_bp
atggcagagataattcaggaacgcatagaagatcggctcccggaattggaacagctggag
cgcattggactgttcagtcatgcggagattaaggctatcattaagaaggcttccgatcta
gagtacaaaatccagagaagaacccttttcaaggaagactttatcaattatgttcaatat
gaaattaatcttttggagctgatccagagaagaagaacacgcattggatattcatttaag
aaggatgagattgagaattctattgtacaccgggtacaaggtgttttccagcgtgcctca
gcaaaatggaaagacgatgttcaactttggctctcctatgtggctttttgtaagaagtgg
ctttgtggattatggcagccaaatgggaaatggaagatcgattgtcttcagaaagcgcaa
ggcaactatttcttcgcgcactgcgctttcatccagagtgcccaaaactttataaagaag
atggagctgatgcatgctgaaaaactgaggaaggagaaggaagaatttgaaaaagccagt
atggatgtggagaatcctgattattctgaagaaatccttaagggcgagttggcatggatc
atctacaaaaattctgtaagcataattaaaggtgcagaatttcacgtgtcactgctttcg
attgcacagctatttgactttgccaaagatctacaaaaagagatttatgatgaccttcag
gctctacacacagatgatcctctcacttgggattatgtggcaaggcgagaattagagatt
gagtcacagacagaagagcagcctacaacgaaacaagccaaagcagtggaggtcggccgg
aaggaggagaggtgctgtgctgtgtatgaagaggcagtgaagactctgccaacaggtgaa
cttcaccaaaccatgaggttggaaagaaccatgactgtattcaggaaggcacatgaactg
aagcttctgtcagaatgccaatacaagcagttgagtgtttcgttgctgtgttataacttc
ctgagggaagctctggaagtggcagtagctggaactgaattgtttagagactctgggaca
atgtggcagctgaagctgcaggtgctgatcgagtcaaagagccctgacatagccatgctt
tttgaagaagcctttgtgcacctgaaaccccagaaagctctcttagctgtcataggtgcc
gactcagtaaccctgaagaataagtacctggattgggcttatcgaagtggtggctacaaa
aaggccagagctgtgtttaaaagtttacaggagagccgaccattttcagttgactttttc
aggaaaatgattcagtttgaaaaggagcaagctgattggcgcttctgcggcggatcctcg
ggcgggcagccgggccggcgcgtcacgtggcccacggcgtccagggcgaccagccgcggg
ccgcagggcatggaccttcaggccgccggggcccaggcgcagggggccgcggagccgtct
cggggcccgccgctgcctagcgcgcggggggcgccccccagcccggaggtgagcgccgcg
tgggggccaggagcaacagccagccgcagcctggggcgggaagtcgcggggccgggctgg
gcgggggccgcggggccggggaagcgcctctggccggcctcgggccgccccgcgacacgg
agcgctgtttgggctcgcgctcgagctgaaaacggccaggccgcggggccgcggggctgc
ggggcggggaaagccgagggcgtgggtgggcgctctgggtcagcagagacggctgtccgc
ccgctgggcgccgctgcggatttggtaaatgggagtctgtaccggctaggtgctgagcgg
cttgaagcccgtggaacacagagcattcctaatgacagtcctgcccggggtgagggcacc
cattctgaagaggaaggctttgccatggatgaggaggactctgatggagaactgaatacc
tgggagctgtcagaagggacaaactgtccacccaaggaacagcctggcgatctttttaat
gaggactgggactcggagttgaaagcagatcaagggaatccatatgatgctgacgacatc
caggagagcatttctcaagagcttaaaccttgggtgtgctgtgccccacaaggagacatg
atctatgaccccagctggcaccatccgcctccactgataccctattattccaagatggtc
tttgaaacaggacagtttgacgatgctgaagattga

>gi568815581r:31763362_32001627|GENSCAN_predicted_peptide_6|325_aa
MAPQKHGGGGGGGSGPSAGSGGGGFGGSAAVAAATASGGKSGGGSCGGGGSYSASSSSSA
AAAAGAAVLPVKKPKMEHVQADHELFLQAFENVNEELPARRKRNREDGEKTFVAQMTVFD
KNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKYHPKGARIDVSINECYDGSYAG
NPQDIHRQPGFAFSRNGPVKRTPITHILVCRFIADNQMNHACMLFVENYGQKIIKKNLCR
NFMLHLVSMHDFNLISIMSIDKAVTKLREMQQKLEKGESASPANEEITEEQNGTANGFSE
INSKEKALETDSVSGVSKQSKKQKL

>gi568815581r:31763362_32001627|GENSCAN_predicted_CDS_6|978_bp
atggcgcctcagaagcacggcggtgggggagggggcggctcggggcccagcgcggggtcc
gggggaggcggcttcgggggttcggcggcggtggcggcggcgacggcttcgggcggcaaa
tccggcggcgggagctgtggagggggtggcagttactcggcctcctcctcctcctccgcg
gcggcagcggcgggggctgcggtgttaccggtgaagaagccgaaaatggagcacgtccag
gctgaccacgagcttttcctccaggcctttgagaatgtcaatgaagagcttccagccaga
agaaaacgaaatcgtgaggatggggaaaagacatttgttgcacaaatgacagtatttgat
aaaaacaggcgcttacagcttttagatggggaatatgaagtagccatgcaggaaatggaa
gaatgtccaataagcaagaaaagagcaacatgggagactattcttgatgggaagtatcat
ccaaaaggtgctaggatagatgtttctatcaatgagtgttatgatggctcctatgcagga
aatcctcaggatattcatcgccaacctggatttgcttttagtcgcaacggaccagttaag
agaacacctatcacacatattcttgtgtgcaggtttattgctgacaatcaaatgaatcat
gcctgtatgctgtttgtagaaaattatggacagaaaataattaagaagaatttatgtcga
aacttcatgcttcatctagtcagcatgcatgactttaatcttattagcataatgtcaata
gataaagctgttaccaagctccgtgaaatgcagcaaaaattagaaaagggggaatctgct
tcccctgcaaacgaagaaataactgaagaacaaaatgggacagcaaatggatttagtgaa
attaactcaaaagagaaagctttggaaacagatagtgtctcaggggtttcaaaacagagc
aaaaaacaaaaactctga