GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:17:37 Sequence gi568815580r:6362147_6563010 : 200864 bp : 40.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 981 1053 73 2 1 48 92 58 0.502 0.26 1.02 Intr + 4742 4886 145 0 1 59 50 187 0.619 10.42 1.03 Term + 11214 11319 106 0 1 77 50 68 0.226 -1.20 1.04 PlyA + 11690 11695 6 1.05 2.02 PlyA - 15017 15012 6 1.05 2.01 Sngl - 17929 17255 675 1 0 76 48 189 0.754 9.64 2.00 Prom - 18879 18840 40 -6.75 3.03 PlyA - 19009 19004 6 1.05 3.02 Term - 20247 19921 327 0 0 59 38 273 0.714 13.12 3.01 Init - 31683 31564 120 0 0 73 69 136 0.625 10.44 3.00 Prom - 33085 33046 40 -6.45 4.00 Prom + 33295 33334 40 -6.75 4.01 Init + 34184 34285 102 1 0 83 47 80 0.600 3.69 4.02 Intr + 36723 36836 114 2 0 65 68 66 0.416 2.02 4.03 Intr + 46926 46983 58 2 1 90 89 37 0.384 1.54 4.04 Intr + 50423 50660 238 0 1 70 84 159 0.334 9.45 4.05 Term + 51420 51579 160 0 1 86 54 87 0.436 1.53 4.06 PlyA + 53211 53216 6 1.05 5.05 PlyA - 53397 53392 6 1.05 5.04 Term - 58480 58374 107 2 2 83 49 137 0.733 6.89 5.03 Intr - 60225 60038 188 2 2 5 30 136 0.095 -2.09 5.02 Intr - 67461 67316 146 0 2 -17 90 110 0.144 -1.14 5.01 Init - 71382 71293 90 0 0 61 116 89 0.930 9.54 5.00 Prom - 75425 75386 40 -2.95 6.03 PlyA - 75624 75619 6 1.05 6.02 Term - 76021 75800 222 1 0 69 43 152 0.611 4.73 6.01 Init - 87477 87367 111 0 0 88 85 26 0.036 2.56 6.00 Prom - 94056 94017 40 -5.75 7.03 PlyA - 94275 94270 6 1.05 7.02 Term - 100391 99998 394 1 1 49 36 537 0.410 38.12 7.01 Init - 100864 100698 167 1 2 79 66 253 0.742 21.35 7.00 Prom - 101688 101649 40 -6.95 8.02 PlyA - 101809 101804 6 1.05 8.01 Sngl - 102659 102450 210 2 0 42 41 212 0.725 6.84 8.00 Prom - 104884 104845 40 -4.55 9.03 PlyA - 105026 105021 6 1.05 9.02 Term - 123103 122976 128 2 2 108 44 148 0.773 9.86 9.01 Init - 132342 132300 43 0 1 59 52 63 0.309 0.43 9.00 Prom - 137951 137912 40 -7.85 10.00 Prom + 142369 142408 40 -3.85 10.01 Init + 142506 142776 271 2 1 48 62 117 0.098 2.28 10.02 Intr + 147388 147526 139 2 1 136 58 91 0.118 9.50 10.03 Intr + 149258 149464 207 2 0 9 110 141 0.101 5.67 10.04 Intr + 164881 165028 148 1 1 109 47 98 0.563 7.12 10.05 Intr + 174386 174479 94 1 1 69 92 70 0.705 4.12 10.06 Term + 177719 177900 182 0 2 43 43 182 0.714 5.99 10.07 PlyA + 178342 178347 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 89234 89554 321 1 0 58 48 174 0.891 6.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_1|107_aa WEKEGEKNRQKTVKSPFPEFVNCGRRSPFTLAKETVEQISALERGAAAVPGSDLTHRTQA GHKTQVPRKAHSRFNRSKVPSIPTCIKTQTNTPLDVLDIDSSLADLL >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_1|324_bp tgggaaaaagaaggagagaaaaatcgccaaaaaacggtcaagtctccgtttcctgagttc gtaaattgtggcagaagatctccttttactctagccaaagagactgtggagcagatctct gcattagaaagaggagcagcagcagtgcctggcagtgacctgactcaccgaacccaagca gggcacaaaacacaagtccctaggaaagcccacagcaggtttaacaggagtaaagtccct tccattcctacttgcataaaaacgcagaccaacactccactagacgtccttgacattgac tcctctctagctgatctgctgtag >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_2|224_aa MDKFLHIYTLPRLNQEEVESLKRPITNSEIEAVINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKRFQSIEKERILPNSFYEASIIPIPKPGRDKTEKEIFIPISLMNIDAKILNKILA KRILQHIEKLIHHDQVGFISGMQGWFNICKSINVIHHINRTNDKNHMIISIDAEKALDKI QLAFMLKTFNKLGIDGTYIKIIRAMYDKPTANIILNGQKLEHFL >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_2|675_bp atggataaattcctgcacatatacaccctcccaagactaaaccaggaagaagtcgaatcc ctaaagagaccaataacaaattctgaaattgaggcagtaattaatagcctaccaaccaaa aaaagtccaggaccagacggattcacagctgaattctaccagaggtacaaggaggagctg gtaccattccttctgaaacgattccaatcaatagaaaaagagagaatcctccctaactca ttttatgaggccagcatcattccgataccaaaacctggcagagacaaaacagaaaaagaa attttcattccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactggca aaacgaatcctgcagcacatcgaaaagcttatccaccacgatcaagtcggcttcatatct gggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccatcacataaacaga accaatgacaaaaaccacatgattatctcaatagatgcagaaaaggccttggacaaaatt caacttgccttcatgctaaaaactttcaataaactaggtatcgatggaacatatatcaaa ataataagagctatgtatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaacatttcctttga >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_3|148_aa MDLGFMSITGRGGQEPAGEAMITIWVGDDVAPTRAVAASVDHNSSPARGQHWMENESDEL TEVGFRRWVITNSSELKEHALTQCKEAKNLEKRLDKLLTRIISLEKNINELMELKSTARE LREAYTSINSQIDQVEKGYQRLKINLMK >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_3|447_bp atggacctaggctttatgagtatcactggccgtggaggccaagagcctgctggagaggct atgatcacaatctgggtgggagatgatgtagctccaaccagggcagtagcagcgtctgtg gatcacaactcctcgccagcaaggggacaacactggatggagaatgagtctgatgaattg acagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaggagcatgct ctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagacaaattgctaactaga ataatcagtttagagaagaacataaatgaactgatggagctgaaaagcacagcacgagaa cttcgtgaagcatacacaagtatcaatagccaaattgatcaagtagaaaaaggatatcag agattgaagatcaacttaatgaaataa >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_4|223_aa MQDLGIPVEGAPGRDGTAPSAAQTPGASSATTHQATLPTQGICQTWKPASEDPKGAPQCL SSNPKTTSHFLLTSKLLALDLDLNAICYTAPRRTRGLSISKSSFRVSLISLTVAFITSCL CSAVVCSHTDSVLRVFVEHALTHVPSPMPSKGGLLPPKRSKKCRLRYKPHRWNRLENRRA TLHLFQPRTETGACHKHLLEQLQQKDPGDGPPASHSGVQTSVK >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_4|672_bp atgcaggaccttgggatcccagtggaaggtgcccctggcagggatggcacagccccttct gcagcccagacacccggagcctcatctgctaccacccaccaggccactttacctactcag gggatctgccagacctggaagccagcctctgaggaccccaagggagcaccccagtgtttg tccagcaatccaaaaacaaccagccactttctgctgacatccaaattgttggccctggat ttagatcttaacgccatttgttacacagcaccaagaagaactcgagggctttccatcagc aaatcttccttcagagtcagcctgatctccctcactgttgcttttatcacctcctgcctc tgctcagcagttgtctgctcacatacggactcggtcctacgagtatttgttgagcatgca cttacccacgtgccaagtcccatgccaagcaaaggaggcttattaccaccaaaaaggagt aagaagtgtcgtttaagatataaaccacacaggtggaatagattagaaaaccgaagagcc acactacacttatttcagccaaggactgaaactggagcctgccacaagcatcttctagaa cagctccaacagaaggatccaggagatgggccaccagcaagtcatagtggagtccaaaca agtgtgaagtga >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_5|176_aa MKAEEYPPDWRAVIKNRCEYGDVCIELEPKGEEEEERNKRIREKKMKEDLDMSNVVKNSC RISRQQDGNLSVQLQRMSGQNLIQSKALALFNSMKAERGGEAAEEKFGAGRGWVMRFKER SCLHNIKVQGETASADAEAAAEIATATPVFRKHQLDQSAAINIKTRLSMGKEIMTC >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_5|531_bp atgaaagcagaagaatatcctccagattggagggctgtaataaaaaaccgatgtgagtat ggagatgtatgcattgagttggaacccaagggggaggaggaggaggaaagaaataaaagg atcagagagaaaaaaatgaaggaggatttggatatgtcgaatgtggtgaaaaattcatgc agaattagcaggcagcaggatggaaatttgagtgtgcaactgcagagaatgtcaggccaa aacctgatccagagtaaggccctagctcttttcaattctatgaaggctgagagaggtggg gaagctgcagaagaaaagtttggagctggcagaggttgggtcatgaggtttaaggaaaga agctgtctccataacataaaagtgcaaggtgaaacagcaagtgctgatgcagaagctgca gcagaaattgccacagccaccccagtcttccgcaagcaccaactcgatcagtcagcagcc atcaacatcaagacaagactttccatgggcaaagagattatgacttgctga >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_6|110_aa MALALVEKDREQMNQLCTMFELISAREKIKQAQGDGSSLTFLPAGGGHEPQHWVWKWAEE HWEALPLLASLSSSSSEQRSDVSYHCSLHITLGDEHEGKAKLLTETLLLT >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_6|333_bp atggcgcttgcattggtagagaaagaccgtgaacaaatgaatcaattatgtactatgttt gagctgataagtgcaagggagaaaataaaacaggcacagggagatgggtcgagtttgacc ttccttccggctggaggaggacacgagccacagcactgggtgtggaagtgggctgaagag cactgggaagctttgcctcttctggcatccctctcttcgtcctcctctgagcagagaagt gatgtctcatatcactgcagccttcatataaccttgggagacgagcatgaaggtaaagcc aagctcctcacagagacgctgcttctgacctga >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_7|186_aa MAGEKVEKPDTKEKKPEAKKVDAGGKVKKGNLKAKKPKKGKPHCSRNPVLVRGIGRGKRV VFLKQLASGLLLVTGPLVLNRVPLRRTHQKFVIATSTKIDISNVKIPKHLTDAYFKKKKL RKPRHQEGEIFDTEKEKYEITEQRKIDQKAVDSQILPKIKAIPQLQGYLRSVFALTNGIY PHKLVF >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_7|561_bp atggcaggtgaaaaagttgagaagccagatactaaagagaagaaacccgaagccaagaag gttgatgctggtggcaaggtgaaaaagggtaacctcaaagctaaaaagcccaagaagggg aagccccattgcagccgcaaccctgtccttgtcagaggaattggcaggggcaagagggtg gttttcctgaagcagctggctagtggcttattacttgtgactggacctctggtcctcaat cgagttcctctacgaagaacacaccagaaatttgtcattgccacctcaaccaaaatcgat atcagcaatgtaaaaatcccaaaacatcttactgatgcttacttcaagaagaagaagctg cggaagcccagacaccaggaaggtgagatcttcgacacagaaaaagagaaatatgagatt acggagcagcgcaagattgatcagaaagctgtggactcacaaattttaccaaaaatcaaa gctattcctcagctccagggctacctgcgatctgtgtttgctctgacgaatggaatttat cctcacaaattggtgttctaa >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_8|69_aa MKHAKEQENMALSQKKFAEILHEEVQMLDSLDKDKSTVLHMLKELKENRRVIHEQIVKIN EEKEVTNKS >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_8|210_bp atgaagcatgcaaaggaacaagaaaatatggccctttcacagaagaaatttgcagaaatt ttacatgaagaagtacagatgttggattcactagacaaagacaaatcaactgtcttacat atgctcaaggagctaaaggagaacagaagagttattcatgagcaaatagtgaaaattaat gaggagaaagaagttacaaacaagagctaa >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_9|56_aa MRDLPLPVTWTPFTGSKQSIEEKVLMEGEAGLDALCGNSRGLTAEVEEFQGAAKTA >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_9|171_bp atgagggatttgccgctgccggtaacttggacaccttttactggcagtaaacagtccata gaggaaaaagtgctcatggagggagaggcaggcttggatgctctatgtggaaactccaga gggctaacagcagaggttgaagagttccaaggcgctgctaagaccgcctaa >gi568815580r:6362147_6563010|GENSCAN_predicted_peptide_10|346_aa MSLISMAMRPRTSGVTPDNKAGSEGRKEGRRETGRKEGERVRDRRKEEREEGGKEGRKEG RKEDKQVSKRHLCWAGLENCRAASSPSSVTGAEIAMSLFLKLSAVSLPSVTPGDRLSESP VKAEASFGCSSGFRDANYQDEYFKALWVLGMSSALSSCNNRGHHTLMYRHGASASLPWGL GKPGATMGSKYPWINTDSCFSHRSSGLRHQDDIASEQKTKYTNKHQKVLLPVRMQICEGL LGLDFQIADWTSFSCCFFAVLSPKKYASQLYPLNYGDTESIPNGSVDQRTTTEEAGDQLE GYCNEPGDCDTGGKWPDSGSILKAELMSFERKKGMKDDSKAFHLSS >gi568815580r:6362147_6563010|GENSCAN_predicted_CDS_10|1041_bp atgtccttgatctccatggctatgaggccaagaacctcaggtgtcaccccagacaacaag gctggttcagaaggaaggaaagaaggaaggagagaaacaggaaggaaggaaggagagaga gtgagggatagaaggaaggaggaaagggaggaaggagggaaggaaggaaggaaggaaggg aggaaggaagacaagcaggtaagcaagcgacacctctgctgggcaggcctggagaactgc agagctgcttcaagtcccagctctgtcacaggggctgagattgcaatgtcactgttctta aaactgtctgctgtgtcacttccgagtgtaacccctggagacaggctctcagaatcacct gtgaaggcggaagccagttttggctgctccagcggatttagggatgccaactaccaggat gagtatttcaaagctctctgggtcctggggatgagcagtgccctctcatcctgtaataat cgtggacatcacacactgatgtacagacatggtgccagcgcctcccttccttggggtctt ggaaaacccggtgccacaatgggatctaagtatccttggatcaatacggattcgtgtttc tcacaccgaagttcagggttgagacatcaagatgacattgcttcagaacagaaaaccaag tatacaaataagcaccaaaaagttttacttcctgtgagaatgcaaatatgcgaaggactc ctaggtttggacttccagattgcagattggacttccttctcgtgttgtttctttgctgtc ctctcccccaaaaaatatgcctcacaactttatccccttaactatggggatactgagagc attcccaatggctccgtggatcaaaggacaacaactgaggaagcgggtgaccagttggaa ggctattgcaacgagccaggtgactgtgacacaggtggtaaatggccggattcaggatcg attttaaaggcagagctgatgagtttcgagagaaagaaaggaatgaaggatgattccaaa gctttccacctgagcagttag