GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:57:41 Sequence gi568815597f:50870369_51074267 : 203899 bp : 39.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 926 921 6 1.05 1.01 Sngl - 18754 17966 789 1 0 64 38 286 0.401 16.87 1.00 Prom - 20232 20193 40 -8.25 2.02 PlyA - 20432 20427 6 -0.45 2.01 Sngl - 20842 20492 351 1 0 72 36 236 0.618 12.60 2.00 Prom - 22908 22869 40 -6.05 3.02 PlyA - 27196 27191 6 1.05 3.01 Sngl - 36445 35339 1107 1 0 58 49 488 0.930 38.52 3.00 Prom - 36931 36892 40 -11.44 4.02 PlyA - 37378 37373 6 1.05 4.01 Sngl - 38119 37463 657 1 0 49 41 256 0.454 12.92 4.00 Prom - 38212 38173 40 -10.15 5.03 PlyA - 38383 38378 6 1.05 5.02 Term - 39572 39334 239 0 2 80 54 232 0.964 14.35 5.01 Init - 48185 48053 133 2 1 78 47 25 0.048 -2.25 5.00 Prom - 48309 48270 40 -3.65 6.06 PlyA - 48994 48989 6 1.05 6.05 Term - 62330 62055 276 2 0 -19 42 247 0.076 3.88 6.04 Intr - 74868 74725 144 0 0 54 84 59 0.016 1.66 6.03 Intr - 89485 89399 87 1 0 83 72 124 0.238 9.55 6.02 Intr - 90024 89842 183 0 0 27 -52 225 0.334 1.46 6.01 Init - 93194 93192 3 2 0 93 115 0 0.748 3.25 6.00 Prom - 96512 96473 40 -9.95 7.00 Prom + 97288 97327 40 -4.35 7.01 Init + 98420 98958 539 1 2 27 15 509 0.055 31.88 7.02 Intr + 101836 101899 64 1 1 99 53 32 0.015 -1.40 7.03 Term + 103525 103902 378 0 0 81 41 309 0.960 19.30 7.04 PlyA + 103941 103946 6 -1.75 8.00 Prom + 104857 104896 40 -5.85 8.01 Init + 106624 106694 71 0 2 29 27 122 0.544 0.77 8.02 Intr + 107508 107631 124 0 1 58 64 155 0.869 9.77 8.03 Term + 107813 108034 222 1 0 69 55 230 0.538 13.73 8.04 PlyA + 108831 108836 6 1.05 9.00 Prom + 109181 109220 40 -4.95 9.01 Init + 118828 118884 57 0 0 114 100 -25 0.524 2.67 9.02 Term + 135959 136381 423 1 0 57 38 254 0.890 11.61 9.03 PlyA + 136778 136783 6 1.05 10.02 PlyA - 137763 137758 6 1.05 10.01 Sngl - 154305 153583 723 0 0 86 41 249 0.975 15.98 10.00 Prom - 159839 159800 40 -5.15 11.00 Prom + 174125 174164 40 -4.95 11.01 Init + 188566 188710 145 0 1 71 75 149 0.048 12.23 11.02 Term + 190152 190225 74 1 2 65 47 78 0.037 -1.51 11.03 PlyA + 191410 191415 6 1.05 12.03 PlyA - 192454 192449 6 1.05 12.02 Term - 197445 197300 146 1 2 -22 42 184 0.380 -0.11 12.01 Init - 197642 197531 112 2 1 52 93 64 0.445 3.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 82845 82725 121 0 1 68 74 83 0.825 5.30 S.002 Sngl + 98420 98962 543 1 0 27 48 520 0.936 37.74 S.003 Init + 100001 100129 129 1 0 57 77 109 0.907 6.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_1|262_aa MVKFLDTYTLPRLNQEEAESLNRPITGSEIEAIINSLPTKKSPGPDGFRAKFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNINAKILNKILA NQIQQHIKKLIHHDQVGFIPGMQGSFDIRKSVNVIQHINRTKDKNLIDAEKAFDKILQPF MLKSLNKLGINGMYLNIIRAVYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVL EVLARAIRQEKERKGIQLRKEE >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_1|789_bp atggttaaattcctcgatacatacaccctcccaagactaaaccaggaagaagctgaatct ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccgggaccagatggattcagagctaaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaagccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcaatgcaaaaatcctcaataaaatactggca aaccaaatccagcagcatatcaaaaagcttatccaccatgatcaggtgggcttcatccct gggatgcaaggctcgttcgacatacgcaaatcagtaaacgtaatccagcatataaacaga accaaagacaaaaacctaatcgatgcagaaaaggcctttgacaaaattctacagcccttc atgctaaaaagtctcaataaattaggtattaatgggatgtatctcaatataataagagct gtttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttgaaaactggcactagacagggatgccctctctcaccacttcttttcaacatagtgttg gaagttctggccagggcaatcaggcaggagaaagaaagaaaggggattcaattaagaaaa gaggaataa >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_2|116_aa MELKTVAQELHDTCTSFNSRFHQLQERISVIEDQMNEIKQEGKFREKRIKINEQSLQEIW DYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARRANIQIQEIQRTSQR >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_2|351_bp atggaactgaaaaccgtggcacaagaactacatgacacatgtacaagcttcaatagccga tttcatcaactgcaagaaagaatatcagtgattgaagaccaaatgaatgaaattaagcaa gaagggaagtttagagaaaaaagaataaaaataaatgaacaaagtctccaagaaatatgg gactatgtgaaaagaccaaatctacgtttgattggtgtacctgaaagtgacggggagaat ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca aggcgggccaacattcaaattcaggaaatacagagaacgtcacaaagataa >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_3|368_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESRIMSELPFTIAS KRTKYLGIQLTRDVKDLFKENYKPLLNELKEDTNKWKNIPCLWIGRINIVKMAILPKIIY RFNAIPIKPPMTFFTELEKRTLKFIWNQKRACIAKTIVSQKNKAGGITLPDFKLYYKATI TKTAWYWYQNRDVDQWNTTEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAI CRKLKLDPFLTPYTKINSRWIKDSNVRRKTIKTLEENVGNTIQDLGMGKDFMTKTPKAMA TKAKIDKRDLIKLKSFCTAKETAIRVNRQPTEWEKNFAIYPSDKGLISRIYKELKHIYKK KSTPSKSG >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_3|1107_bp atgattgtgtatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaggcattcctatac accaataacagacaaacagagagccgaatcatgagtgaactcccattcacaattgcttca aagagaacaaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaactaaaagaggatacaaacaaatggaagaacattcca tgcttatggataggaagaatcaacattgtgaaaatggccatactgcccaagataatttat agattcaatgccatccccatcaagccaccaatgactttcttcacagaattggaaaaaaga actttaaagttcatatggaaccaaaaaagagcctgcattgccaagacaatagtaagccaa aagaacaaagctggaggcatcacgttacctgacttcaaactatactacaaagctacaata accaaaacagcatggtactggtaccaaaacagagatgtagaccaatggaatacaacagag ccctcagaaataataccacacatctacaaccatctgatctttgacaaacctgataaaaac aagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaggtgg attaaagactcaaatgttagacgtaaaaccataaaaaccctagaagaaaacgtaggcaat accattcaggacttaggcatgggcaaggacttcatgactaaaacaccaaaagcaatggca accaaagccaaaattgacaaacgggatctaattaaactaaagagcttctgcacagcaaaa gaaactgccatcagagtgaacaggcaacctacagaatgggagaaaaattttgcaatctac ccatctgacaaagggctaatatccagaatctacaaagaacttaaacatatttacaagaaa aaatcaaccccatcaaaaagtgggtga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_4|218_aa MGDFNTPLSTLDRSMRQKVNKNIQELNSALHQADLIDIYRTLHPRSAEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTRNHSTTWKLNNLLLNDYWV YNEMKAEIKMFFETNENEDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELDKQEQTHSKASRRQEITKIRAELKEIETQKNLQKNQ >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_4|657_bp atgggagactttaacaccccactatcaacattagacagatcaatgagacagaaagttaac aagaatatccaggaattgaactcagctctgcaccaagcagatctaatagacatctacaga accctccaccccaggtcagcagaatatacattcttctcagcaccacatcacacttattcc aaaattgaccacatagttggaagtaaagcgctcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaagctc actcgaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta tataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacgaagacaca acataccagaacctctgggacacttttaaagctgtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaa gaactagataagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcagagcagaactgaaggagatagagacacaaaaaaaccttcaaaaaaatcaatga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_5|123_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQEQKTKHRIFSLIEECSSSPATEQSWMEN DFDELRQEGFRQSVITNFSELKGDLQTHRKEAKNLEKRLDEWLTRINSVEKTLNDLMKLK TMA >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_5|372_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaattggaaaccatcattctcagtaaactatcgcaagaacaaaaaaccaaacaccgcata ttctcactcatagaggaatgcagctcctcaccagcaacggaacaaagctggatggagaat gactttgacgagttgagacaagaaggcttcagacaatcggtaataacaaacttctccgag ctaaagggggatcttcaaacccatcgcaaagaagctaaaaaccttgaaaaaagattagac gaatggctaaccagaataaacagcgtagagaagaccttaaatgacctgatgaagctgaaa accatggcatga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_6|230_aa MRRDPGRVVARSALDRRVVTWSRKAALCDLDDRQNVRQPWRWPTRGGGGGQGGSGGGGGG RGVRASLPEPRNSAAAMASNMDREMILADFQKPICVILLAFWTVCSLTVVHLLSYKPLSN ELIVKVFDASMLPAVHRHLWSCEKRATILQIPEWQIHQQLHPVPGKAADTQCQPVKAPRR EAVPCKATGAELPKTMGTHLLHPCDLNVRPGVKGDHFGALKFNCPAGFHT >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_6|693_bp atgcgcagagatcctggccgggtagtcgcgcgctcagcgcttgacaggagggtggtcacg tggagccgcaaagcggcgctttgcgacctcgatgacaggcaaaatgtgcgacagccgtgg cgctggccaaccaggggcggaggcggcggccagggaggaagcggaggaggcggaggcggc cgcggcgtgcgcgcttcgctcccggagccgcggaactcggcggccgccatggcgtccaac atggaccgggagatgatcctggcggattttcagaaaccaatctgtgttatacttttggct ttctggactgtttgcagtctaacagtagtgcaccttttatcgtacaagccactcagcaat gaactgattgtcaaggtatttgatgcttccatgctccctgccgttcataggcatctgtgg agctgtgagaagagggccaccatcctccagatcccagaatggcagatccaccaacagctt caccctgtgcctggaaaagctgcagacactcagtgccagcctgtgaaagcacctagaagg gaggctgtaccctgcaaagccacaggcgcggagctgcccaagaccatgggaacccacctt ttgcatccgtgtgacctgaatgtgagacctggagtcaaaggagatcattttggcgcttta aaatttaactgccccgctggatttcacacttga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_7|326_aa MRSEKREKKPAPSLRGNRSSNCFSEKSSKGGSEGKKQKRGRHASACRGERSRDRRSRGGS ATKETVTGSGAAKEEAVLPGSAPGHSWLAAALSAAKGHRPGTARGGKVAPGAEAAERSPR RRPDWLPRNCGDSPYSELGLRFPGLSPSPEAPTKPGKEGKDSGGSSSMSAYSRKPERARS SLNMTKSLGTVVVVSIFNNQKVMKLGNPEIARRLLLRGANPDLKDRTGFAVIHDAARAGF LDTLQTLLEFQADVNIEDNEGNLPLHLAAKEGHLRVVEFLVKHTASNVGHRNHKGDTACD LARLYGRNEVVSLMQANGAGGATNLQ >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_7|981_bp atgaggtcggaaaaaagggagaagaaaccggcaccctctctgagaggcaacagaagcagc aattgtttcagcgaaaaaagcagcaagggagggagtgaaggaaaaaagcaaaaaaggggg cgacacgcaagtgcctgtaggggtgaaaggagcagggaccggcgatctagggggggatca gctacaaaagaaactgtcactgggagcggtgcggccaaggaggaagcagtgctgccaggc tctgctccagggcacagctggctggcggctgccctgtccgcagcaaaggggcacaggccg gggaccgcgagaggtggcaaagtggcaccgggcgccgaggctgctgagcgctcgccgaga cggcgaccggactggctgccccggaactgcggcgactctccctactcagaacttggccta cgtttcccaggactctccccatctccagaggcccccacaaaaccgggaaaggaaggaaag gacagcggcggcagcagctcaatgagtgcctacagcagaaagcctgaacgagctcggtcg tccttaaatatgactaaatctttagggacagttgtggtcgtatcaatttttaataaccaa aaagttatgaaacttggaaatcccgagattgccaggagactgctacttagaggtgctaat cccgatttgaaagaccgaactggtttcgctgtcattcatgatgcggccagagcaggtttc ctggacactttacagactttgctggagtttcaagctgatgttaacatcgaggataatgaa gggaacctgcccttgcacttggctgccaaagaaggccacctccgggtggtggagttcctg gtgaagcacacggccagcaatgtggggcatcggaaccataagggggacaccgcctgtgat ttggccaggctctatgggaggaatgaggttgttagcctgatgcaggcaaacggggctggg ggagccacaaatcttcaataa >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_8|138_aa MESVAQDWFRNLSSQKGAFGFCRGIWIPVALERQKGRGDRKAAKEASGRSLPQPPPAGNL GGALGAAVPEEAAAIPLDGSGDLCLRTGAGGRPGSKGQRAGECLALRGGELAAGRSGEAE QTAPDLDASTPFPSPHDS >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_8|417_bp atggagtccgtggcccaggactggtttcgaaatctcagttcccaaaaaggtgctttcggc ttttgccgggggatttggatcccagtggccctggagcggcagaagggccgcggcgatagg aaagcagcgaaggaggccagtggccggagtctgccccagcccccacccgcgggtaacctt ggaggcgccctcggagcagcggtccccgaggaggcggcagccatcccactggacggttct ggcgacctgtgcctgaggaccggggccggagggcggccagggagcaagggccagcgggcc ggtgagtgcctagctcttcgcggaggtgagctggcggcgggacgcagcggggaggctgag caaacggcgcccgacctcgatgccagcactcccttcccctccccccatgacagctga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_9|159_aa MPSQLTSYVIEVSLSDQNKDHNSSPAREQNWTENEFDKLTEVGFRRWEITNSSELKEHVL IQCKEAKNLEKRLEELLTRITSLEKNINDLMELKNTAQEFCEAYTSINSRINQAEERISE IEDQLNEIKPEDKIRGKRRKRNEQTSNKYGTMWKDQTYV >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_9|480_bp atgcccagccagctaacctcgtatgtaattgaggtttctctctcagatcagaacaaggat cacaactcctcgccagcaagggaacaaaactggacagagaatgagtttgacaaattgaca gaagtaggcttcagaaggtgggaaataacaaactcctccgagctaaaggagcatgttcta atccaatgcaaggaagctaagaaccttgaaaaaaggttagaggaattgctaactagaata accagtttagagaagaacataaatgacttgatggagctgaaaaacacagcacaagaattt tgtgaagcatacacaagtatcaatagccgaatcaatcaagcagaagaaaggatatcagag attgaagatcaacttaatgaaataaagcctgaggacaagattagaggaaaaagaaggaaa aggaatgaacaaacctccaataaatatgggactatgtggaaagaccaaacctacgtttga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_10|240_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIRNQKRACIAKTILSKKNKARSITLPD FKLYYKVTVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNHLIFDEPDKNKEWEKDSLFNK WHWENWLAMCRKLKLDPFLTPYTKINSIWIKDLNVRLKTTKSLEENVGNTIQDIGMGKDF MTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKNFAISQRANIQNQQIT >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_10|723_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcataaggaaccaaaaaagagcc tgcattgccaagacaatcctaagcaaaaagaacaaagctcgaagcatcacgctacctgac ttcaaactatactacaaggttacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagagccctcagaaataacaccacacatctacaaccat ctgatctttgacgaacctgacaaaaacaaggaatgggaaaaggattccctatttaataaa tggcactgggaaaactggctagccatgtgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaatatggattaaagacttaaatgttagacttaaaaccaca aaatccctagaagaaaatgtaggcaataccattcaggacataggcatgggcaaggacttc atgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaaaaattttgcaatctcacaaagggctaatatccagaatcaacaaataact taa >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_11|72_aa MEEDCHLQVGASDGSSGLSGVATVKMPASAGEAQLGLHTLLSPRVPGTGNEKERRAAALQ GAQMHRLPQPGL >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_11|219_bp atggaggaagactgtcacctacaggttggtgcaagtgatggcagcagtggcctgtctgga gtagccactgtgaagatgccggcttcagcaggggaagcacaactggggctgcacactctg ctgagcccacgggtgccgggaacaggcaatgagaaggagagaagagctgcagcccttcag ggagctcagatgcacaggctcccccagccagggctgtga >gi568815597f:50870369_51074267|GENSCAN_predicted_peptide_12|85_aa MGIFLHKLSLPAAIHMRRDLPLLAFHHDCEAPPATWDLNWYRERGMAEKIPENVEATLEL GNWQRLEQFGGLRRRQENVGKFGAF >gi568815597f:50870369_51074267|GENSCAN_predicted_CDS_12|258_bp atgggaattttcctgcacaagctctctttgcctgctgccatccacatgagacgggacttg cctctccttgccttccaccatgattgtgaggctcccccagccacgtgggacttaaattgg taccgagagaggggtatggctgaaaagatacctgaaaatgtggaagcgactttggaactg ggtaactggcagagattggaacagtttggagggctcagaagaagacaggaaaatgtggga aagtttggagctttttga