GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:03:52 Sequence gi568815575r:23603897_23836017 : 232121 bp : 43.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5514 5661 148 2 1 52 82 108 0.546 6.85 1.02 Intr + 21912 22013 102 1 0 70 95 37 0.227 2.75 1.03 Intr + 31505 31597 93 0 0 102 77 23 0.583 2.54 1.04 Term + 32833 32993 161 2 2 71 54 73 0.447 0.30 1.05 PlyA + 34592 34597 6 1.05 2.02 PlyA - 38444 38439 6 1.05 2.01 Sngl - 43610 42609 1002 2 0 86 37 285 0.770 20.15 2.00 Prom - 45509 45470 40 -4.96 3.03 PlyA - 45678 45673 6 1.05 3.02 Term - 46819 46532 288 1 0 -104 54 344 0.630 7.18 3.01 Init - 48345 48085 261 0 0 63 50 98 0.593 0.57 3.00 Prom - 55819 55780 40 -4.36 4.00 Prom + 58291 58330 40 -5.66 4.01 Init + 60624 60822 199 2 1 105 82 131 0.978 11.59 4.02 Intr + 63613 63915 303 2 0 14 39 346 0.910 18.86 4.03 Intr + 67633 67750 118 2 1 36 78 141 0.883 7.42 4.04 Intr + 78500 78630 131 2 2 113 77 89 0.943 10.64 4.05 Intr + 79775 79809 35 0 2 101 98 16 0.856 1.84 4.06 Term + 82389 82439 51 2 0 102 54 37 0.746 -0.87 4.07 PlyA + 82477 82482 6 1.05 5.11 PlyA - 82760 82755 6 1.05 5.10 Term - 87940 87800 141 1 0 76 54 34 0.303 -3.27 5.09 Intr - 88385 88137 249 2 0 33 119 91 0.042 4.33 5.08 Intr - 93627 93590 38 1 2 79 79 17 0.031 -2.12 5.07 Intr - 100948 100798 151 1 1 60 107 79 0.782 6.74 5.06 Intr - 101184 101097 88 2 1 86 96 26 0.763 3.17 5.05 Intr - 102843 102732 112 1 1 96 78 77 0.918 6.94 5.04 Intr - 118857 118774 84 1 0 87 111 59 0.842 7.89 5.03 Intr - 126668 126631 38 1 2 101 69 23 0.823 -0.39 5.02 Intr - 127090 126920 171 0 0 99 87 68 0.590 6.86 5.01 Init - 132016 131958 59 1 2 95 71 42 0.630 3.98 5.00 Prom - 149330 149291 40 -5.86 6.00 Prom + 154204 154243 40 -5.06 6.01 Init + 161589 161709 121 2 1 53 55 50 0.786 -1.45 6.02 Intr + 162039 162168 130 1 1 65 84 67 0.903 3.75 6.03 Intr + 162584 162746 163 2 1 82 100 83 0.844 8.88 6.04 Intr + 179417 179521 105 1 0 30 69 93 0.042 2.01 6.05 Intr + 179762 179813 52 1 1 102 86 14 0.946 1.08 6.06 Intr + 179904 179987 84 1 0 108 86 60 0.965 7.59 6.07 Intr + 181432 181533 102 2 0 74 80 115 0.966 9.45 6.08 Intr + 181624 181664 41 2 2 121 91 15 0.999 3.04 6.09 Term + 181790 181960 171 1 0 94 55 125 0.988 7.63 6.10 PlyA + 182000 182005 6 1.05 7.00 Prom + 191957 191996 40 -1.86 7.01 Init + 211173 211271 99 2 0 85 82 93 0.703 8.66 7.02 Intr + 231607 231663 57 0 0 128 87 -14 0.002 1.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 179456 179521 66 1 0 81 69 76 0.885 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_1|167_aa MTEDEGEHDWYKCSSFNNLHSKPWSQRDKDINVSLTYEENLGLELLLPEALFWWWMGSTK PPSQKTTEKMKNCLIKTFQGGYVELQVLPEEMSRPPATDFSPLPLPFLLGATHGGMGSAV YPGGCPKDKGSQVTCSSEPETRLPGATAIDSNPAPITSRTAGQLNAP >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_1|504_bp atgacagaagatgaaggagagcatgactggtacaagtgcagcagcttcaataaccttcac tcaaaaccttggagtcagagggacaaagacattaatgtctcactcacatatgaagagaat ctggggctggaactgctcctgccagaagcactattctggtggtggatgggcagcacaaag cccccatcccaaaagaccacggagaagatgaaaaactgcctcataaaaacttttcaagga ggatacgtagaattgcaagttttacctgaggaaatgtccaggccccctgctacagacttt agccccttgcccttgcctttccttctaggggctacacatggtggaatgggcagtgcagtg tacccaggaggctgccccaaagacaaagggagccaagtcacatgctcctcagagcctgaa actcgcctgcctggagccacagccattgacagcaaccctgctcccatcaccagcaggact gcaggacaattaaatgcaccctga >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_2|333_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACISKTILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRDIDQLNRTEPSEITPHVYNHLIFDKPDKNKQWGKDSLFNK WSWENWVAIWRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTGKETTIRVNRQPTEWEKMFAIYSSDKGLISRIY IELKQIYKKKSNNPINKWAKDMNKHFSKEDIYAANRHMKKCSSSLAIREMQIKTTKRYHL TPVRMAIIQKVRKQQVLERTWRNRNTFTLLVGL >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_2|1002_bp atggccatactgcccaaggtaatttatagattcaatgctatccccatcaagctgccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgtatttccaagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagaccaattgaacagaacagagccctcagaaataacaccacacgtgtacaaccat ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa tggtcctgggaaaactgggtagccatatggagaaagctgaaactggaccccttccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagaccaaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacaggaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaaaatgtttgcaatctactcatctgacaaagggctaatatccagaatctac atagaactcaaacaaatttacaagaaaaaatcaaacaaccccatcaacaagtgggcaaag gatatgaacaaacacttctcaaaagaagacatttatgcggccaacagacacatgaaaaaa tgctcgtcatcactggccatcagagaaatgcaaatcaaaaccacaaagagataccatctc acaccagttagaatggcgatcattcaaaaagtcaggaaacaacaggtgctggagaggacg tggagaaataggaacacttttaccctgttggtgggactgtaa >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_3|182_aa MQIVQTWRCTLSQGYHCSWAAVDPFSVYGLVNPVHTADVEFFEITVSWTGLIFGFTNHMQ LHIGATFTANSTPCNELFNSHTIPRQQTENDFDELREEGFRRSNFSELKEKVRTQRKEVK NLEKRLDEWVTRITSAEKSLNDLIELKTMARELRDECTSFSSQFDQLEERVSVLEDQMNE MK >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_3|549_bp atgcagatagttcagacctggcgctgtaccctttctcaaggttaccactgttcatgggct gcagtagatcccttttctgtgtatggccttgtcaatcctgtgcacacagcagatgtggaa ttctttgaaataactgttagttggactggtttaatatttggattcaccaaccacatgcag ttacacattggtgcaacctttacagcaaacagtactccctgcaatgaactatttaattca catactatccccagacagcagacagagaatgactttgacgagttgagagaagaaggcttc agacgatcaaacttctccgagctaaaggagaaagttcgaacccaacgcaaagaagttaaa aaccttgaaaaaagattagacgaatgggtaactagaataaccagtgcagagaagtcctta aatgacctgattgagctgaaaaccatggcacgagaactacgtgatgaatgcacaagcttc agtagccaatttgatcaactggaagaaagggtatcagtgctggaagatcaaatgaatgaa atgaagtga >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_4|278_aa MDHRSRLRGTGLNRIPGTQSRAPRVPLPFHVQQEAREGEDWEREPPRQRPPIYEPPESEE LPDNVMAARRRQKRRSRQGTCFCARVVMEALPLLAATTPDHGRHRRLLLLPLLLFLLPAG AVQGWETEERPRTREEECHFYAGGQVYPGEASRVSVADHSLHLSKAKISKPAPYWEGTAV IDGEFKELKLTDYRGKYLVFFFYPLDLGLFIIDDKGILRQITLNDLPVGRSVDETLRLVQ AFQYTDKHGEVCPAGWKPGSETIIPDPAGKLKYFDKLN >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_4|837_bp atggatcaccgaagccgactacggggcacaggcctgaaccgaatccctgggactcagtcc cgagccccccgagtcccactccccttccacgtgcaacaggaggccagggaaggagaagac tgggagcgagagccacctcgtcagaggcctcctatctatgagccaccagaaagtgaagag ctgccagataatgttatggctgcccggcggcggcagaagcggcgctcgcgccaagggacg tgtttctgcgctcgcgtggtcatggaggcgctgccgctgctagccgcgacaactccggac cacggccgccaccgaaggctgcttctgctgccgctactgctgttcctgctgccggctgga gctgtgcagggctgggagacagaggagaggccccggactcgcgaagaggagtgccacttc tacgcgggtggacaagtgtacccgggagaggcatcccgggtatcggtcgccgaccactcc ctgcacctaagcaaagcgaagatttccaagccagcgccctactgggaaggaacagctgtg atcgatggagaatttaaggagctgaagttaactgattatcgtgggaaatacttggttttc ttcttctacccacttgatttaggtctcttcattattgatgacaaaggaatcctaagacaa attactctgaatgatcttcctgtgggtagatcagtggatgagacactacgtttggttcaa gcattccagtacactgacaaacacggagaagtctgccctgctggctggaaacctggtagt gaaacaataatcccagatccagctggaaagctgaagtatttcgataaactgaattga >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_5|376_aa MVSLFDAYVVKNLVLLLFERDHVKAMEERKLLHSFLAKSQDGLPPRRMKDSYIEVLLPLG SEPELREKYLTVQNTVRFGRILEDLDSLGVLICYMHNKIHSAKMSPLSIVTALVDKIVNK GRRIAFSSTSLLKMAPSAEERTTIHEMFLSTLDPNGSRPFVVAVDDIMFQKPVEVGSLLF LSSQVCFTQNNYIQVRVHSEVASLQEKQHTTTNVFHFTFMSEKEVPLVFPKTYGAIFLEG SVVSLKVSCLSNEVSIKGPRGPGTQNFWTAELVKACRKVNKNSPMCWDDGAPQLHGNRSS RAQDPSRPRPMCLFIWVFIRILQNILHKKLVDSIRIELDQKIPGWCPLQNCLPVGGENPY TLHIWSQKSSVLIVVM >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_5|1131_bp atggtgtctctgtttgatgcctatgtggtgaaaaaccttgttttacttttatttgaaaga gaccatgtgaaggcaatggaagaaaggaaattacttcatagtttcttggctaaatcacag gatggactgcctcctaggagaatgaaggacagttatattgaagttctcttgcctttgggc agtgagcctgaattacgagagaaatatttgactgttcaaaacaccgtaagatttggcagg attcttgaggatcttgacagcttgggagttcttatttgttacatgcacaacaaaatccac tccgccaagatgtctcctttatcgatagttacagccctggtggataagattgtgaacaag gggagaagaattgccttcagctccacgtcgttactgaaaatggcccccagcgctgaggag aggaccaccatacatgagatgtttctcagcacactggatccaaatggttctcgaccgttt gtggtagcagtagatgacatcatgtttcagaaacctgttgaggttggctcattgctcttt ctttcttcacaggtatgctttactcagaataattatattcaagtcagagtacacagtgaa gtggcctccctgcaggagaagcagcatacaaccaccaatgtctttcatttcacgttcatg tcggaaaaagaagtgccattggttttcccaaaaacatatggagccattttcttggaggga tcagtggtgtcactgaaggtgtcatgcctatccaatgaagtctccataaaaggcccaaga ggaccaggtacgcagaacttctggacagctgagctggtgaaggcttgcaggaaggtgaac aagaattcacccatgtgttgggatgatggtgcaccccaactccacgggaatagaagctcc cgtgctcaggacccttccagacctcgccctatgtgcctcttcatctgggtgtttattcgt atccttcaaaatatccttcataagaaactggtggacagcatcagaattgaattggatcag aagattcctggctggtgtccactgcagaattgcttgcctgttggtggggaaaatccctac accctacacatttggtcacagaagtcttctgtattgattgttgtcatgtga >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_6|322_aa MATPQVEQPVDHFQSIGTALTSERNDGIAVPQLYQLDSPTTRKLCDQWCLCLSFDQARQT CSTDVAWQAVLSSCYQPGFHACRGPAIQWVLSSRPTSRKNEVRGQLEGEQGGEEPYRVTE QLSGNLKWVAPFRRQVVPGPGPQREEKQKTKMAKFVIRPATAADCSDILRLIKELAKYEY MEEQVILTEKDLLEDGFGEHPFYHCLVAEVPKEHWTPEGHSIVGFAMYYFTYDPWIGKLL YLEDFFVMSDYRGFGIGSEILKNLSQVAMRCRCSSMHFLVAEWNEPSINFYKRRGASDLS SEEGWRLFKIDKEYLLKMATEE >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_6|969_bp atggctactccacaggtagagcagccagtagaccattttcaaagcattgggacagctctc acctcagaaagaaatgatggtattgcggtgcctcagctctaccaactagattcaccaacc actagaaagctctgtgaccagtggtgcctttgcctgagttttgatcaggcccgccagact tgttccactgacgtggcctggcaggctgtgctcagctcatgctaccagccaggattccac gcctgccgaggtcccgccattcagtgggtcctgagttctcgtcccacatccaggaagaat gaggtacgcggacaactagagggtgagcaaggcggagaggagccttatcgagtgacagaa cagctctcgggaaacttgaagtgggtagctcctttccgcaggcaggttgtcccagggcct ggtccgcaaagggaagaaaagcaaaagacgaaaatggctaaattcgtgatccgcccagcc actgccgccgactgcagtgacatactgcggctgatcaaggagctggctaaatatgaatac atggaagaacaagtaatcttaactgaaaaagatctgctagaagatggttttggagagcac cccttttaccactgcctggttgcagaagtgccgaaagagcactggactccggaaggacac agcattgttggttttgccatgtactattttacctatgacccgtggattggcaagttattg tatcttgaggacttcttcgtgatgagtgattatagaggctttggcataggatcagaaatt ctgaagaatctaagccaggttgcaatgaggtgtcgctgcagcagcatgcacttcttggta gcagaatggaatgaaccatccatcaacttctataaaagaagaggtgcttctgatctgtcc agtgaagagggttggagactgttcaagatcgacaaggagtacttgctaaaaatggcaaca gaggagtga >gi568815575r:23603897_23836017|GENSCAN_predicted_peptide_7|52_aa MTRDNRVEGYDLTSEVMQYHFGCIQVIVGEVTKEFDRFIALLKTKNWKQPML >gi568815575r:23603897_23836017|GENSCAN_predicted_CDS_7|156_bp atgacgagagacaaccgggtggaaggttatgacctaacctcagaagtcatgcagtatcac tttggctgcatccaggtcattgtgggggaggtcactaaggaatttgatagatttattgca ttgttaaaaactaaaaactggaaacaacctatgttg