GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:31:36 Sequence gi568815578r:21411926_21613669 : 201744 bp : 43.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2949 3666 718 2 1 68 77 451 0.013 37.03 1.02 Term + 5595 6955 1361 1 2 -1 47 321 0.009 10.69 1.03 PlyA + 7161 7166 6 1.05 2.03 PlyA - 8125 8120 6 1.05 2.02 Term - 31294 31180 115 0 1 16 42 122 0.088 -1.56 2.01 Init - 39591 39551 41 0 2 74 98 31 0.197 2.38 2.00 Prom - 57102 57063 40 -1.66 3.11 PlyA - 57504 57499 6 1.05 3.10 Term - 69821 68937 885 2 0 13 41 307 0.302 10.99 3.09 Intr - 71020 70366 655 0 1 4 36 329 0.117 11.18 3.08 Intr - 77739 77639 101 0 2 60 77 74 0.796 2.41 3.07 Intr - 86887 86857 31 0 1 14 93 77 0.059 -1.07 3.06 Intr - 88798 88619 180 0 0 101 38 101 0.116 5.38 3.05 Intr - 89669 89583 87 1 0 99 37 80 0.138 3.09 3.04 Intr - 94691 94418 274 0 1 53 117 183 0.655 14.30 3.03 Intr - 98295 98216 80 2 2 100 76 26 0.591 1.79 3.02 Intr - 100560 100002 559 1 1 115 85 919 0.845 86.19 3.01 Init - 101744 101486 259 2 1 83 105 538 0.999 52.30 3.00 Prom - 102176 102137 40 -8.56 4.04 PlyA - 103002 102997 6 1.05 4.03 Term - 107698 107594 105 1 0 73 44 107 0.383 3.21 4.02 Intr - 110477 110288 190 1 1 99 80 87 0.189 8.59 4.01 Init - 144815 144697 119 2 2 81 115 20 0.157 3.67 4.00 Prom - 148013 147974 40 -4.16 5.05 PlyA - 151458 151453 6 1.05 5.04 Term - 160094 159969 126 2 0 50 53 100 0.369 0.98 5.03 Intr - 164685 164555 131 1 2 87 66 71 0.714 5.21 5.02 Intr - 167558 167403 156 0 0 62 36 92 0.532 1.38 5.01 Init - 171825 171753 73 0 1 111 110 43 0.894 8.05 5.00 Prom - 187136 187097 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 5636 6955 1320 1 0 78 47 293 0.893 20.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:21411926_21613669|GENSCAN_predicted_peptide_1|692_aa MRKNQCKNAEDSKNQNAASPPKVHNSLPAREQNWTENKFDKLTEVGIRRWVINSSELKEH VLTQCKEAKNLDKRLQELLTIIASLEKNINDLMELKNTARELHEVYASINSQIDQVEERI SEIEDQLNEIKCEDKIREKRMKRNEQSLQEIWDYVKRPNLRWIGVPESDRENGTKLENTL PDITQENFPNLARQANIQIQEIQRTPQRYSSRRATPRYIIIRFTKVEMKEKMVRAAREKE TQQKEKISGQYPMMNTDAKILNKILANQIQQHIKKLIHHDQVSFIPGMQGWFNICKSINV IHHINKTNDKNHMIISIDAEKPFDKIQHPFMLKILSKLGIDGTYLKIIRAIYDKPTANIV LNGKNLDTFPLKTHTRQGCPFSPLLFNIVLEVPAKAVRQEKEIKGIQIGKEEVKLSLFAD DMTVYLENPIISAPKLLKLISNFSKVTGYKINVQKSQAFLYSSNSQIMSELPFTIATKRI KYLGMQLTRDVKDLFKENYKPLLKEIREDTNKWKNIPCSWIGRINIVKMTILPKVIFRFN AIAIKLQLAFFAELEKTTLNFIWNQKRTHIAKAILSKKNKAGGIMIPDFKLYYKDTVTKA AWYWYQNRYIDQWNRTEALEITPHIYQPKCPSMIDWIKKMWHIHTMEYCAAIKKDEFMSF VGTWMKLETIILSKLSQGQKTKHHMFSLIGGN >gi568815578r:21411926_21613669|GENSCAN_predicted_CDS_1|2079_bp atgaggaaaaaccagtgcaaaaatgctgaagattccaaaaaccagaatgctgcttctcct ccaaaggttcacaactccttgccagcaagggaacaaaactggacagagaataagtttgac aaattgacagaagtaggcatcagaagatgggtaataaactcctctgagctaaaggagcat gttctaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgctaact ataatagccagtttagagaagaacataaatgacctgatggagctgaaaaacacagcaaga gaacttcatgaagtatacgcaagtatcaatagccaaattgatcaagtggaagaaaggata tcagagattgaagatcaacttaatgaaatcaagtgtgaagacaagattagagaaaaaaga atgaagaggaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaaccta cgttggattggtgtacctgaaagtgacagggagaatggaaccaagttggaaaacacactt ccggatattacccaggagaacttccctaacctagcaagacaggccaacattcaaattcag gaaatacagagaacaccacaaagatactcctcaagaagagcaaccccaagatacataatc atcagattcaccaaggttgaaatgaaggaaaagatggtaagggcagccagagagaaagag acacaacaaaaagagaaaatttcaggccaatatcccatgatgaacactgatgcaaaaatc ctcaataaaatactggcaaaccaaatccagcagcacatcaaaaagcttatccaccacgat caagtcagcttcatccctgggatgcaaggctggttcaacatatgcaaatcaataaacgta atccatcacataaacaaaaccaatgacaaaaaccacatgattatctcaatagatgcagaa aagccctttgataaaattcaacaccccttcatgctaaaaatactcagtaaactaggtatt gatgggacgtatctcaaaataataagagctatttatgacaaacccacagccaatattgta ctgaatgggaaaaacttggatacattccctttgaaaacccacacaagacaaggatgccct ttctcaccactcctattcaacatagtattggaagttccggccaaggcagtcaggcaagag aaagaaataaagggtattcagataggaaaagaggaagtcaaattgtctctgtttgcagat gacatgactgtatatttagaaaaccccatcatctcagccccaaaactcctaaagctgata agcaacttcagcaaagtcacaggatacaaaatcaatgtgcaaaaatcacaagcattctta tacagcagtaatagccaaatcatgagtgaactcccattcacaattgctacaaagagaata aaatacctaggaatgcaacttacaagggatgtgaaggacctcttcaaggagaactacaaa ccactgctcaaggaaataagagaggacacaaacaaatggaaaaacattccatgctcatgg ataggaagaatcaatatcgtgaaaatgaccatactacccaaagtaatttttagatttaat gctattgccattaagctgcaattggctttctttgcagaattagaaaaaactactttaaat ttcatatggaaccagaaaagaacccatatagccaaggcaatcctaagcaaaaagaacaaa gctggaggcatcatgatacctgacttcaaactctactacaaggatacagtaaccaaagca gcatggtactggtaccaaaacagatatatagaccaatggaacagaacagaggccttagaa ataacaccacacatctaccaacccaaatgtccatcaatgatagactggattaagaaaatg tggcacatacacaccatggaatactgtgcagccataaaaaaggatgagttcatgtccttt gtagggacatggatgaagctggaaaccatcattctgagcaaactatcgcaaggacagaaa accaaacaccacatgttctcactcataggtgggaattga >gi568815578r:21411926_21613669|GENSCAN_predicted_peptide_2|51_aa MQPMSHGLDELALHLAKVKMIDKPIAAQSVSCKDGERRSKLDHLPGKQFDI >gi568815578r:21411926_21613669|GENSCAN_predicted_CDS_2|156_bp atgcagcccatgagccatggattggatgagcttgctctacatttggcaaaggttaaaatg attgacaagcccattgctgcccaatccgtctcctgcaaggatggggaaagacgatctaaa ttggaccatcttcctggaaagcagtttgacatttga >gi568815578r:21411926_21613669|GENSCAN_predicted_peptide_3|1036_aa MSLTNTKTGFSVKDILDLPDTNDEEGSVAEGPEEENEGPEPAKRAGPLGQGALDAVQSLP LKNPFYDSSDNPYTRWLASTEGLQYSLHGLAAGAPPQDSSSKSPEPSADESPDNDKETPG GGGDAGKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPTQVKIWFQNHR YKMKRARAEKGMEVTPLPSPRRVAVPVLVRDGKPCHALKAQDLAAATFQAGIPFSAYSAQ SLQHMQYNAQYSSASTPQYPTAHPLVQAQQWTWPLSGDWIPGLLPSSVRRSEPGCAVPPG IVINKEANRPQRRHPSKRGNVGQQPRPARGSRVRSTRPLGPENRACAACCRRQEAPEPSV PERLCARVRAEPGSRRRQLRISGLAARVGARLSTLKKRACERRGTIQDVQDCGRSCSSHL AWITVPQALGYKRITWDLDKMQIRVQQPWHSAFLMSSRETLVLLGQDHTLGGKGLAGNSR APAVTMGYNTEHILQARKADAKRQFSANTKQQLVDKCPNFITLWSWFFEKINKIDRPLAR LIKKKREKNQIDTIKNDKGDITTDATEILPTIREYYKHLYANKLENLEEMDKFLDTYTLP RLNQEEVESLNRPITGSEIEAIINSLPPKKSPGPDGFTAKFYQRYKEDLAPFLLKLFQSI EKEGILPNSFYEANIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQNIKKLI HHDQVGFIPGMQGWFNIHKSINRIKYLGVQLTRDVKDLFKENYKPLLNEIKEDTNKWKNI PCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFMWNQKRARIAKTILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRVELSEIIPHIYIHLIFDRPDK NNKWGKDSLFNKWCWENWLAILRKLKLDPFLTPYTKINSRWIKDLTVRPKTIKTLEENLG NTIQAIGMGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTAWEKIFTI YSSDKGLISRKNLQRT >gi568815578r:21411926_21613669|GENSCAN_predicted_CDS_3|3111_bp atgtcgctgaccaacacaaagacggggttttcggtcaaggacatcttagacctgccggac accaacgatgaggagggctctgtggccgaaggtccggaggaagagaacgaggggcccgag ccagccaagagggccgggccgctggggcagggcgccctggacgcggtgcagagcctgccc ctgaagaaccccttctacgacagcagcgacaacccgtacacgcgctggctggccagcacc gagggccttcagtactccctgcacggtctggctgccggggcgccccctcaggactcaagc tccaagtccccggagccctcggccgacgagtcaccggacaatgacaaggagaccccgggc ggcgggggggacgccggcaagaagcgaaagcggcgagtgcttttctccaaggcgcagacc tacgagctggagcggcgctttcggcagcagcggtacctgtcggcgcccgagcgcgaacac ctggccagcctcatccgcctcacgcccacgcaggtcaagatctggttccagaaccaccgc tacaagatgaagcgcgcccgggccgagaaaggtatggaggtgacgcccctgccctcgccg cgccgggtggccgtgcccgtcttggtcagggacggcaaaccatgtcacgcgctcaaagcc caggacctggcagccgccaccttccaggcgggcattcccttttctgcctacagcgcgcag tcgctgcagcacatgcagtacaacgcccagtacagctcggccagcaccccccagtacccg acagcacaccccctggtccaggcccagcagtggacttggcctctctccggggactggatc ccgggcctccttccctccagcgttcggcggtccgaaccagggtgtgcggtccccccaggc atcgttattaataaagaggcgaataggcctcagcgccgccatccgagcaagcggggaaat gtgggccagcagcccaggcccgcgcgcggatcccgcgtacgctccacgcgccccctcggg ccggagaaccgagcgtgtgccgcgtgctgccgccgccaggaggcgcccgagcccagcgtt cccgagcgtctctgcgcgcgggtccgggcagagcccgggagccgccggaggcagctgcgc atcagcggactcgcggcccgggtcggagcccgtctcagcactctgaagaagcgtgcttgt gagcgccgcgggacgatccaggatgtccaggattgtggacgatcctgttcttcccatctg gcctggatcacagttcctcaagctttgggatacaaacgaatcacctgggatttggataaa atgcagattcgtgttcagcagccctggcattctgcatttctcatgagctccagggagacg ctggtgctgctggggcaggaccacactttgggtggcaagggcttggcaggcaattccaga gcccctgcggtcaccatgggctacaacacggagcatattctgcaggccagaaaagctgac gccaaaaggcagttctcagccaatacaaaacagcagttagtagataaatgccccaacttc atcactctttggagctggttttttgaaaagatcaacaaaattgatagaccgctagcaaga ctaataaagaagaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggat atcaccactgatgccacagaaatactacctaccatcagagaatactataaacacctctat gcaaataaactagaaaatctagaagaaatggataaattcctggacacatacaccctccca agactaaaccaggaagaagttgaatctctgaatagaccaataacaggttctgaaattgag gcaataattaatagtctaccacccaaaaaaagtccaggaccagatggattcacagccaaa ttctaccagaggtacaaagaggacctggcaccattccttctgaaactattccaatcaata gaaaaagagggaatcctccctaactcattttatgaggccaacatcatcctgataccaaag cctggcagagacacaaccaaaaaagagaattttagaccaatatccctgatgaacattgat gcaaaaatcctcaataaaatactggcaaacagaatccagcagaacatcaagaagcttatc caccatgatcaagttggcttcatccctgggatgcaaggctggttcaacatacacaaatca ataaacagaataaaatacctaggagtccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacatt ccatgctcatggataggaagaatcaatattgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actactttaaagttcatgtggaaccaaaaaagagcccgcattgccaagacaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagagta gagctctcagaaataataccacacatctacatccatctgatctttgacagacctgacaaa aacaacaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atacttagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaaga tggattaaagacttaactgttagacctaaaaccataaaaaccctagaagaaaacctaggc aatacaattcaggccataggcatgggcaaggacttcatgactaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacagcatgggagaaaatttttacaatc tactcatctgacaaagggctaatatccagaaagaacctacaaagaacttaa >gi568815578r:21411926_21613669|GENSCAN_predicted_peptide_4|137_aa MTVNAMWGPGFHLRTGKGHSRRAEGIQIKPGVQSLVLHPCRPARERAPGTVTAACQQRGV SGHRRRRRAAPELGFTADPGYEPLALPGPDSARSPSRVVRTQQPLLPHWKSKPKWGCFGD AILKVNVFIATLRDQIG >gi568815578r:21411926_21613669|GENSCAN_predicted_CDS_4|414_bp atgactgtgaatgcaatgtggggtcctggcttccacctgagaacaggaaaaggacatagc agaagagctgagggaatccagataaaacctggagttcagtcactagtattgcatccatgc cgcccggcgcgggaacgcgctccgggcacggtcaccgcggcctgccagcagagaggggtc tcggggcaccgtcgtcgccgccgagcggcgccggagctcgggttcacggcggaccccggt tatgagcccctcgctctcccgggtcctgactcggctcgatccccaagccgcgtagtccgg acgcagcagcccctgctcccgcattggaagtcgaagccaaagtggggctgttttggagat gccatccttaaggtgaacgtctttatcgccacactcagggaccagattgggtga >gi568815578r:21411926_21613669|GENSCAN_predicted_peptide_5|161_aa MALAPHGLSPSIWPLPSVQALEKAGHVARPHKSPFENHYHPSSFQIGGLSLLKGPKAAME KCPKALFASLGQRSTCEILFITKSPPYSNVSGTHYCFCWFPSSSWRPPWTNQQLDANPTQ FRPPALVCGTLLRCGRWNWHRVHFSELTFTPYSTWVFNGEP >gi568815578r:21411926_21613669|GENSCAN_predicted_CDS_5|486_bp atggccctagctccccacggcctgtctccttcaatctggcctctcccctctgtccaggct ttagagaaagcaggtcatgttgctcgtccacataaaagcccattcgagaatcattatcac ccaagcagcttccagattggcggcttatcacttctgaagggtcccaaggcggccatggag aagtgccccaaagccttgtttgcttccttgggccaaaggtccacctgtgaaatacttttt atcaccaagtcaccaccttactccaacgtttctggaactcactactgtttttgttggttt ccaagcagcagctggagacctccctggaccaaccagcagctggatgcaaacccaactcag tttcgcccaccagccctggtctgcgggacgctgctccggtgtggaaggtggaactggcat cgagtgcatttcagtgagctcaccttcactccttattccacatgggtgttcaacggggag ccctga