GENSCAN 1.0 Date run: 8-Nov-116 Time: 08:26:36 Sequence gi568815592r:5922_206856 : 200935 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 58993 60814 1822 2 1 71 42 564 0.009 36.47 1.02 Term + 63824 64583 760 2 1 9 36 284 0.083 7.21 1.03 PlyA + 64766 64771 6 1.05 2.00 Prom + 69565 69604 40 -5.35 2.01 Init + 72968 73129 162 1 0 65 85 75 0.583 4.68 2.02 Intr + 80768 80853 86 1 2 68 63 64 0.737 -0.40 2.03 Intr + 82515 82679 165 0 0 103 41 106 0.301 5.55 2.04 Intr + 91029 91213 185 0 2 46 50 120 0.005 2.51 2.05 Intr + 91600 91684 85 2 1 72 100 72 0.022 4.76 2.06 Term + 91855 92197 343 1 1 -28 38 272 0.021 3.60 2.07 PlyA + 92424 92429 6 1.05 3.00 Prom + 92958 92997 40 -7.05 3.01 Init + 94403 94481 79 1 1 40 52 126 0.197 5.47 3.02 Intr + 100580 100683 104 0 2 -3 40 173 0.246 2.37 3.03 Intr + 102531 102617 87 2 0 49 101 92 0.307 5.85 3.04 Intr + 136398 136742 345 2 0 36 42 258 0.492 10.66 3.05 Intr + 136829 137317 489 1 0 -10 -4 546 0.231 27.87 3.06 Intr + 137560 137687 128 0 2 65 -5 152 0.283 2.16 3.07 Intr + 137842 138070 229 1 1 76 16 202 0.296 8.75 3.08 Intr + 138376 138557 182 0 2 93 76 132 0.452 10.24 3.09 Intr + 138587 138674 88 2 1 50 66 96 0.380 2.65 3.10 Intr + 139004 139161 158 1 2 60 80 48 0.328 -0.91 3.11 Intr + 142076 142150 75 2 0 74 93 47 0.033 1.51 3.12 Intr + 150123 150235 113 0 2 46 81 95 0.082 3.70 3.13 Intr + 150912 151092 181 1 1 37 72 52 0.054 -3.50 3.14 Term + 151995 152232 238 0 1 42 50 215 0.198 7.76 3.15 PlyA + 153062 153067 6 1.05 4.13 PlyA - 155819 155814 6 1.05 4.12 Term - 159047 158907 141 2 0 79 44 123 0.613 3.95 4.11 Intr - 164710 164431 280 0 1 58 92 168 0.517 10.76 4.10 Intr - 165949 165846 104 1 2 57 26 86 0.056 -2.65 4.09 Intr - 172339 172144 196 0 1 50 101 67 0.022 2.70 4.08 Intr - 178325 178175 151 0 1 47 98 43 0.040 -0.50 4.07 Intr - 179752 179435 318 2 0 61 18 216 0.104 6.91 4.06 Intr - 180539 180359 181 2 1 5 64 167 0.853 4.52 4.05 Intr - 182986 182820 167 2 2 66 75 124 0.614 7.66 4.04 Intr - 186110 186041 70 2 1 87 61 60 0.564 1.04 4.03 Intr - 186291 186164 128 1 2 19 109 157 0.376 10.38 4.02 Intr - 189098 188942 157 2 1 19 49 161 0.335 3.96 4.01 Init - 192341 192297 45 2 0 55 52 69 0.169 0.74 4.00 Prom - 193326 193287 40 -8.05 5.00 Prom + 193858 193897 40 -9.35 5.01 Init + 194449 194643 195 0 0 60 19 239 0.348 13.18 5.02 Term + 198188 198394 207 1 0 106 44 134 0.670 7.16 5.03 PlyA + 199536 199541 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 59061 60857 1797 2 0 70 38 479 0.844 35.25 S.002 Sngl + 63933 64583 651 2 0 80 36 225 0.898 12.42 S.003 Sngl + 108418 108606 189 0 0 72 42 206 0.919 9.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:5922_206856|GENSCAN_predicted_peptide_1|860_aa XIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITEAEIVAIINSL PTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGVLPNSFYEASIILIPKPGRDTT KKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHQDQVGFIPGMQGWFNIRKSINVIQH INRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNG QKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMI VYLENPIVSAQNLPKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQTMSELPFTIASKR IKYLGIQLTRDVKDLFKENYKSLLKEIKEDTKKWKNMPCSWVGRINIVKMAILPKVIYRF NAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYDKATVTK TAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICR KLKLDPFLIPYTKINSRWIKDLNVRPQTIKTLEENLGFTIQDIGMGKDFMSKTPREALLC IVGENTKWSKDPSALSTPPTSCSRPQRRNKSVSHGSYPHPLLFTMNDRVNSVKTTILPKA TYKFNAIPIKIPPSFFTELEKTRLKFTWNQKRAHIAKARLSKKNKSRGITLLDFKLYYKA IVTKTAWYWYKNRHIDQWNRIENPEIKPNTFSQLIFDKASKNIKWGKDTLFNKWCWYNWQ ATCRRMQLDPHLSPYKQINSRWFTDLNLRPETIKILEDKIGKTLLDIGLGKDFTIKNPKA NTTKINRWDLIKLKAFCTSK >gi568815592r:5922_206856|GENSCAN_predicted_CDS_1|2583_bp naaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctggacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacagaagctgaaattgtggcaataatcaatagctta ccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggagtcctc cctaactcattttatgaggccagcatcattctgataccaaagccaggcagagacacaaca aaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa atactggcaaaacgaatccagcagcacatcaaaaagcttatccaccaagatcaagtgggc ttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcat ataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatgggacg tatttcaaaataataagagctatctatgacaaacccacagccaatatcatactgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaattaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgatt gtatatctagaaaaccccattgtctcagcccaaaatcttcctaagctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaac aacagacaaacagagagccaaaccatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactac aaatcactgctcaaggaaataaaagaggatacaaagaaatggaagaacatgccatgctca tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattc aatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactacttta aagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaaaagaac aaagctggaggcatcacgctacctgacttcaaactatacgacaaggctacagtaaccaaa acagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccctca gaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaa tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaga aagctgaaactggatcccttccttataccttatacaaaaatcaattcaagatggattaaa gacttaaacgttagacctcaaaccataaaaaccctagaagaaaacctaggctttaccatt caggacataggcatgggcaaggacttcatgtctaaaacaccgagagaggcactcttatgc attgttggtgagaatacaaaatggagcaaagaccccagtgctttatccacacctccaaca agctgcagtcgaccacaaagaagaaacaagtctgtctcccatgggtcctacccacacccc ctgctgttcaccatgaatgatagagtcaacagtgtgaaaacgaccatactgccaaaagca acctacaaattcaatgcaattcccatcaaaataccaccatcattcttcacagaactagaa aaaacaaggctaaaattcacatggaaccaaaaaagagcccacatagccaaagcaagacta agcaaaaagaataaatctagaggcatcacattactcgacttcaaactatactataaggcc atagtcaccaaaacagcatggtactggtataaaaataggcatatagaccaatggaataga atagagaacccagaaataaagccaaatactttcagccaactgatctttgacaaagcaagc aaaaacataaagtggggaaaggacaccctattcaacaaatggtgctggtataattggcaa gccacatgtagaagaatgcaactggatcctcatctctcaccttataaacaaatcaactca agatggttcacagacttaaatctaagacctgaaaccataaaaattctagaagataagatt ggaaaaacccttctagacattggcttaggcaaagacttcacaatcaagaacccaaaagca aacacaacaaagataaatagatgggacttaattaaactgaaagccttctgcacatcaaaa taa >gi568815592r:5922_206856|GENSCAN_predicted_peptide_2|341_aa MILHCQVIAPKIWFSSTLSDYRKPETRLGAKDAKMNETSSLPSKSLLSSGRVTHAWSQHG ERLGDAVKQRHWEQLQTFSQTRKDMDEAGSHHPQQTNTGTENQTPHVLSRKRELNNESKH MDTWRGTTHTRASQGDRGAPRGKGGCGHSFSRLKASLKSLMALKRAADLPAQYSSSDKGQ TASSSGSLTPVYPDWETPPRSQLLTSKGTKENWTENEFDELREVGFRSMHKDQALIRSSG RKISETEYQLNEINQEDKIREKRMKRNEQSLQEIWDYVKRPNLRLIAVPESDGENGTKLE NTLRNIIQENFPNLARQANIQIQKYGEHYKDTPQEKQPQDT >gi568815592r:5922_206856|GENSCAN_predicted_CDS_2|1026_bp atgatactgcattgtcaagtcattgctccaaaaatatggtttagctcaacactgagtgac tataggaaaccagaaaccaggctgggcgctaaagatgcaaagatgaatgagacatcatct ctgccgtccaaaagcttactgtctagtgggagagttacacacgcctggagtcaacatggg gagaggcttggagatgctgtgaagcaaagacactgggaacagctgcagacattttcccag accaggaaggacatggatgaagctggaagccatcaccctcagcaaactaacacaggaaca gaaaaccaaacaccacatgttctcagtcgtaagagggagttgaacaatgagagcaaacac atggatacatggaggggaacaacacacaccagggcctctcagggggacaggggagcacct aggggaaagggtggctgtgggcacagcttcagcagacttaaagcatctttgaaaagcctg atggctctgaagagagcagcagatctcccagcacagtattcgagctctgataagggtcag actgcctcctcaagtgggtctctgacccccgtgtatcctgactgggagacacctcccaga tcacaactcctcaccagcaagggaacaaaagaaaactggacagagaatgagtttgacgaa ttgagagaagtaggtttcagaagcatgcacaaggatcaagcactgattcggtcaagcgga agaaagatatcagagactgaatatcaacttaatgaaataaatcaagaagacaagattaga gaaaaaagaatgaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaacga ccaaatctacgtttgattgctgtacctgaaagtgatggggagaatggaaccaagttagaa aacactcttcggaatattatccaggagaacttccctaacctagcaaggcaggccaatatt caaattcagaaatatggagaacattacaaagacactcctcaagaaaagcaaccccaagac acatag >gi568815592r:5922_206856|GENSCAN_predicted_peptide_3|831_aa MLLSIDAEKAFDKIQQPFMLKTLNKLGHGYEQHHLHTTNDVDEEDLSDAASKGDDFALSE QSQDAHFLQPEAYGLGEGAETATGTAHQGNHVRVEECGRSLCGCVPLVLHPLPDPSLQPH EAQQPASHSVACNQRKQPAKLPAVAHERPPGGTGSVDPGRPPGATCPESPGPATPHTLGV VEPGKSSPPTMEEEPWAPQGSPCWTAQSLSALRKEQDSSSEKDGRSPNKWDKDHIWWPMS GAHDLQQAAPGPGGAHQGHPNQDNRTVSQMLSERWYTLGPSEMQKYDLAFQVKVAHLQQG PKEVQLRGQAHKPGASSSVTRARGSGAYQRRALPLPLGRPLNSCQLQPKHSRARMPRSSF CGAERLHTHMASEDTASDEEHTVIHEEEGVMMSLLMMALAPPTPISSSRSGFWQSMVTPC PPPTHTRMLPPQPWRPPPSYWAQEPSKPRSLVNAAERAPYGPNPWGWGPRDAFQGGLFPP NGSCHLLVEKPRTGSGNRRPRRSCPLHCTCPGPVPALIMQLFQAHCFFLSTRPQPPSRPT MHTSSPPSPVGVLTAPHLAQTLDAALAAPPLPLPESHPCLTPPQPAQALAQTWLPCHCLC PRVGALTAWLEGDTPALPQHLGVSITTTGRAANRKDFVHRRCCPELSQSEAVPPSTHFLS AECLQQERGSSSRDPTLHRGQQHLEGQMCKGQVSGFAAIALHVGFRLMMLNSGQSSSPAW APLPALPYSAGSQLSSLRHLAGSRLLNYQVSSRATVNYTLWHGFARHVLEASSHAGHALP HLGKQRMRKISIKFTQNDLQQDLENAQFHKIPVSKSFQCTALASGSCEDRP >gi568815592r:5922_206856|GENSCAN_predicted_CDS_3|2496_bp atgcttctctcaatagatgcagaaaaggccttcgacaaaattcagcagcccttcatgcta aaaaccctcaataaactaggccatggctatgagcagcaccatctccacaccaccaatgac gtggatgaagaagatttgagcgatgcagcctccaaaggagatgactttgcgctttctgaa cagtctcaagatgcacactttttacaaccagaagcctatggactgggtgagggagcagaa acagccacaggtactgcccatcagggtaatcatgttcgtgtggaggaatgtggaaggtca ctctgcggctgtgttcccctggtactccatccccttcctgacccctccctgcagccacac gaggcccagcaacctgccagtcactcagtggcctgcaaccagagaaaacaacctgccaag ttgccagctgttgctcatgagcgtccaccaggtgggacagggagtgttgaccctgggcgg ccccctggagccacctgccctgaaagcccagggcccgcaaccccacacactttgggggtg gtggaacctggtaaaagctcacctcccaccatggaggaggagccctgggcccctcagggg agtccctgctggacagcccagtccctcagtgccctgcgcaaggaacaggactcatcttct gagaaggatggacgcagccccaacaaatgggacaaggaccacatctggtggcccatgagt ggcgctcatgatcttcagcaagcggcaccaggccctggcggggcgcaccagggtcacccc aaccaggataaccggaccgtcagccagatgctgagcgagcggtggtacaccctggggccc agtgagatgcagaaatacgacctggccttccaggtgaaggtggcccacttgcaacaagga ccgaaagaagtccagctcagaggccaagcccacaagccaggggctagcagcagtgtaaca agggctcgtgggagcggagcatatcagagacgggcactgccactgcccctggggcgtcct ctgaactcctgtcagttgcagcccaaacactccagagctcggatgccaaggagcagcttc tgtggggcagaacggctgcacacacacatggccagtgaggacacagcgagtgacgaggag cacacggtcatccatgaggaggagggggtgatgatgtcattgctgatgatggctttagca ccaccgacaccgatctcaagttcaaggagtggcttttggcagtctatggtcacaccctgt cctcctcctacacatactcggatgcttcctcctcaaccttggcgcccacctccttcttac tgggcccaggagccttcaaagcccaggagtctggtcaacgcagcagagcgggccccctac ggccccaacccctggggatgggggcccagggacgccttccaaggtggcctgtttcctccc aatggatcctgccaccttctggtggagaagccgaggacaggctcagggaaccggagaccg agaaggagctgtcctcttcactgcacgtgccctggaccagtgccggccctgatcatgcag ctcttccaggcccactgcttcttcctgtccactaggccacagccgccctccaggcccact atgcacacatcttcccctccaagccctgtgggggtcctgaccgcacctcacctggctcag actcttgacgctgccctggctgccccaccactgcctctgcccgagagtcacccctgcctg accccacctcaacctgctcaggctctggcacaaacctggctgccctgccactgcctctgc cccagagttggggccttgacagcctggttggaaggggacaccccagccctgcctcaacac ctgggggtctccataactaccacaggcagggctgcaaacagaaaggattttgttcaccgt cgatgctgccctgagttgtcccaaagcgaggccgtgcccccaagcacacactttctgagt gccgagtgtctccagcaggagaggggctctagttcccgggatcccaccctccacagagga cagcagcaccttgagggacagatgtgcaaagggcaagtcagcggttttgcagcaatagca ttgcatgtgggttttaggctgatgatgctcaacagtggacaatcctcttcacctgcctgg gccccactgcctgcccttccatacagtgcagggtcacagctgagctctctgagacacctc gcaggatctcggcttctgaactatcaagtgtcttccagggctacagtgaattacacactc tggcatgggtttgcacgccacgtgctggaagccagcagtcatgcaggccatgctctccca cacttaggaaaacagcgcatgagaaagatctccatcaagttcacacagaatgacctccag caagacctggaaaatgctcagttccacaaaatacctgtgtccaaatcctttcagtgcaca gctctggcatcaggttcctgtgaggacagaccctga >gi568815592r:5922_206856|GENSCAN_predicted_peptide_4|645_aa MADNAGSFNAGNAQMTLTALIEGSNTQNKAPSTIIQGQWIASPPTPAIRERLSYASAPHI TEQECGSGVSPGADWLEAGPPPMLVFELADAGDVDLQYLYEPSVTQVQVSAPAATAPVAS PAPARNPGLHYEPTQSCGERSRGTTWRLKFGWSPQLSGTLLREAFPSLPVVYHTLMCVDV SGTYCLRIHGLLHKQQSLLAVTVGSTLGSDTPVFSGLQLQQGPNVSESTLCLEMDRTLKY DAPKIVQKTEAIQSTCYCFLDVMSSRTSVSVPLSPSSLTGDTAGSDECRVCSVKRGAWRR RCAYSKPEASLLFAWRIMGRGQCYLTSQSYQLDLRNGPAKGHSGPCSCGMLSKGQGSPST EPTWKLQGGALVSNQVHLPECSAGQRSMGSSLEEQKKSGTKFLRRHPASTPCYPRPTPTA PQYYAYPDPTHTSLLPVPCPTSTPLLVCPELVPFGAFLVSLTSRMKLLTLAFHIFGYLYG STPPYWYQLTVLVHFHAADKDIPETGLMNLQQSWKLYLQKCLYLAKKNQVIQQELLSMKE VQQKCKKLEEVNKILEQEVVNLKTHEKNMVEFGDVEECKLQLEERAGQEIEKLEEINLQR ACLSAKKVRAEGELEPRRHHNHTLGFTEGPWGLMSVITGAYDLIR >gi568815592r:5922_206856|GENSCAN_predicted_CDS_4|1938_bp atggctgataatgcgggatccttcaacgctggaaacgcccagatgaccctgaccgccctt atagaaggaagcaatacacaaaacaaagcaccatcaaccattatccagggccagtggatc gcttctcctcccacgccagccatccgagaaaggctcagctatgcatctgccccacacatc acagagcaagagtgtggttctggcgtgtctcctggagctgactggctggaggctggtcct ccacccatgctggtctttgagctggccgatgctggtgatgtggaccttcaatacttgtat gaaccatccgtcacacaagtacaggtctcggccccagcggcaacagctcctgtggcttct ccagccccagcacgcaatccggggcttcattatgagccaacccagagctgtggggaaagg agtcggggaaccacctggaggctgaaattcggatggagccctcagctctcaggcaccctc cttcgtgaagccttcccaagtctccctgtggtttatcacacgcttatgtgtgttgatgta tcaggcacatattgcctaagaattcacggcctcctgcacaagcaacagtcgctgcttgct gttactgtgggctccacgttgggatcagacacacctgtcttcagtggcttgcagttgcag caaggaccgaatgtgagtgaaagcactttgtgcctggaaatggacagaacattgaaatat gatgccccgaaaatcgtccagaagacagaagctattcagtcaacctgctactgtttcctg gatgttatgtccagccgcacttctgtgtcagttcccctcagtcccagttccctcacaggt gacacagcaggctcagatgaatgccgtgtgtgcagcgtgaagcgtggagcttggagaagg cgctgtgcttacagcaagccagaagccagcctgctgtttgcctggaggatcatggggagg ggtcaatgttacctcacctctcagagttatcagctggacctgagaaatgggccagctaag ggacactcagggccttgcagttgtggcatgctcagcaagggacagggatctccatctact gagcccacctggaagctgcagggtggagctctagtgtccaaccaggtccatctcccagag tgctcggcagggcagaggagcatgggcagcagtctggaagagcaaaagaaatcgggcaca aaattcctaaggagacaccctgccagcaccccatgctacccacggcccactccaacagca cctcaatattacgcataccctgatcctacccacacctcacttctacccgtgccctgcccc accagcaccccactcctagtgtgtccggaattggttccttttggtgcgttcttggtctcg ctgacttcaagaatgaagctattgacccttgcgttccacatttttgggtatctttacggc agcacaccaccctactggtaccaacttactgtgttagtccattttcatgctgctgataaa gacatacctgagactggtttgatgaacttacagcagagctggaaactatatcttcaaaaa tgtctatatttggctaaaaagaatcaagttattcaacaggagttattatctatgaaagaa gtacaacagaaatgtaaaaaacttgaggaggttaacaaaattttggaacaagaagtagtg aatcttaagacacatgaaaaaaatatggtagaatttggtgacgtagaagaatgtaaattg cagttggaagaaagagcaggacaggaaatagaaaaattagaagaaatcaatttacagcga gcctgccttagtgccaaaaaggtcagagcagaaggggagctggagccccgcagacatcac aatcacactcttggatttactgaaggcccatggggactcatgtctgtgattactggagct tatgatctaatcaggtga >gi568815592r:5922_206856|GENSCAN_predicted_peptide_5|133_aa MWGWRSQSQPRNSNCEPSEATAPHAAGNSEEATADEEEEGEERKEVTYSHEEKHRGHSGP TGGGRLQSLAHAWHTAHCSCSMSFVDWMFACGKWKQGPDGWAEGLSYSNMHWWSCLVFGH QLNPRALNLWSVG >gi568815592r:5922_206856|GENSCAN_predicted_CDS_5|402_bp atgtggggctggaggtcacagagccagccacggaacagcaactgtgagccatcagaagcc acagcaccccatgcagcaggaaattctgaggaagcaacagctgatgaggaagaggaagga gaagagagaaaagaggtaacttattcccatgaggagaagcatcgaggacactcggggccc actggtggtggcaggctccagtccctggcacatgcttggcacacagctcactgcagctgc tcaatgagttttgtggactggatgtttgcttgtgggaagtggaagcagggaccagatgga tgggcagaaggattgtcctattcaaatatgcactggtggtcctgtcttgtgtttgggcac cagttgaatccaagagctctcaacctgtggagtgtcggatga