GENSCAN 1.0    Date run: 3-Nov-116    Time: 03:52:30

Sequence gi568815584r:103459824_103661164 : 201341 bp : 50.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2582   2638   57  1  0  122  113    36 0.693   8.58
 1.02 Intr +   5734   5898  165  0  0   98   -5   170 0.305   8.86
 1.03 Intr +   6149   6268  120  1  0   94   74    73 0.933   7.19
 1.04 Intr +   8210   8363  154  1  1   82  115     6 0.832   2.35
 1.05 Intr +  15170  15387  218  0  2   33   84   108 0.549   3.12
 1.06 Intr +  20564  20667  104  1  2   51   91    49 0.227   0.47
 1.07 Intr +  31954  32211  258  1  0   46  107   101 0.187   4.38
 1.08 Term +  43059  43404  346  0  1   65   40   566 0.075  43.47
 1.09 PlyA +  44148  44153    6                               1.05

 2.09 PlyA -  44408  44403    6                               1.05
 2.08 Term -  60219  60041  179  1  2  104   48   411 0.999  36.55
 2.07 Intr -  60488  60299  190  2  1  134   99   292 0.999  34.06
 2.06 Intr -  60769  60646  124  0  1   39   50   266 0.998  18.49
 2.05 Intr -  61611  61440  172  1  1   40   78   469 0.997  40.10
 2.04 Intr -  62127  61995  133  0  1  103   97   257 0.999  28.32
 2.03 Intr -  62354  62200  155  0  2  100   80   463 0.999  46.49
 2.02 Intr -  62682  62478  205  0  1  115   75   553 0.999  55.47
 2.01 Init -  62831  62712  120  2  0   98  -13   192 0.886   8.29
 2.00 Prom -  65437  65398   40                              -7.76

 3.00 Prom +  66032  66071   40                             -11.72
 3.01 Init +  66419  66515   97  1  1   80   76   123 0.722  10.77
 3.02 Intr +  70127  70486  360  0  0   98  105   454 0.748  43.29
 3.03 Intr +  72759  73025  267  1  0  101   52   439 0.509  39.10
 3.04 Term +  74727  74998  272  1  2  141   50   458 0.999  42.95
 3.05 PlyA +  76202  76207    6                               1.05

 4.00 Prom +  81151  81190   40                              -6.96
 4.01 Init +  81626  81743  118  1  1   76   49    78 0.280   1.07
 4.02 Intr +  84009  84095   87  1  0   96   71    66 0.540   5.64
 4.03 Intr +  88437  88775  339  1  0   93  103   118 0.146   9.25
 4.04 Term +  93198  93400  203  1  2   61   47   144 0.963   5.15
 4.05 PlyA +  94659  94664    6                               1.05

 5.03 PlyA -  95297  95292    6                               1.05
 5.02 Term - 101369  99998 1372  1  1   94   44  1137 0.996 100.73
 5.01 Init - 103180 103044  137  1  2  114   60   203 0.985  17.71
 5.00 Prom - 108157 108118   40                              -4.86

 6.00 Prom + 108611 108650   40                              -5.26
 6.01 Init + 115260 115344   85  2  1   77   31   100 0.052   2.87
 6.02 Intr + 127451 127541   91  0  1   58   96    70 0.122   3.85
 6.03 Term + 130358 130463  106  2  1  111   49   162 0.999  12.48
 6.04 PlyA + 130685 130690    6                               1.05

 7.05 PlyA - 131238 131233    6                               1.05
 7.04 Term - 140963 140827  137  0  2   50   44    94 0.459  -0.62
 7.03 Intr - 145209 145125   85  0  1   87  111    37 0.807   5.29
 7.02 Intr - 147870 147708  163  2  1   91   41   139 0.790   9.48
 7.01 Init - 155096 155089    8  2  2  114   91     0 0.449   3.40
 7.00 Prom - 155940 155901   40                              -7.96

 8.00 Prom + 158125 158164   40                              -1.36
 8.01 Init + 161242 161277   36  0  0   16  105    52 0.525  -0.29
 8.02 Term + 165478 165600  123  0  0  140   42    83 0.865   7.28
 8.03 PlyA + 167534 167539    6                               1.05

 9.00 Prom + 167722 167761   40                              -7.86
 9.01 Init + 169314 169671  358  2  1   93  101   221 0.945  19.17
 9.02 Intr + 193220 193301   82  0  1   89  -29    81 0.032  -4.80
 9.03 Intr + 194741 195002  262  2  1  113   96   249 0.757  25.69
 9.04 Intr + 197723 197953  231  1  0   67   75   297 0.675  24.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  40340  40231  110  0  2   82   47   115 0.916   5.47
S.002 Sngl +  43048  43404  357  0  0   42   40   576 0.917  44.26
S.003 Init + 127480 127541   62  0  2   84   96    33 0.853   4.42

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_1|473_aa
IVSAVQYCHQKRIVHRDLKAENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPYAAP
ELFQGKKYDGPEVDELRERVLRGKYRIPFYMSTDCENLLKRFLVLNPIKRGTLELDASDS
SSSSNLSLAKVRPSSDLNNSTGQSPHHKVQRSVSSSQKQRRYSDHAGPAIPSVVAYPKRS
QTSTADSDLKEDGISSRKSSGSAVGGKGIAPASPMLGNASNPNKADIPERKKSSTVPSSN
TASGGMTRRNTYVCSERTTADRHSVIQNGKENSTIPDQRTPVASTHSISSAATPDRIRFP
RGTASRSTFHGQPRERRTATYNGPPASPSLSHEATPLSQTRSRGSTNLFSKLTSKLTRSR
NVSAEQKDENKEAKPRSLRFTWSMKTTSSMDPGDMMREIRKVLDANNCDYEQRERFLLFC
VHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIASKIANELKL

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_1|1422_bp
attgtgtctgcagttcaatactgccatcagaaacggatcgtacatcgagacctcaaggct
gaaaatctattgttagatgccgatatgaacattaaaatagcagatttcggttttagcaat
gaatttactgttggcggtaaactcgacacgttttgtggcagtcctccatacgcagcacct
gagctcttccagggcaagaaatatgacgggccagaagtggatgaactgagagagagagta
ttaagagggaaatacagaattcccttctacatgtctacagactgtgaaaaccttctcaaa
cgtttcctggtgctaaatccaattaaacgcggcactctagagctggatgctagtgattcc
agttctagcagcaatctttcacttgctaaggttaggccgagcagtgatctcaacaacagt
actggccagtctcctcaccacaaagtgcagagaagtgtttcttcaagccaaaagcaaaga
cgctacagtgaccatgctggaccagctattccttctgttgtggcgtatccgaaaaggagt
cagaccagcactgcagatagtgacctcaaagaagatggaatttcctcccggaaatcaagt
ggcagtgctgttggaggaaagggaattgctccagccagtcccatgcttgggaatgcaagt
aatcctaataaggcggatattcctgaacgcaagaaaagctccactgtccctagtagtaac
acagcatctggtggaatgacacgacgaaatacttatgtttgcagtgagagaactacagct
gatagacactcagtgattcagaatggcaaagaaaacagcactattcctgatcagagaact
ccagttgcttcaacacacagtatcagtagtgcagccaccccagatcgaatccgcttccca
agaggcactgccagtcgtagcactttccacggccagccccgggaacggcgaaccgcaaca
tataatggccctcctgcctctcccagcctgtcccatgaagccacaccattgtcccagact
cgaagccgaggctccactaatctctttagtaaattaacttcaaaactcacaaggagtcgc
aatgtatctgctgagcaaaaagatgaaaacaaagaagcaaagcctcgatccctacgcttc
acctggagcatgaaaaccactagttcaatggatcccggggacatgatgcgggaaatccgc
aaagtgttggacgccaataactgcgactatgagcagagggagcgcttcttgctcttctgc
gtccacggagatgggcacgcggagaacctcgtgcagtgggaaatggaagtgtgcaagctg
ccaagactgtctctgaacggggtccggtttaagcggatatcggggacatccatagccttc
aaaaatattgcttccaaaattgccaatgagctaaagctgtaa

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_2|425_aa
MALPARAADPADLGRVPGGAGGPGGGGLSGTREPGNPGVPPAAAMPFSNSHNALKLRFPA
EDEFPDLSAHNNHMAKVLTPELYAELRAKSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVA
GDEESYEVFKDLFDPIIEDRHGGYKPSDEHKTDLNPDNLQGGDDLDPNYVLSSRVRTGRS
IRGFCLPPHCSRGERRAIEKLAVEALSSLDGDLAGRYYALKSMTEAEQQQLIDDHFLFDK
PVSPLLLASGMARDWPDARGIWHNDNKTFLVWVNEEDHLRVISMQKGGNMKEVFTRFCTG
LTQIETLFKSKDYEFMWNPHLGYILTCPSNLGTGLRAGVHIKLPNLGKHEKFSEVLKRLR
LQKRGTGGVDTAAVGGVFDVSNADRLGFSEVELVQMVVDGVKLLIEMEQRLEQGQAIDDL
MPAQK

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_2|1278_bp
atggcgctccccgcgcgcgctgcggaccccgctgaccttggccgcgtcccggggggcgcc
ggggggcccggcggcgggggcctgagtggtacgcgggagcccgggaaccccggcgtgccg
cccgccgccgccatgcccttctccaacagccacaacgcactgaagctgcgcttcccggcc
gaggacgagttccccgacctgagcgcccacaacaaccacatggccaaggtgctgaccccc
gagctgtacgcggagctgcgcgccaagagcacgccgagcggcttcacgctggacgacgtc
atccagacaggcgtggacaacccgggccacccgtacatcatgaccgtgggctgcgtggcg
ggcgacgaggagtcctacgaagtgttcaaggatctcttcgaccccatcatcgaggaccgg
cacggcggctacaagcccagcgatgagcacaagaccgacctcaaccccgacaacctgcag
ggcggcgacgacctggaccccaactacgtgctgagctcgcgggtgcgcacgggccgcagc
atccgtggcttctgcctccccccgcactgcagccgcggggagcgccgcgccatcgagaag
ctcgcggtggaagccctgtccagcctggacggcgacctggcgggccgatactacgcgctc
aagagcatgacggaggcggagcagcagcagctcatcgacgaccacttcctcttcgacaag
cccgtgtcgcccctgctgctggcctcgggcatggcccgcgactggcccgacgcccgcggt
atctggcacaatgacaataagaccttcctggtgtgggtcaacgaggaggaccacctgcgg
gtcatctccatgcagaaggggggcaacatgaaggaggtgttcacccgcttctgcaccggc
ctcacccagattgaaactctcttcaagtctaaggactatgagttcatgtggaaccctcac
ctgggctacatcctcacctgcccatccaacctgggcaccgggctgcgggcaggtgtgcat
atcaagctgcccaacctgggcaagcatgagaagttctcggaggtgcttaagcggctgcga
cttcagaagcgaggcacaggcggtgtggacacggctgcggtgggcggggtcttcgacgtc
tccaacgctgaccgcctgggcttctcagaggtggagctggtgcagatggtggtggacgga
gtgaagctgctcatcgagatggagcagcggctggagcagggccaggccatcgacgacctc
atgcctgcccagaaatga

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_3|331_aa
MTVYGGQAAVTGTAAADNSGEVALKAFRPKCQGPWPLPDISTMSFVAYEELIKEGDTAIL
SLGHGAMVAVRVQRGAQTQTRHGVLRHSVDLIGRPFGSKVTCGRGGWVYVLHPTPELWTL
NLPHRTQILYSTDIALITMMLELRPGSVVCESGTGSGSVSHAIIRTIAPTGHLHTVEFHQ
QRAEKAREEFQEHRVGRWVTVRTQDVCRSGFGVSHVADAVFLDIPSPWEAVGHAWDALKV
EGGRFCSFSPCIEQVQRTCQALAARGFSELSTLEVLPQVYNVRTVSLPPPDLGTGTDGPA
GSDTSPFRSGTPMKEAVGHTGYLTFATKTPG

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_3|996_bp
atgacagtgtacgggggccaggcagctgtgacggggacagcagcagcagataacagtggg
gaggtggccctcaaggccttcagacctaaatgtcagggtccttggcccttgccagacatt
agcaccatgagcttcgtggcatacgaggagctgatcaaggagggtgacacggccatcctg
tcactgggccatggtgcaatggtggcggtgcgtgtgcagcgtggggcacagacccagacc
cggcatggtgtcctgcggcactcagttgaccttatcggccgccccttcggctccaaggtg
acgtgcggccgaggtggctgggtgtatgtgctgcaccccacgcccgagctctggacgctg
aacctgccgcaccgcacgcagatcctctactccacagacatcgccctcatcaccatgatg
ttggagcttcggcccggctctgtggtctgtgagtctggcaccggcagtggctctgtgtcc
cacgccatcatccgcaccattgcacccacgggtcacctgcacacggtggagttccaccag
cagcgggcagagaaggcccgggaggagttccaggagcaccgtgtgggccgctgggtgact
gtgcgcacccaggacgtgtgccgcagtggctttggcgtgagccacgtggccgacgccgtc
ttcctggacatcccatcaccctgggaggccgtgggccacgcctgggacgccctcaaggtc
gaaggcgggcgcttctgctccttctcaccgtgcatcgagcaggtgcaacgcacatgccag
gcgctggcagcgcgcggcttctcagagctgagcaccctggaggtgctgccacaggtctac
aacgtgcgcactgtcagcctgccaccgcccgacctgggcacaggcacagatggccctgcc
ggctccgacaccagccccttccgcagcggcacgcccatgaaggaggccgtgggccacacc
ggctacctgaccttcgccaccaagaccccaggctag

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_4|248_aa
MAPAAPSSGLLTSSWPLSRGRRHSPPLLPLTVPAWLVPPGTGHVLHPQGAEHIRPDHVGF
LQEEAAGPGALSKASKPPLACGAQEALAVLALSPTRCLKFFQGLQCARPWLWALLLKSLH
FPLHPHVQRNAGCARGAELEGPPGESEHPSFLISFWVSLSSLAISKGMPLHPKALPLCQA
PAGTRDGEAVCGTESLPNGVTDPWQLRKDQRIRSGEIASLHFNLTIERYIFAVVIHKWKT
YGQNKALV

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_4|747_bp
atggctcctgcagccccgagctctggcctgctcacctcttcgtggcctctgtctcggggc
cgccggcactccccgcctctcctgcctctgactgtgcctgcctggctggtgcccccaggc
actgggcacgtgctccatccccaaggcgctgagcacatcaggccagaccacgtgggcttc
ctgcaggaggaggcagcaggcccaggggcactcagcaaagcctccaaacctcccttggct
tgtggggcccaggaagccctggcagtcctggcgctgtcccctacccgctgcctgaagttc
ttccagggcctccagtgtgcccggccctggctctgggctctgcttttgaaaagtcttcat
tttcctttacatccccacgtgcagagaaatgccggctgcgcccgaggagcagagttggag
gggcccccgggagaatctgaacatcccagtttcctgatctctttctgggtctcactttca
tctctggccatcagcaagggcatgcccctccacccgaaggccctgcctctgtgccaggct
ccagcggggaccagagatggagaggcagtgtgcggcactgagtccttaccaaatggggtc
acagacccctggcagctccggaaagaccaaagaatccgaagcggggaaatagcttcactt
cacttcaaccttactatagaaagatacatttttgcagttgtaattcacaagtggaagacg
tacggacagaataaagcactcgtttaa

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_5|502_aa
MAPRPLAPAAHGSIATARGAALRQGREARGSAAARPTVRFPSGATGACETEHNKSMDMGN
QHPSISRLQEIQKEVKSVEQQVIGFSGLSDDKNYKKLERILTKQLFEIDSVDTEGKGDIQ
QARKRAAQETERLLKELEQNANHPHRIEIQNIFEEAQSLVREKIVPFYNGGNCVTDEFEE
GIQDIILRLTHVKTGGKISLRKARYHTLTKICAVQEIIEDCMKKQPSLPLSEDAHPSVAK
INFVMCEVNKARGVLIALLMGVNNNETCRHLSCVLSGLIADLDALDVCGRTEIRNYRREV
VEDINKLLKYLDLEEEADTTKAFDLRQNHSILKIEKVLKRMREIKNELLQAQNPSELYLS
SKTELQGLIGQLDEVSLEKNPCIREARRRAVIEVQTLITYIDLKEALEKRKLFACEEHPS
HKAVWNVLGNLSEIQGEVLSFDGNRTDKNYIRLEELLTKQLLALDAVDPQGEEKCKAARK
QAVRLAQNILSYLDLKSDEWEY

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_5|1509_bp
atggccccacgccccctggctcccgcggcgcacggcagcattgcgacggcgcgaggggcc
gctttacggcagggtcgcgaagcccgaggaagcgcggcggcgcggccgaccgtgcgcttt
cccagcggtgcgacgggtgcttgtgaaactgaacacaacaaaagtatggatatgggaaac
caacatccttctattagtaggcttcaggaaatccaaaaggaagtaaaaagtgtagaacag
caagttatcggcttcagtggtctgtcagatgacaagaattacaagaaactggagaggatt
ctaacaaaacagctttttgaaatagactctgtagatactgaaggaaaaggagatattcag
caagctaggaagcgggcagcacaggagacagaacgtcttctcaaagagttggagcagaat
gcaaaccacccacaccggattgaaatacagaacatttttgaggaagcccagtccctcgtg
agagagaaaattgtgccattttataatggaggcaactgcgtaactgatgagtttgaagaa
ggcatccaagatatcattctgaggctgacacatgttaaaactggaggaaaaatctccttg
cggaaagcaaggtatcacactttaaccaaaatctgtgcggtgcaagagataatcgaagac
tgcatgaaaaagcagccttccctgccgctttccgaggatgcacatccttccgttgccaaa
atcaacttcgtgatgtgtgaggtgaacaaggcccgaggggtcctgattgcacttctgatg
ggtgtgaacaacaatgagacctgcaggcacttatcctgtgtgctctcggggctgatcgct
gacctggatgctctagatgtgtgcggccggacagaaatcagaaattatcggagggaggta
gtagaagatatcaacaaattattgaaatatctggatttggaagaggaagcagacacaact
aaagcatttgacctgagacagaatcattccattttaaaaatagaaaaggtcctcaagaga
atgagagaaataaaaaatgaacttctccaagcacaaaacccttctgaattgtacctgagc
tccaaaacagaattgcagggtttaattggacagttggatgaggtaagtcttgaaaaaaac
ccctgcatccgggaagccaggagaagagcagtgatcgaggtgcaaactctgatcacatat
attgacttgaaggaggcccttgagaaaagaaagctgtttgcttgtgaggagcacccatcc
cataaagccgtctggaacgtccttggaaacttgtctgagatccagggagaagttctttca
tttgatggaaatcgaaccgataagaactacatccggctggaagagctgctcaccaagcag
ctgctagccctggatgctgttgatccgcagggagaagagaagtgtaaggctgccaggaaa
caagctgtgaggcttgcgcagaatattctcagctatctcgacctgaaatctgatgaatgg
gagtactga

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_6|93_aa
MAVSAFHADLAVYACVCTHVCAYVHVCTCQKATLNAEEMADFYKEFLSKNFQKHMYYNRD
WYKRNFAITFFMGKVALERIWNKLKQKQKKRSN

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_6|282_bp
atggctgtcagcgcattccatgctgacctggccgtgtatgcgtgcgtgtgtacgcatgtg
tgtgcgtacgtgcacgtgtgtacatgtcagaaagcaacattgaatgcagaagaaatggcg
gacttctacaaggaatttttaagtaaaaattttcagaagcacatgtattataacagagat
tggtacaagcgcaattttgccatcaccttcttcatgggaaaagtggccctggaaaggatt
tggaacaagcttaaacagaaacaaaagaagaggagcaactag

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_7|130_aa
MPRTSAVTATQHRTLLTISRKQQSEQVQPMSQMEKPKGDFDAGPRLRSHTAPPVTAAGYS
RAWHAGGIRKYPENKSPRTLLSSQEPMNERSCCSILTCIWCCLESKDHLVVIIRKAIQKQ
NFLSEEFKSN

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_7|393_bp
atgcccagaacctcggcagtcacggccacgcagcacagaacactgctcaccatctctcgt
aagcagcagtcagagcaggtgcaacccatgtcacagatggagaagccgaaaggggacttt
gatgcgggccctcggctcagatctcacactgccccacccgtcaccgctgcggggtacagc
agggcctggcacgcaggaggtattcggaagtatccagagaataagtcaccaaggaccctg
ctgtcctcccaggaaccaatgaacgagcgttcctgttgctccatcctcacctgcatttgg
tgttgtttggagagcaaagatcacctggtggtgatcatcaggaaggccatccagaaacaa
aacttcttatctgaggaattcaaaagtaattag

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_8|52_aa
MWDNVDECQKQEAFTDRAAHKAADPQEPKGNSCLLVSSKKPLAKTDIYKHLQ

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_8|159_bp
atgtgggacaatgtggatgaatgtcaaaaacaggaggcattcacagacagggctgcccac
aaggctgcagatcctcaagaaccaaagggaaattcctgcctgcttgtctcttccaaaaag
ccgctcgcaaagacagacatctacaaacatctacagtaa

>gi568815584r:103459824_103661164|GENSCAN_predicted_peptide_9|311_aa
MTSCRRAALWGGARRGAGGSNTETPFSAAGAAQAAERDWLGRLGCWCEEPRGCARRPRGQ
RVGGRGCCGAVAPAPLAGDCCAVPHTAEAGSAHSRCSARAPGGAPGADSARERGPQEQGA
SVSPSIHYMDFLGQELPLEAVESYKIKMYDNMSTMVYIKEDKLEKLTQDEIISKTKQVIQ
GLEALKNEHNSILQSLLETLKCLKKDDESNLVEEKSNMIRKSLEMLELGLSEAQVMMALS
NHLNAVESEKQKLRAQVRRLCQENQWLRDELANTQQKLQKSEQSVAQLEEEKKHLEFMNQ
LKKYDDDISPS

>gi568815584r:103459824_103661164|GENSCAN_predicted_CDS_9|933_bp
atgacgtcatgccggcgcgcggcattgtggggcggggcgaggcggggcgccggggggagc
aacactgagacgccattttcggcggcgggagcggcgcaggcggccgagcgggactggctg
ggtcggctgggctgctggtgcgaggagccgcggggctgtgctcggcggccaaggggacag
cgcgtgggtggccgaggatgctgcggggcggtagctccggcgcccctagctggtgactgc
tgcgccgtgcctcacacagccgaggcgggctcggcgcacagtcgctgctccgcgcgcgcg
cccggcggcgctccaggtgctgacagcgcgagagagcgcggccctcaggagcaaggcgcg
tctgtcagtccatccattcactacatggactttctgggccaggagctgcctctagaagct
gtggagtcctacaagatcaaaatgtatgacaacatgtccacaatggtgtacataaaggaa
gacaagttggagaagcttacacaggatgaaattatttctaagacaaagcaagtaattcag
gggctggaagctttgaagaatgagcacaattccattttacaaagtttgctggagacactg
aagtgtttgaagaaagatgatgaaagtaatttggtggaggagaaatcaaacatgatccgg
aagtcactggagatgttggagctcggcctgagtgaggcacaggttatgatggctttgtca
aatcacctgaatgctgtggagtccgagaagcagaaactgcgtgcgcaggttcgtcgtctg
tgccaggagaatcagtggctacgggatgaactggccaacacgcagcagaaactgcagaag
agtgagcagtctgtggctcaactggaggaggagaagaagcatctggagtttatgaatcag
ctaaaaaaatatgatgacgacatttccccatcc