GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:32:29 Sequence gi568815596f:37244732_37472824 : 228093 bp : 39.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1321 1464 144 0 0 63 36 219 0.997 13.76 1.02 Intr + 2725 2898 174 0 0 42 116 136 0.996 11.01 1.03 Term + 3404 3619 216 1 0 101 40 122 0.995 4.86 1.04 PlyA + 3666 3671 6 1.05 2.19 PlyA - 3884 3879 6 1.05 2.18 Term - 8619 8446 174 0 0 90 37 181 0.999 9.98 2.17 Intr - 9558 9473 86 1 2 92 98 23 0.986 2.32 2.16 Intr - 12198 11931 268 0 1 89 93 199 0.996 16.68 2.15 Intr - 14950 14852 99 1 0 54 94 47 0.684 1.29 2.14 Intr - 15653 15492 162 2 0 85 68 68 0.274 3.75 2.13 Intr - 22805 22699 107 0 2 105 98 -36 0.121 -1.89 2.12 Intr - 23641 23570 72 2 0 56 67 75 0.099 0.66 2.11 Intr - 24956 24884 73 2 1 104 99 93 0.990 10.06 2.10 Intr - 27701 27649 53 0 2 98 103 2 0.939 0.41 2.09 Intr - 29966 29690 277 2 1 77 72 229 0.992 16.37 2.08 Intr - 33258 33135 124 2 1 17 65 62 0.515 -3.53 2.07 Intr - 35198 35015 184 0 1 57 90 246 0.962 19.62 2.06 Intr - 37888 37811 78 2 0 83 83 67 0.950 4.30 2.05 Intr - 46268 46137 132 0 0 36 93 112 0.750 6.20 2.04 Intr - 48540 48402 139 0 1 36 111 111 0.409 7.32 2.03 Intr - 50496 50372 125 1 2 73 70 25 0.156 -1.42 2.02 Intr - 58722 58521 202 0 1 26 103 100 0.272 3.24 2.01 Init - 71793 71506 288 0 0 30 79 207 0.677 11.16 2.00 Prom - 77516 77477 40 -4.45 3.06 PlyA - 78367 78362 6 1.05 3.05 Term - 79264 79085 180 1 0 68 49 253 0.982 16.03 3.04 Intr - 80681 80470 212 0 2 26 -31 226 0.136 2.21 3.03 Intr - 99109 98996 114 2 0 114 116 39 0.925 8.80 3.02 Intr - 100068 99874 195 1 0 32 35 154 0.784 2.96 3.01 Init - 100447 100258 190 1 1 93 39 182 0.887 11.03 3.00 Prom - 101906 101867 40 -4.95 4.00 Prom + 102502 102541 40 -7.15 4.01 Init + 105451 105510 60 0 0 64 89 44 0.848 3.42 4.02 Intr + 108058 108204 147 0 0 54 94 214 0.912 18.11 4.03 Intr + 114849 115127 279 2 0 50 65 270 0.144 17.65 4.04 Intr + 122501 122677 177 1 0 64 49 92 0.619 2.09 4.05 Intr + 124954 125053 100 0 1 129 99 -32 0.756 0.76 4.06 Intr + 127625 127741 117 0 0 63 80 73 0.787 3.52 4.07 Term + 133720 133856 137 2 2 33 38 137 0.087 0.40 4.08 PlyA + 134658 134663 6 1.05 5.05 PlyA - 135925 135920 6 1.05 5.04 Term - 138996 138514 483 0 0 74 39 211 0.810 8.56 5.03 Intr - 139473 139437 37 2 1 10 115 25 0.315 -5.25 5.02 Intr - 142947 142780 168 2 0 61 94 176 0.294 13.64 5.01 Init - 147719 147607 113 2 2 76 116 72 0.964 8.53 5.00 Prom - 158259 158220 40 -1.95 6.05 PlyA - 158400 158395 6 1.05 6.04 Term - 167094 166927 168 0 0 86 36 145 0.969 6.00 6.03 Intr - 170014 169731 284 2 2 -14 89 175 0.465 3.61 6.02 Intr - 171338 171183 156 0 0 74 29 83 0.138 0.06 6.01 Init - 173480 173330 151 2 1 60 72 102 0.942 6.05 6.00 Prom - 175651 175612 40 -4.25 7.02 PlyA - 176596 176591 6 1.05 7.01 Sngl - 177720 177445 276 0 0 95 47 156 0.849 7.33 7.00 Prom - 180793 180754 40 -3.75 8.03 PlyA - 181031 181026 6 1.05 8.02 Term - 182412 182300 113 1 2 86 53 72 0.223 1.24 8.01 Init - 189747 189603 145 0 1 38 73 149 0.335 8.73 8.00 Prom - 195916 195877 40 -3.65 9.03 PlyA - 196303 196298 6 1.05 9.02 Term - 202921 202808 114 1 0 60 40 116 0.212 1.59 9.01 Init - 217474 217403 72 1 0 59 98 81 0.863 7.32 9.00 Prom - 219296 219257 40 -5.05 10.03 PlyA - 219749 219744 6 1.05 10.02 Term - 223845 223738 108 0 0 76 48 92 0.926 1.53 10.01 Intr - 227500 227380 121 0 1 64 119 55 0.932 5.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_1|177_aa HDETRDHVEVCPDAGVIIEELSQRIALTGGAALVADYGHDGTKTDTFRGFCDHKLHDVLI APGTADLTADVDFSYLRRMAQGKVASLGPIKQHTFLKNMGIDVRLKVLLDKSNEPSVRQQ LLQGYDMLMNPKKMGERFNFFALLPHQRLQGGRYQRNARQSKPFASVVAGFSELAWQ >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_1|534_bp catgacgaaacaagggatcatgttgaagtgtgtcctgatgctggtgttatcatcgaggaa ctttctcaacgcattgcattaactggaggtgctgcactggttgctgattatggtcatgat ggaacaaagacagataccttcagagggttttgcgaccacaagcttcatgatgtcttaatt gccccaggaacagcagatctaacagctgatgtggacttcagttatttgcgaagaatggca cagggaaaagtagcctctctgggcccaataaaacaacacacatttttaaaaaatatgggt attgatgtccggctgaaggttcttttagataaatcaaatgagccatcagtgaggcagcag ttacttcaaggatatgatatgttaatgaatccaaagaagatgggagagagatttaacttt tttgccttgctacctcatcagagacttcaaggtggaagatatcagaggaatgcacgtcag tcaaaaccctttgcatccgttgtagctgggtttagtgaacttgcttggcagtga >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_2|880_aa MSANNSPPSAQKSVLPTAIPAVLPAASPCSSPKTGLSARLSNGSFSAPSLTNSRGSVHTV SFLLQIGLTRESVTIEAQELSLSAVKDLVCSIVYQKVCHPWTAACYQLSGRLASSCGVAA LHQQGERAYATAFFGVLALVGSPVLVWHLRTMRSPRHLKDGEGDCKFLEDKDYSVFTNCG ASILHLPDMQEVCLFEISFIYTNSRFPECGFFGMYDKILLFRHDMNSENILQLITSADEI HEGDLVEVVLSALATVEDFQIRPHTLYVHSYKAPTFCDYCGEMLWGLVRQGLKCEDCKFN CHKRCASKVPRDCLGEVTFNGEPSSLGTDTDIPMDIDNNDINSDSSRGLDDTEEPSPPED KMFFLDPSDLDVERDEEAVKTISPSTSNNIPLMRVVQSIKHTKRKSSTMVKEGWMVHYTS RDNLEIPLSEILRISSPRDFTNISQGSNPHCFEIITDTMVYFVGENNGDSSHNPVLAATG VGLDVAQSWEKAIRQALMPVTPQASVCTSPGQGKDHKDLSTSISVSNCQIQENVDISTVY QIFADEVLGSGQFGIVYGDLSVLRVVTSLEDFLSSEEQRTVRGKHRKTGRDVAIKVIDKM RFPTKQESQLRNEVAILQNLHHPGIVNLECMFETPERVFVVMEKLHGDMLEMILSSEKSR LPERITKFMVTQILVALRNLHFKNIVHCDLKPENVLLASAEPFPQVKLCDFGFARIIGEK SFRRSVVGTPAYLAPEVLRSKGYNRSLDMWSVGVIIYVSLSGTFPFNEDEDINDQIQNAA FMYPPNPWREISGEAIDLINNLLQVKMRKRYSVDKSLSHPWLQDYQTWLDLREFETRIGE RYITHESDDARWEIHAYTHNLVYPKHFIMAPNPDDMEEDP >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_2|2643_bp atgtctgcaaataattcccctccatcagcccagaagtctgtattacccacagctattcct gctgtgcttccagctgcttctccgtgttcaagtcctaagacgggactctctgcccgactc tctaatggaagcttcagtgcaccatcactcaccaactccagaggctcagtgcatacagtt tcatttctactgcaaattggcctcacacgggagagtgttaccattgaagcccaggaactg tctttatctgctgtcaaggatcttgtgtgctccatagtttatcaaaaggtctgccatcca tggacagcagcgtgttatcagctcagtgggcgccttgcctcatcgtgtggggtggctgcc ctccaccagcaaggggaaagggcctatgcaacagccttttttggggtactggcacttgtt gggtctccagtgcttgtctggcatctaagaacaatgagatcccccagacacttgaaggat ggtgaaggtgattgtaagttcctggaagataaggactattctgtctttacaaactgtggg gccagtatcttgcacttaccggacatgcaagaagtatgcctttttgaaatcagttttatt tatacaaatagtagatttccagagtgtggattctttggcatgtatgacaaaattcttctc tttcgccatgacatgaactcagaaaacattttgcagctgattacctcagcagatgaaata catgaaggagacctagtggaagtggttctttcagctttagccacagtagaagacttccag attcgtccacatactctctatgtacattcttacaaagctcctactttctgtgattactgt ggtgagatgctctggggattggtacgtcaaggactgaaatgtgaagattgcaaattcaac tgccataaacgctgtgcatcaaaagtaccaagagactgccttggagaggttactttcaat ggagaaccttccagtctgggaacagatacagatataccaatggatattgacaataatgac ataaatagtgatagtagtcggggtttggatgacacagaagagccatcacccccagaagat aagatgttcttcttggatccatctgatctcgatgtggaaagagatgaagaagccgttaaa acaatcagtccatcaacaagcaataatattccgctaatgagggttgtacaatccatcaag cacacaaagaggaagagcagcacaatggtgaaggaagggtggatggtccattacaccagc agggataacctggaaattccactttcagaaattctccgcatatcttcaccacgagatttc acaaacatttcacaaggcagcaatccacactgttttgaaatcattactgatactatggta tacttcgttggtgagaacaatggggacagctctcataatcctgttcttgctgccactgga gttggacttgatgtagcacagagctgggaaaaagcaattcgccaagccctcatgcctgtt actcctcaagcaagtgtttgcacttctccagggcaagggaaagatcacaaagatttgtct acaagtatctctgtatctaattgtcagattcaggagaatgtggatatcagtactgtttac cagatctttgcagatgaggtgcttggttcaggccagtttggcatcgtttatggagatttg tcagttttaagagtagttacatctctggaggacttcctgtcatcagaggaacaaagaact gtccgaggaaaacatagaaagactgggagggatgtggctattaaagtaattgataagatg agattccccacaaaacaagaaagtcaactccgtaatgaagtggctattttacagaatttg caccatcctgggattgtaaacctggaatgtatgtttgaaaccccagaacgagtctttgta gtaatggaaaagctgcatggagatatgttggaaatgattctatccagtgagaaaagtcgg cttccagaacgaattactaaattcatggtcacacagatacttgttgctttgaggaatctg cattttaagaatattgtgcactgtgatttaaagccagaaaatgtgctgcttgcatcagca gagccatttcctcaggtgaagctgtgtgactttggatttgcacgcatcattggtgaaaag tcattcaggagatctgtggtaggaactccagcatacttagcccctgaagttctccggagc aaaggttacaaccgttccctagatatgtggtcagtgggagttatcatctatgtgagcctc agtggcacatttccttttaatgaggatgaagatataaatgaccaaatccaaaatgctgca tttatgtacccaccaaatccatggagagaaatttctggtgaagcaattgatctgataaac aatctgcttcaagtgaagatgagaaaacgttacagtgttgacaaatctcttagtcatccc tggctacaggactatcagacttggcttgaccttagagaatttgaaactcgcattggagaa cgttacattacacatgaaagtgatgatgctcgctgggaaatacatgcatacacacataac cttgtatacccaaagcacttcattatggctcctaatccagatgatatggaagaagatcct taa >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_3|296_aa MRPRLPAQQPPARHISCAYQHPHQTAAQERPVTSPSPPGVSRTAPDWPRHPGEYAEPRLL RAPGQGGHQQQQVEGAHDAPVSSACHLSESVWERVPRAAGGHEPGNARPSPSLFSTLGST APAFPSPPVRIRAWPALSHLPVTSSSQKNQNKDTEAEPFHREQRFEGSHSRLKADGEETP STGKTPAVTGTSGLGRAGLGAELRCRRPPRGQAAGRSMREGCDISGWKQLSALDALQVNA VRGIGLAVCRHTYDSSDKCVRATCRLFHTQIDGMDVTVSSITPIPTGNKTIGVIFD >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_3|891_bp atgcgccctcgcctcccggctcagcagccgccggcgagacacatttcctgtgcctatcag cacccccatcagaccgcagcccaggagaggccagtgacatcacccagcccccctggcgtc tccagaacagccccggactggccgaggcaccccggcgagtatgcagagccccgactcctc cgggcgccggggcagggcggccaccagcagcagcaggtggagggtgcccacgacgcgccg gtgtcttccgcctgccatctctccgagtctgtctgggagcgggttccccgggccgctggc ggtcacgagcccgggaacgcgcgtccttcgccttccctcttctccacccttgggtcgact gcgcccgccttcccatcgcccccggtgaggatcagggcttggcctgctctgagtcatctc ccagtaacatctagttcccagaaaaatcagaacaaggatactgaagcagaaccatttcac agagagcaaagatttgaagggtcacattcgcgtttaaaggccgacggggaggaaacaccg agcaccgggaaaacccctgcagtcaccggaacgtcgggcttgggacgtgccggtttgggc gccgagctgcggtgccggcggccaccccgggggcaggcagcagggaggagcatgcgggag ggctgtgacatctcagggtggaagcagctctctgctctggacgccttacaagtgaacgca gtcagaggcataggccttgccgtctgcaggcacacatacgattcaagcgacaagtgtgtg cgcgcgacctgccgacttttccacacacagattgatggcatggacgtaactgtgtcttca atcacacccatccccacaggaaataagactatcggtgtcatctttgattag >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_4|338_aa MGAEPQSIWGLAKCQSDRSMNYHQPAILNSSALRQIAEGTSISEMWQNDLQPLLIERYPG SPGSYAARQHIMQRIQRLQADWVLEIDTFLSQTPYGYRSFSNIISTLNPTAKRHLVLACH YDSKYFSHWNNRVFVGATDSAVPCAMMLELARALDKKLLSLKTVSDSKPDLSLQLIFFDG EEAFLHWSPQDSLYGSRHLAAKMASTPHPPGARGTSQLHGMDLLVLLDLIGAPNPTFPNF FPNSARWFERLQAIEHELHELGLLKDHSLEGRYFQNYSYGGVIQDDHIPFLRRVSVQGEY LSSEQQFHKVLGLTAIGPSGALGPLLIQSLGWWMECTD >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_4|1017_bp atgggggcagaacctcagagcatttggggcctggccaaatgccagagtgaccgaagcatg aattaccaccagccagccattttgaattcatcggctcttcggcaaattgcagaaggcacc agtatctctgaaatgtggcaaaatgacttacagccattgctgatagagcgatacccggga tcccctggaagctatgctgctcgtcagcacatcatgcagcgaattcagaggcttcaggct gactgggtcttggaaatagacaccttcttgagtcagacaccctatgggtaccggtctttc tcaaatatcatcagcaccctcaatcccactgctaaacgacatttggtcctcgcctgccac tatgactccaagtatttttcccactggaacaacagagtgtttgtaggagccactgattca gccgtgccatgtgcaatgatgttggaacttgctcgtgccttagacaagaaactcctttcc ttaaagactgtttcagactccaagccagatttgtcactccagctgatcttctttgatggt gaagaggcttttcttcactggtctcctcaagattctctctatgggtctcgacacttagct gcaaagatggcatcgaccccgcacccacctggagcgagaggcaccagccaactgcatggc atggatttattggtcttattggatttgattggagctccaaacccaacgtttcccaatttt tttccaaactcagccaggtggttcgaaagacttcaagcaattgaacatgaacttcatgaa ttgggtttgctcaaggatcactctttggaggggcggtatttccagaattacagttatgga ggtgtgattcaggatgaccatattccatttttaagaagagtttcagtgcaaggtgagtac ctgtcctctgagcaacagttccacaaagtccttggcctgactgccattggcccaagtgga gctctgggacctttgctgattcagtctctgggctggtggatggaatgcactgattga >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_5|266_aa MTGPNSLKNRSSCCMKNRLKGARVEPGSLDKRLLKLSRAAAGEARASLDSVESDLSVAAA AAPTKHKSNPHLWGKKKQCVALGDEVIDGGGFTATITDSLGNSQWKHALKGLKPVITRLL QHGLLKPKNSPYNSPISPVLKPDKAYRLVQDLRLINQIVLPTHPVVPNPYTLLSSIPPST IHYSVLDLKHAFFTIPLHPSSQPLFAFTWTDPDSHQAQQITWAVLPQGFTDSPHYFSQVQ ISSSSVTYLGIILIKTHVLSMLIVSN >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_5|801_bp atgacgggacctaattccttaaagaaccgctcttcctgctgcatgaagaatagactgaag ggggcaagggtagaaccaggaagccttgataagaggctgttaaagttatcaagggcagct gctggggaagccagagcttccctggattcagttgagtcagacctcagtgttgcagctgct gcagctcctaccaaacataagtctaaccctcatctctggggaaagaagaagcaatgtgta gcgttaggggatgaagtgattgatggtggtggctttacagcgaccatcacagactctctg ggtaactcacagtggaagcacgctttaaaaggattaaagcctgttatcactcgcctgcta cagcatggccttttaaaacctaaaaactccccttacaattcccccatttcacctgtccta aaaccagacaaggcttacaggttagttcaggatctgcgccttatcaaccaaattgttttg cctacccaccccgtggtgccaaacccatatactctcctatcctcaatacctccctccaca atccattattctgttctggatctcaaacatgctttctttactattcctttacacccttca tcccagcctctcttcgctttcacctggactgaccctgactcccatcaggctcagcaaatt acctgggctgtactgccgcaaggcttcacagacagcccccattacttcagtcaagtccaa atttcatcctcatctgttacctatctcggcataattctcataaaaacacacgtgctctcc atgctgattgtgtccaactga >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_6|252_aa MLVVVLSSEGDFPILLNVPVTHAIDTPGVYPQAQRRPPSNDPFPWQHLDSALCIAQRDSW GQCLNPQQEKASTAMPSFEVLDHCKGEVSKKRKLLMVRRIEWELIGSEKLTFSPSILNEF RTKETCKAQINVKEKSPSTALTSLNLVHLLVVIFQNNSESPFRVTDPVRKKLWRSYSDFY GSPGPTHQDISLLKGQRTLSPLFTPGALWPASDPPEWVIHKKAQAQAQCPLGLQEIQESL IQHRVGIHAALL >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_6|759_bp atgttagtggtggtcctgtcttcagaaggtgactttcccattcttctcaatgtccctgtc acccatgctattgacaccccaggagtctatcctcaggctcagagacgtcctcctagcaat gacccattcccttggcaacaccttgactcagcactgtgtattgcacaaagggactcctgg ggacagtgtcttaacccacagcaggagaaggcctccactgccatgccaagctttgaagtc ctggatcattgcaagggtgaagtctccaagaaaaggaagctgctgatggtcagaagaata gagtgggaattaataggatctgagaaactgacattttcgccctcaattctcaatgaattt agaacaaaagaaacctgcaaagctcagataaacgtgaaagaaaaatcaccaagcactgct ctaacttctttgaatcttgtgcatctcctagtggtgatctttcagaataactctgaaagc ccattcagagttactgacccagtaaggaagaaactgtggagaagttattcagacttctac ggtagtccgggcccaacgcaccaagacatttcactattaaagggacagagaaccctgtcc cccctttttactcctggggccctctggcctgcatcagaccctcctgagtgggttatccat aagaaggcacaagctcaagctcagtgcccacttggcctgcaggaaattcaagaatcattg atccaacatagagtaggaatccacgctgccctactatag >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_7|91_aa MNVVKALRVAGSVCGFGFTCSFDFLIWAIFTQRVNTNGILAQTEDSAAEIHHYNVSQNSG PQTFWHQGPVLWKTKPRMAAWGLWGGRMISR >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_7|276_bp atgaatgtggtgaaagctttgcgtgttgctgggagcgtgtgtgggtttggtttcacatgc agctttgatttcctcatctgggccatcttcacacagcgtgttaacaccaacgggatattg gctcagactgaagactcagcagctgagattcatcattacaatgtctcccagaacagcggt ccccaaactttttggcatcagggaccggttttgtggaagacaaaaccacggatggcggcc tgggggttgtggggtggaaggatgatttccagatga >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_8|85_aa MFFNTPKSHTSLPEIDPTQEEIPDLPEKEFRRLVIKLIREAPEKGEVQSVGWAFSSYDLP YMDLGVFPGPPDQERQSEYSTTGTA >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_8|258_bp atgttctttaacacccccaaaagtcacactagcttaccagaaattgatccaacccaagaa gaaatccctgatttacctgaaaaggaattcaggagactggttattaagctaatcagggag gcaccagagaaaggtgaagtccaatcagttggttgggctttcagctcttatgacctgccc tatatggatttaggagtgttccctggtcccccagatcaggagagacagtcagaatattca acaactggtacagcatga >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_9|61_aa MSESQNYYADKRVYAVRFRENLKQPVGQHLLPLIRTLAEEPATAELLNPSGSLLCTKPRS P >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_9|186_bp atgagtgaatctcaaaactattatgccgacaaaagagtatatgctgtcagattccgtgaa aatctaaaacagccagttggacaacacctgctgcctctcatcagaactcttgcagaagag cctgccacagcagagctgctgaacccctctggttctctcctctgtacaaagccccgttcc ccatag >gi568815596f:37244732_37472824|GENSCAN_predicted_peptide_10|76_aa XSCDISASECRPSFSPSFSQLTTLSGQPPASYADTLNPESLAHPFEEPNALTTTVALYVG CLTRKEHLAVSLVYFQ >gi568815596f:37244732_37472824|GENSCAN_predicted_CDS_10|231_bp nnaagttgtgacatctcagcctcggaatgtaggccatcatttagcccaagtttcagtcaa ctaaccacactgtcaggacaacctccagctagttatgctgacacattaaatccagaatct ctggcccacccatttgaagaacccaatgccttaacaaccacagtcgcattgtatgttgga tgtttgacacggaaagagcatttggcagtttctttggtctatttccagtga