GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:45:30

Sequence gi568815576f:45317610_45532359 : 214750 bp : 46.71% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4762   4817   56  0  2   89   91    28 0.528   0.98
 1.02 Intr +   5563   5818  256  1  1   74   95   397 0.639  36.45
 1.03 Intr +   8779   8802   24  0  0  112   69    26 0.379   1.32
 1.04 Intr +  10233  10454  222  2  0   80   86   427 0.974  40.02
 1.05 Intr +  12994  13122  129  0  0   58   64   176 0.965  13.09
 1.06 Intr +  14816  15101  286  1  1   96   80   234 0.999  20.31
 1.07 Intr +  17344  17573  230  2  2   75   65    95 0.404   3.49
 1.08 Term +  17741  17854  114  1  0  148   41   -22 0.445  -2.33
 1.09 PlyA +  18683  18688    6                               1.05

 2.20 PlyA -  19338  19333    6                               1.05
 2.19 Term -  27048  26947  102  0  0   83   50    60 0.911  -0.02
 2.18 Intr -  27960  27850  111  0  0  100   88   111 0.995  12.88
 2.17 Intr -  32159  32119   41  0  2   90  106    11 0.618   1.04
 2.16 Intr -  37506  37350  157  0  1   74   53    93 0.777   3.98
 2.15 Intr -  41186  41088   99  2  0   62   90   132 0.999  11.11
 2.14 Intr -  42349  42196  154  0  1   73   51    84 0.983   3.27
 2.13 Intr -  44375  44230  146  2  2   35   59   203 0.473  11.18
 2.12 Intr -  50032  49871  162  1  0   29   92    88 0.693   3.47
 2.11 Intr -  52451  52323  129  2  0   30   87   127 0.851   7.69
 2.10 Intr -  54683  54546  138  2  0  117   90    70 0.998  10.76
 2.09 Intr -  69437  69258  180  2  0   47   69    60 0.391   0.06
 2.08 Intr -  72249  72103  147  0  0   38   58   159 0.767   8.33
 2.07 Intr -  76232  76025  208  1  1   84   40   116 0.797   5.48
 2.06 Intr -  77158  77076   83  1  2  100   53    12 0.648  -2.66
 2.05 Intr -  81744  81486  259  2  1   91   98   169 0.992  15.67
 2.04 Intr -  84962  84724  239  2  2   87   87    75 0.942   3.71
 2.03 Intr -  89054  88851  204  2  0   31   94    59 0.497   0.20
 2.02 Intr -  89256  89144  113  1  2   72  111   -18 0.736  -1.00
 2.01 Init -  95958  95850  109  0  1  117   53   195 0.957  19.34
 2.00 Prom -  96715  96676   40                              -4.06

 3.00 Prom +  98003  98042   40                              -8.56
 3.01 Init + 100001 100337  337  1  1   68   95   293 0.050  25.44
 3.02 Intr + 100913 101012  100  0  1   64   49    60 0.454  -1.13
 3.03 Intr + 101176 101313  138  1  0   99   71    20 0.438   0.98
 3.04 Intr + 104371 104416   46  1  1  130   65   -12 0.600  -0.89
 3.05 Intr + 104537 104633   97  1  1   65   42   111 0.934   3.78
 3.06 Intr + 104681 104799  119  0  2  107   80   135 0.999  14.78
 3.07 Intr + 108339 108566  228  2  0   48   22   375 0.996  24.97
 3.08 Intr + 113291 113457  167  1  2   72   73   241 0.976  19.76
 3.09 Intr + 113618 113907  290  2  2   34   66   136 0.706   2.89
 3.10 Intr + 116816 116915  100  0  1   53   46    86 0.344   0.07
 3.11 Term + 119364 119523  160  0  1   62   43   120 0.485   2.31
 3.12 PlyA + 119712 119717    6                               1.05

 4.07 PlyA - 121187 121182    6                               1.05
 4.06 Term - 127817 127747   71  0  2  102   52    31 0.197  -1.00
 4.05 Intr - 130850 130762   89  1  2   68  115    31 0.248   3.41
 4.04 Intr - 132710 132612   99  1  0   82   80    44 0.102   2.23
 4.03 Intr - 137068 136951  118  2  1   63   92    29 0.102   0.32
 4.02 Intr - 140674 140555  120  2  0   44   91    65 0.134   2.87
 4.01 Init - 146815 146680  136  1  1   53   57    98 0.523   3.50
 4.00 Prom - 146990 146951   40                              -3.76

 5.00 Prom + 151849 151888   40                              -1.16
 5.01 Init + 185377 185455   79  0  1   75   75   279 0.998  24.52
 5.02 Intr + 185710 185881  172  2  1   67   91    -3 0.518  -3.10
 5.03 Term + 186097 186202  106  1  1   79   43   134 0.944   5.88
 5.04 PlyA + 187197 187202    6                              -0.45

 6.00 Prom + 187370 187409   40                              -8.46
 6.01 Init + 187820 187932  113  1  2   76   75   133 0.726   8.48
 6.02 Intr + 194457 194495   39  0  0  111   55    41 0.130   0.44
 6.03 Intr + 197142 197281  140  0  2   60   72    62 0.342   1.91
 6.04 Intr + 201073 201178  106  2  1   99   98   139 0.986  15.27
 6.05 Intr + 207934 208069  136  1  1   72  100   225 0.983  22.67
 6.06 Intr + 210238 210400  163  0  1   73   60   128 0.960   8.05
 6.07 Intr + 211466 211510   45  0  0  136   68    -6 0.562   0.58
 6.08 Intr + 213656 213715   60  0  0  131   78   103 0.982  12.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  96293  96406  114  1  0   96   19   158 0.886   9.91
S.002 Intr +  96713  96794   82  1  1  117   85    50 0.953   6.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_1|438_aa
NSQMDSVEKTTNRSEQKSSRKFLKSLIRKQPQELLLVIGTGVSAAVAPGIPALCSWRSCI
EAVIEAAEQLEVLHPGDVAEFRRKVTKDRDLLVVAHDLIRKMSPSICGTTGQRTGDAKPS
FFQDCLMEVFDDLEQHIRSPVVLQSILSLMDRGAMVLTTNYDNLLEAFGRRQNKPMESLD
LKDKTKVLEWARGHMKYGVLHIHGLYTDPCGVVLDPSGYKDVTQDAEVMEVLQNLYRTKS
FLFVGCGETLRDQIFQALFLYSVPNKVDLEHYMLVLKENEDHFFKHQADMLLHGIKVVSY
GDCFDHFPGYVQDLATQICKQQSPGWGRQVDSTGCQKTARSLGVLICDCVMKDICTRIHG
VPLLMGEEAHDSDSHASDRGHHTMLPLPAGSFSESSHQAWEMLIAWTAPHYWVKQIFLLA
SLFFAYTFHCLLSLTVKS

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_1|1317_bp
aattcacagatggattcagtggaaaagacaacaaatagaagtgaacaaaaatccagtaga
aagtttttaaaaagcctcatccggaaacagccccaggaactgctcctggttatcgggact
ggcgtcagcgcagcagtggcccccggaatccctgccctttgctcgtggagaagctgcatc
gaggccgtcatcgaggctgcagagcagctggaggtgctgcaccccggagacgtcgccgag
ttccggaggaaagtgacaaaggaccgggacctgttggttgtcgcccatgatctgatccgg
aagatgtcacctagtatatgcggcactacgggccagcgcacaggcgatgccaagcccagc
ttcttccaggactgcctgatggaggtgtttgacgacctggagcagcacatccggagtcct
gtggtgctgcagtcgatcctcagcctgatggacaggggcgccatggtcctgaccaccaac
tatgacaacctgctggaggcctttggccggcggcagaacaagcccatggagtccctggac
ttgaaggacaagaccaaggtccttgaatgggcaagagggcacatgaagtacggcgtcctc
cacattcacggcctctacacggacccctgcggggtggtgctggacccatcggggtataaa
gacgtcactcaagacgcagaagtcatggaagtcctccagaacttataccgcaccaagtcc
tttctgtttgtgggctgtggggagacccttcgtgatcagatattccaggccctctttctt
tactccgtgccgaataaggtggatttggagcactacatgcttgtgctgaaggagaatgaa
gaccatttctttaagcatcaggcagatatgcttctgcacggaatcaaagttgtatcctac
ggggactgttttgaccactttccaggatatgtgcaagaccttgccactcagatctgcaaa
cagcaaagcccaggttggggcaggcaagtggattcgacaggctgccagaagactgcaagg
tcattgggagttttaatctgtgactgtgtcatgaaggacatttgtactcgaattcatgga
gtgccactcctgatgggagaggaggcccatgacagtgacagtcatgctagtgatcgcgga
caccacaccatgctgcctttgccagctggctccttcagcgagtcctcgcaccaagcctgg
gagatgctgatcgcgtggacagcaccacattattgggtaaagcagatctttcttcttgcc
agcctgttttttgcctacacattccattgtcttctgtcactaactgtaaaatcataa

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_2|926_aa
MAHLELLLVENFKSWRGRQVIGPFRRFTCIIGPNGSGGCSEFRFNDNLVSRSVYIAELEK
IGIIVKAQNCLVFQGTVESISVKKPKERTQFFEEISTSGELIGEYEEKKRKLQKAEEDAQ
FNFNKKKNIAAERRQAKLEKEEAERYQSLLEELKMNKIQLQLFQLYHNEKKIHLLNTKLE
HVNRDLSVKRESLSHHENIVKARKKEHGMLTRQLQQTEKELKSVETLLNQKRPQYIKAKE
NTSHHLKKLDVAKKSIKDSEKQCSKQEDDIKALETELADLDAAWRSFEKQIEEEILHKKR
DIELEASQGNLKQIKEQIEDHKKRIEKLEEYTKTCMDCLKEKKQQEETLVDEIEKTKSRM
SEVNEELNLIRSELQNAGIDTHEGKRQQKRAEVLEHLKRLYPDSVKYQLAVTKVFGRFIT
AIVVASEKVAKDCIRFLKEERAEPETFLALDYLDIKPINERLRELKGCKMVIDVIKTQFP
QLKKVIQFVCGNGLVCETMEEARHIALSGPERQKGLMKTLRKETDLKQIQTLIQGTQTRL
KYSQNELEMIKKKHLVAFYQVEDDIFQHFCEEIGVENIREFENKHVKRQQEIDQKRYFYK
KMLEVSLKGEKFLRTDRQSSEAGLASPVQETLLCNLVKGDTKLEWLFSSITQQHHTEAEE
NCLQTVNELMAKQQQLKDIRVTQNSSAEKVQTQIEEERKKFLAVDREVGKLQKEVVSIQT
SLEQKRLEKHNLLLDCKVQDIEIILLSGSLDDIIEVEMGTEAESTQATIDIYEKEEAFEI
DYSSLKEDLKALQSDQEIEAHLRLLLQQVASQEDILLKTAAPNLRALENLKTVRDKFQES
TDDEVDAALDNTNIGKVSSYIKEQTQDQFQMIVISLKEEFYSRADALIGIYPEYDDCMFS
RVLTLDLSQYPDTEGQESSKRHGESR

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_2|2781_bp
atggcccacctggagctgctgcttgtggaaaatttcaagtcgtggcggggccgccaggtc
attggccccttccggaggttcacctgcatcatcggccccaacggctctgggggatgctca
gaatttcgctttaatgataatcttgtgagtcgttctgtttacattgcagagttggaaaag
ataggcataatagtcaaagcacaaaattgtttggtttttcagggaactgtagagtcaatt
tcagtgaagaaacccaaagaaaggacccagttttttgaggaaatcagcacttcaggagag
cttataggagaatatgaagaaaagaaaagaaagttacaaaaagccgaagaggatgcacag
tttaactttaataagaaaaaaaatatagcggcagagcgcagacaagcaaaattagagaag
gaagaggcagaacgttaccagagtctccttgaagaactgaaaatgaacaagatacaactg
cagctttttcaactataccataatgagaaaaagattcatctcctgaacaccaagttagag
catgtgaatagggatttgagtgtcaaaagagagtctttgtctcatcatgaaaacatagtt
aaagccaggaaaaaggaacatggaatgctaactagacaactacaacaaacagaaaaagaa
ttaaaatcggttgaaacccttttaaatcagaagaggcctcagtacattaaagccaaagaa
aacacttctcaccaccttaagaaattagatgtggctaagaaatcaataaaggacagcgaa
aaacaatgttctaaacaggaagatgatataaaagccctggagacagagctggctgattta
gatgctgcatggagaagttttgaaaagcagattgaggaagaaattttacataaaaagcga
gacattgaactggaagccagtcagggaaatctaaaacaaataaaagaacaaatagaagat
cataaaaaacgaatagagaagttagaggagtatacaaagacatgcatggattgcttgaaa
gagaaaaaacagcaagaggaaaccctagtggatgaaattgaaaaaacaaaatcaagaatg
tctgaagttaatgaagaattgaatcttattagaagtgaattgcagaatgctgggattgat
acccatgagggaaaacgtcagcaaaagagagcagaggttctggaacaccttaaaagactg
tacccagattctgtgaaataccagctggctgttactaaggtttttggccggttcatcact
gccattgttgtagcctctgaaaaggtagcaaaagattgtattcgatttctgaaggaggaa
agagctgaacctgagacattcctcgctctagattaccttgatatcaagccaatcaatgaa
agactaagggagcttaaaggctgtaaaatggtgattgatgtcataaagactcagtttcct
cagctgaagaaagtgattcagtttgtgtgtggaaatggtcttgtttgtgagactatggaa
gaagcaaggcatattgcactcagtggacctgaaagacagaaaggtttaatgaagacactc
cgcaaagaaacagatttgaaacaaatacagaccctgatacagggaactcaaacacgactc
aaatattcacaaaatgaactagagatgattaagaagaagcaccttgttgctttttaccag
gtagaagacgatatcttccaacacttctgtgaagaaattggcgtggaaaatattcgtgaa
tttgagaacaaacatgttaaacggcaacaagaaattgatcaaaaaaggtatttttataaa
aagatgttggaagtatcactgaagggagaaaagttcctgaggaccgacagacaaagcagt
gaagcaggactagcatccccagtccaggaaactctgctgtgtaacttggtcaaaggagac
accaaattagagtggctgttcagcagcatcacacagcaacaccacacagaggctgaagaa
aactgtctgcagacagtgaatgaactcatggcaaagcagcagcaacttaaggacatacgt
gtcactcagaactccagtgccgagaaagttcaaactcaaattgaagaggaacggaagaag
tttctggctgttgatagggaagtggggaaattgcaaaaagaagttgtaagtattcaaact
tctctggaacagaaacgattagagaagcataacttgctgcttgattgcaaagtgcaagac
attgagataatccttttgtcggggtcactggatgacatcattgaagtggagatgggaact
gaagcagaaagtacccaggcaacaattgatatctatgaaaaagaagaagcctttgaaata
gactacagctctctaaaagaggatttgaaggctctacagtctgatcaagaaatcgaggcc
caccttaggctcttattgcagcaagtagcatcccaggaagatatcttactgaaaacagca
gccccaaacctacgagcactggagaacttaaagactgtcagagacaagtttcaagagtcc
acagatgatgaagtggatgcagccctagacaatactaacataggcaaagtgtcaagttac
atcaaagagcaaactcaagaccagtttcagatgatagtcatctccctaaaagaagagttc
tattccagagccgacgcgctgatcggcatctatcctgagtacgatgactgcatgttcagc
cgagttttgaccctagatctttctcagtatccagacactgaaggccaagaaagcagcaag
agacacggagagtcccgctag

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_3|593_aa
MRQNDKIMCILENRKKRDRKNLCRAINDFQQSFQKPETRREFDLSDPLALKKDLPARQSD
NDVRNTISGMQKFMGEDLNFHERKKFQEEQNREWSLQQQREWKNARAEQKCAAPLHACTR
AALTRPATSVPERPGNEDDNIPALLGLLSMSDVSPDHLLPFFSHSYQIYALPAPLELPWP
TSPVTSMSPNPRSQTSFLGAVEYLLPTPSEAGPVIIVPVLMLMAKDVVGGKGTNGHTWEE
ALYTETRLQFDETAKHLQKLESTTRKAVCASVKDFNKSQAIESVERKKQEKKQEQEDNLA
EITNLLRGDLLSENPQQAASSFGPHRVVPDRWKGMTQEQLEQIRLVQKQQIQEKLRLQEE
KRQRDLDWDRRRIQGARATLLFERQQWRRQRDLRRALDSSNLSLAKEQHLQPMGHRSLLS
TRGAWVEGTGKPQGRALSLLGESGRLPGGGFIGETQEEDGTAYSGTAFQVERGAPGLPQK
SLESLGYRQDSKSSELQLQEDVKCKQTGPHGLPGKAFEQMTTLKTLFKFTVDQVMLPPPG
WARDLQPAMPEPPTPSMSSCAAQASPTSAAPCSTAPSPIDRPRAEECERTARD

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_3|1782_bp
atgaggcaaaatgacaaaatcatgtgcatattggaaaaccggaaaaagagggataggaaa
aatctctgtagggctatcaatgacttccaacagagctttcagaagccagaaactcgccgt
gaatttgatctgtccgaccccctagcccttaagaaagatcttccagcccggcagtcagat
aatgatgttcggaatacgatatcaggaatgcagaaattcatgggagaggatttaaacttc
catgagaggaagaaattccaagaggaacaaaacagagaatggtctttgcagcagcaaagg
gaatggaagaacgcccgtgctgaacaaaaatgcgcagctcctcttcatgcctgcactcga
gcagcactgacccgcccagccacttctgtgccggaaagaccgggtaacgaagacgacaac
atccctgctctcttgggtctactctccatgagtgatgtctccccagatcatcttctccca
ttcttctcccactcctaccagatttatgccctccctgctccactggaactgccctggcca
acctcaccagtgacctccatgtcgcccaatccaaggtctcaaacgtcatttcttggggct
gtagagtatctcttgcccactccttcagaggcaggtcctgtcattattgttcccgtcctc
atgctcatggctaaggacgtggtaggaggcaaaggcacgaacggccacacgtgggaagag
gccctctacacagagacaaggctgcagtttgacgagacagccaagcacctccagaagctg
gaaagcaccaccagaaaggcagtttgtgcatctgtgaaagacttcaacaagagccaggcc
atcgagtcagtggaaaggaaaaagcaagagaaaaagcaagaacaagaggacaacttggcc
gagatcaccaacctcctgcgtggggacctgctctccgagaacccgcagcaggcagccagc
tccttcgggccccaccgcgtggtccctgaccgctggaagggcatgacccaggagcagctg
gagcagatccgcctagtccagaagcagcaaatccaggagaagctgaggctccaggaagaa
aagcgccagcgagacctggactgggaccggcggaggattcagggggctcgcgccaccctg
ctgtttgagcggcagcagtggcggcggcagcgcgacctgcgcagagctctggacagcagc
aacctcagcctggccaaggagcagcatttgcaacccatgggccacaggtcactgctgtcc
acacggggtgcatgggtggagggcaccggaaagccccagggaagagcactgagtctgctg
ggggagtcaggaaggcttcctggaggaggcttcattggtgagacccaggaggaggatggc
actgcctattcagggacggctttccaggtagagagaggggctcctgggctgccccagaaa
agcctggagtctcttggttatagacaggactccaagagttcagagctccagctacaagaa
gatgtcaaatgcaagcaaacaggcccacacgggctcccaggcaaagcatttgaacaaatg
accacactgaagacgttgttcaaattcacagtcgatcaagtaatgctgccacccccaggc
tgggctcgggacctgcagcccgccatgcctgagcctcccaccccctccatgagctcctgt
gcggcccaagcctccccgacgagcgccgccccgtgctccacggcgcccagtcccatcgac
cgcccaagggctgaggagtgcgagcgcacggcgcgggactag

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_4|210_aa
MWESLELSRDLLKSFAPNADSDMDNKVQAEVVSDRDEELVGNWSKETPTQPAAQVRKRLE
IHAASSSLTTTIRVPQDLSSQCTQPACILKRKHSSKKGPQRLNLVKELCKQAPGPEPHKP
LIGNRAPGLHPSSAASASTIAQDETGSFRAHRGTEAERIPGVAVDGALPMDWAFAHTLHQ
ILTAAEPGLTEILAFGFQCPPPDLQQTPQS

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_4|633_bp
atgtgggaaagtttggaactttctagggacttgttgaaaagctttgccccaaatgctgac
agcgatatggacaataaagtccaggctgaggtggtctcagatagagatgaggaacttgtt
gggaactggagcaaagagactcccacccagccagctgcccaagtcagaaagcggcttgag
attcatgctgcctcctcctctctcaccaccaccatccgtgttcctcaggacctgtcatcc
cagtgcacacagccagcctgcattttaaaaaggaaacacagctccaaaaagggccctcag
aggctgaatttggtcaaagagctttgcaaacaagctccaggtcctgagccccacaaacct
ctaattgggaacagagccccaggacttcacccctcctctgcagcctccgcctccaccatt
gctcaggatgaaacaggctccttcagagcccaccggggcacggaggctgagaggatccca
ggagtggctgttgatggagcccttcccatggactgggcctttgcacacacattgcaccaa
attctcacagcagccgagccagggctcacagagatactagcctttggattccagtgccct
cccccagacctccaacagaccccacagagctga

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_5|118_aa
MERAAPSRRVPLPLLLLGGLALLAAGVLANPAGSGGRIPGPGGSGALAGLGDAGWGLPSP
GRLALDAGSALGYLSSPLPASTPRGPKAVVLPGSVLSGDWGLVTSTNGKGDGVFQEQI

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_5|357_bp
atggagcgcgccgcgccgtcgcgccgggtcccgcttccgctgctgctgctcggcggcctt
gcgctgctggcggccggagtcctagcaaatccagccggctccgggggccgcatcccgggc
cctgggggttcaggggcgctagctgggctgggggacgctggctgggggttaccgagccca
gggcgccttgcacttgacgctggaagcgccctcggctatctttcctcaccgcttcccgcc
tccacccccagagggcccaaggcagtggttctgccgggctcggtgctcagcggggactgg
ggactggtcacttccaccaacggcaagggggatggagttttccaagagcagatctga

>gi568815576f:45317610_45532359|GENSCAN_predicted_peptide_6|268_aa
MGVRAGASPAAPTALAHAPLLALLSAPQLRAAFATPVSPVSSYQRVPVLLGCDMDSERRK
TPSLAPKEVLIQEEQERTVLEAARTEPDARTQGPGAVVDADVLLEACCADGHRMATHQKD
CSLPYATESKECRMVQEQCCHSQLEELHCATGISLANEQDRCATPHGDNASLEATFVKRC
CHCCLLGRAAQAQGQSCEYSLMVGYQCGQVFQACCVKSQETGDLDVGGLQETAARGSMGG
WPCSHFLDKIIEVEEEQEDPYLNDRCRX

>gi568815576f:45317610_45532359|GENSCAN_predicted_CDS_6|804_bp
atgggggtcagggctggggcctctcctgcagcacccacagctctggctcacgccccgctc
ctggccctgctgagtgccccccaactcagggctgcatttgccacccctgttagcccagtg
tcgagttaccagcgagtgcctgttctgcttgggtgcgacatggacagtgagaggaggaag
acaccatccctggccccgaaggaggttttaattcaggaggagcaagaaagaacagtgttg
gaggctgccaggactgagcctgatgccagaactcagggcccaggagccgtggtggacgcg
gatgtcctcctggaggcctgctgtgcggacggacaccggatggccactcatcagaaggac
tgctcgctgccatatgctacggaatccaaagaatgcaggatggtgcaggagcagtgctgc
cacagccagctggaggagctgcactgtgccacgggcatcagcctggccaacgagcaggac
cgctgtgccacgccccacggtgacaacgccagcctggaggccacatttgtgaagaggtgc
tgccattgctgtctgctggggagggcggcccaggcccagggccagagctgcgagtacagc
ctcatggttggctaccagtgtggacaggtcttccaggcatgctgtgtcaagagccaggag
accggagatttggatgtcgggggcctccaagaaacggctgcccggggcagcatgggtggc
tggccttgctcccacttcttggataagatcattgaggttgaggaggaacaagaggaccca
tatctgaatgaccgctgccgagnn