GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:10:25 Sequence gi568815582f:11865325_12067844 : 202520 bp : 45.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 244 239 6 1.05 1.14 Term - 7847 7795 53 0 2 73 47 55 0.397 -2.51 1.13 Intr - 10605 10437 169 0 1 81 90 10 0.626 0.02 1.12 Intr - 10851 10762 90 0 0 50 79 70 0.807 2.49 1.11 Intr - 12256 12083 174 1 0 61 97 161 0.999 14.44 1.10 Intr - 17852 17691 162 2 0 8 116 118 0.658 6.77 1.09 Intr - 19950 19857 94 2 1 79 75 64 0.996 4.17 1.08 Intr - 21287 21147 141 1 0 58 103 64 0.883 4.37 1.07 Intr - 21607 21453 155 1 2 44 83 78 0.994 1.77 1.06 Intr - 22406 22246 161 0 2 43 99 116 0.268 7.91 1.05 Intr - 31461 31234 228 1 0 29 87 157 0.183 7.54 1.04 Intr - 32557 32516 42 2 0 77 91 37 0.208 1.11 1.03 Intr - 50382 50045 338 2 2 19 70 670 0.311 53.46 1.02 Intr - 51056 50814 243 1 0 77 26 197 0.572 9.01 1.01 Init - 56276 56188 89 2 2 72 90 40 0.785 2.74 1.00 Prom - 57481 57442 40 -5.06 2.00 Prom + 58561 58600 40 -2.66 2.01 Init + 59947 59987 41 0 2 80 81 16 0.949 -0.22 2.02 Intr + 60874 60980 107 1 2 93 101 162 0.984 17.86 2.03 Term + 65909 66069 161 0 2 40 43 131 0.235 1.90 2.04 PlyA + 66460 66465 6 -3.44 3.05 PlyA - 66923 66918 6 -0.45 3.04 Term - 68337 68189 149 1 2 47 44 184 0.965 7.96 3.03 Intr - 68600 68501 100 2 1 69 86 64 0.973 3.98 3.02 Intr - 72344 72216 129 2 0 82 92 32 0.931 3.89 3.01 Init - 75919 75845 75 1 0 70 56 113 0.985 5.39 3.00 Prom - 76358 76319 40 -3.56 4.00 Prom + 85974 86013 40 -4.66 4.01 Init + 102263 102519 257 1 2 75 59 214 0.816 13.90 4.02 Intr + 111272 111489 218 2 2 11 101 102 0.372 1.95 4.03 Intr + 112258 112361 104 2 2 83 90 49 0.356 4.49 4.04 Intr + 112799 112841 43 1 1 53 81 22 0.196 -4.19 4.05 Intr + 120231 120304 74 1 2 96 41 93 0.636 4.53 4.06 Intr + 122400 122418 19 2 1 127 34 14 0.002 -3.92 4.07 Intr + 124768 124851 84 2 0 78 89 25 0.005 1.39 4.08 Intr + 133973 134034 62 0 2 40 115 81 0.022 4.45 4.09 Intr + 137667 137719 53 2 2 77 99 79 0.016 5.61 4.10 Intr + 161996 162120 125 2 2 85 84 103 0.729 9.83 4.11 Intr + 177573 177753 181 1 1 77 94 441 0.975 42.53 4.12 Intr + 181060 181130 71 1 2 71 115 34 0.554 3.23 4.13 Intr + 183048 183296 249 1 0 80 105 245 0.463 22.81 4.14 Intr + 186523 186898 376 2 1 89 100 522 0.756 47.57 4.15 Intr + 196204 196322 119 1 2 83 94 130 0.760 13.21 4.16 Term + 196606 196616 11 2 2 70 43 5 0.276 -7.44 4.17 PlyA + 197055 197060 6 1.05 5.02 PlyA - 198048 198043 6 -0.45 5.01 Term - 199163 198970 194 0 2 68 47 221 0.956 13.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 201313 201088 226 1 1 84 61 100 0.919 4.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:11865325_12067844|GENSCAN_predicted_peptide_1|712_aa MSHSAQLAIISLTHLQAFLIRILNREGFSSLCPAGQSGPLLWAKILSVFGPAQSCTNQGL PLPPHQVTGGAAERPPDLRLLALPAGSRSRFRLRPVPELRLPSAPGKTAAGGGGGGGGGG GGSSSGSSSSDSAPDCWDQADMEAPGPGPCGGGGSLAAAAEAQRENLSAAFSRQLNVNAK PFVPNVHAAEFVPSFLRGPAAPPPPVGGAANNHGAGSGAGGRAGSNSAVSMELSEPIVEN GETEMSPEESWEHKEEISEAEPGGGSLGDGRPPEESAHEMMEEEEEIPKPKSVVAPPGAP KKEHVNVVFIGHVDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVLINKMDDPTVNWSNERY EECKEKLVPFLKKVGFNPKKDIHFMPCSGLTGANLKEQSDFCPWYIGLPFIPYLDNLPNF NRSVDGPIRLPIVDKYKCRISLWAIELLYSTKKSETGFYFTFDQDMGTVVLGKLESGSIC KGQQLVMMPNKHNVEVLGILSDDVETDTVAPGENLKIRLKGIEEEEILPGFILCDPNNLC HSGRTFDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITALICLVDKKSGEKSKTRPRFV KQDQVCIARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD >gi568815582f:11865325_12067844|GENSCAN_predicted_CDS_1|2139_bp atgagccacagcgcccagctggctattatatctttaacacatctccaagctttcctcatc cgcattttgaacagggagggcttcagcagcctgtgtccagcaggccagagtggacccctc ctatgggctaaaatactctcggtcttcggcccagcacagagctgcaccaaccagggcctg ccgctgcccccccaccaggtgacaggtggggccgccgagcggccaccagacctccgcctc ctcgcgttgcctgccgggagccggagccgcttccggttgcgccctgtccctgaacttcga ctcccgtcggcccccgggaaaaccgcagcgggtggcggcggcggcggcggcggcggcggc ggcgggagcagcagcggcagcagcagcagcgactcggcgcctgactgctgggaccaggcg gacatggaagcccccgggccgggcccttgcggcggcggcggctccctggcggcggcggcc gaggcccagcgggagaacctcagcgcggccttcagccggcaactcaacgtcaacgccaag cccttcgtgcccaacgtccacgccgccgagttcgtgccgtccttcctgcggggcccggca gcgccgccacccccagttggcggcgccgccaataaccacggagccggcagcggcgcggga ggccgtgcgggttcaaattcagctgttagcatggaactttcagaacctattgtagaaaat ggagagacagaaatgtctccagaagaatcatgggagcacaaagaagaaataagtgaagca gagccagggggtggttccttgggagatggaaggccgccagaggaaagtgcccatgaaatg atggaggaggaagaggaaatcccaaaacctaagtctgtggttgcaccgccaggtgctcct aagaaagagcatgtaaatgtagtattcattgggcacgtagacacaaatcaggaagaacga gacaagggtaaaacagtagaagtgggtcgtgcctattttgaaaccgaaaagaagcatttc acaattctagatgcccctggccacaagagttttgtcccaaatatgattggtggtgcctct caagctgatttggctgtgctggtaatctcagccaggaaaggagagtttgaaactggattt gaaaaaggaggacagacaagagaacatgcaatgttggcaaagacagcaggtgtaaaacac ctaattgtgctaattaataagatggatgatccaacagtaaattggagcaatgagagatat gaagaatgtaaggagaaactagtgccatttttgaaaaaagttggcttcaatcccaaaaag gacattcactttatgccctgctcaggacttactggagcaaatctcaaagagcagtcggat ttctgtccttggtacattggattaccgtttattccatatctggataatttgccgaacttc aatagatcagttgatggaccaatcaggctgccaattgtggataagtacaagtgtagaatt tcactttgggctattgagctgttatacagcaccaagaagtctgaaactggattttatttt acatttgatcaggatatgggcactgtggtcctgggaaagctggaatcaggatctatttgt aaaggccagcagcttgtgatgatgccaaacaagcacaacgtggaagttcttggaatactt tccgatgatgtagagactgataccgtagccccaggtgaaaacctcaaaatcagactgaaa ggaattgaagaagaggagattcttccagggtttatactttgtgatcctaataatctttgt cattctggacgcacatttgatgcccagatagtgattatagagcacaaatccatcatctgc ccaggctataatgcggtgctgcatattcatacctgtattgaggaggtggaaataacagcc ttaatctgcttggtagacaaaaaatcaggagaaaaaagtaagacccgaccccgttttgtc aaacaagatcaagtatgcattgctcgcttaaggacagcaggaaccatctgccttgagacc tttaaagacttccctcagatgggtcgtttcaccttaagagatgagggtaagaccattgca attggaaaagttctgaaactggttccagagaaagactaa >gi568815582f:11865325_12067844|GENSCAN_predicted_peptide_2|102_aa MKGGNPILREGVAWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGDNGKPSTLVL TDFNKVQFTEIEGRKRKRPKKGKFAVLPSHHLIHGPDPKDLT >gi568815582f:11865325_12067844|GENSCAN_predicted_CDS_2|309_bp atgaagggagggaatccaattttacgagagggtgtagcctggtgctgccatggccacgac tgttgttacactcgagctgaggaggccggctgcagccccaagacagagcgctactcctgg cagtgcgtcaatcagagcgtcctgtgcggtgacaacggcaaaccatccaccctggtgttg actgactttaacaaggttcagttcacagagattgagggcagaaaaaggaaacggcctaaa aagggtaagtttgctgtgttgccctcacaccacttgattcatggtcctgatcctaaggat ctcacctga >gi568815582f:11865325_12067844|GENSCAN_predicted_peptide_3|150_aa MRVLWMLFWLLFWLLLEFISHQSTCVINTLADHHHRGTDFGGSPWLRIIIAFPRSYKVVL TLWTVYLWLSFLKTIFQSENGHDVSTDVQQRARRSNRRRQEGNKIGLKDVITLRRHVETK GRAKIRKMKVTTKINHHDKINGKRKTAKKQ >gi568815582f:11865325_12067844|GENSCAN_predicted_CDS_3|453_bp atgcgggtgctgtggatgctcttttggctcctcttttggctcctgctggaatttatcagc catcagtccacctgtgttatcaatactctggctgaccatcatcatcgtgggactgacttt ggtggaagtccttggttacgtatcattattgcatttccgagaagttataaagttgtcctt accctctggacagtttacctttggttgtctttcctgaagactatcttccagtctgaaaat ggacatgatgtatccacagatgtacagcagagagccaggaggtccaaccgccgtagacag gaaggaaataaaattggcctgaaagacgtcattactctacggagacatgtggaaacaaaa ggtagagctaaaatccgtaagatgaaggtgacaacgaaaatcaaccatcatgacaaaatc aatggaaagaggaagaccgccaaaaaacagtaa >gi568815582f:11865325_12067844|GENSCAN_predicted_peptide_4|681_aa MANIDLEKSRTGDEIILPRGLEYTVEECTCEDCIKSKPKVDSDHCFPLPAMEEGATILVT TKTNDYCKSLPAALSATEIEKSISASTALHTGSGDLDLRRGLRSGSAGGASSAARLYFRG GAAGAGLLSPGLSGARQPQKRQRRRRGAGTGPGRGTMSAHLGTVTSSFFFPVQTSSSHHR GGGVDGDAGSEREGKICEMDSPNNEGAGSRSSMERAGHVVILQPITVILVGQKMEIKARR PAPTFQVDVRVKRMSQYGGLVAGLAPGSQNNDKRQFLLERLLDAVKQCQIRFGGRKEIAS DSDSRVTCLCAQFEAVLQHGLKRSRGLALTAAAIKQAAGFASKTETEPVFWYYVKEVLNK HELQRFYSLRHIASDVGRGRAWLRCALNEHSLERYLHMLLADRCRLSTFYEDWSFVMDEE RSSMLPTMAAGLNSILFAINIDNKDLNGQSKFAPTVSDLLKESTQNVTSLLKESTQGVSS LFREITASSAVSILIKPEQETDPLPVVSRNVSADAKCKKERKKKKKVTNIISFDDEEDEQ NSGDVFKKTPGAGESSEDNSDRSSVNIMSAFESPFGPNSNGSQSSNSWKIDSLSLNGEFG YQKLDVKSIDDEDVDENEDDVYGNSSGRKHRGHSESPEKPLEGNTCLSQMHSWAPLKVLH NDSDILFPVSGVGSYSPAVAR >gi568815582f:11865325_12067844|GENSCAN_predicted_CDS_4|2046_bp atggctaacattgacctggaaaagagcaggactggtgatgaaattattcttccgagaggc ctcgagtacacggtggaagaatgcacctgtgaagactgcatcaagagcaaaccgaaggtc gactctgaccattgctttccactcccagctatggaggaaggcgcaaccattcttgtcacc acgaaaacgaatgactattgcaagagcctgccagctgctttgagtgctacggagatagag aaatcaatttctgctagcaccgctctccacaccgggtccggcgacttggatctgcgccgc ggtttacgttccgggtcggcgggcggggcctcgtcggcggccaggctctacttccggggc ggggcggccggggcggggctcctgtctcccggcctgtctggagctcggcagccgcagaag cggcagcggcggcggcgcggcgcaggcaccggcccggggagaggcaccatgagcgcccat ctggggaccgtcacttcgtccttcttcttcccagtccaaacttcttcctctcaccatcga gggggtggtgttgatggggatgctggaagtgagagggagggcaagatttgtgaaatggac agtcctaataatgagggggctggcagccgctcctccatggagcgggctggccatgtggtc atcctgcaacccatcactgtgattttggttggccagaaaatggaaattaaagcaaggcgg ccagcccccaccttccaggtagatgtgagggttaaacgtatgtctcagtatggcggattg gtggcgggcctggccccaggatcacagaacaatgacaaaagacaatttctgctggagcga ctgctggatgcagtgaaacagtgccagatccgctttggagggagaaaggagattgcctcg gattccgacagcagggtcacctgtctgtgtgcccagtttgaagccgtcctgcagcatggc ttgaagaggagtcgaggattggcactcacagcggcagcgatcaagcaggcagcgggcttt gccagcaaaaccgaaacagagcccgtgttctggtactacgtgaaggaggtcctcaacaag cacgagctgcagcgcttctactccctgcgccacatcgcctcagacgtgggccggggtcgc gcctggctgcgctgtgccctcaacgaacactccctggagcgctacctgcacatgctcctg gccgaccgctgcaggctgagcactttttatgaagactggtcttttgtgatggatgaagaa aggtccagtatgcttcctaccatggcagcaggtctgaactccatactctttgcgattaac atcgacaacaaggatttgaacgggcagagtaagtttgctcccaccgtttcagacctctta aaggagtcaacgcagaacgtgacctccttgctgaaggagtccacgcaaggagtgagcagc ctgttcagggagatcacagcctcctctgccgtctccatcctcatcaaacctgaacaggag accgaccccttgcctgtcgtgtccaggaatgtcagtgctgatgccaaatgcaaaaaggag cggaagaagaaaaagaaagtgaccaacataatctcatttgatgatgaggaagatgagcag aactctggggacgtgtttaaaaagacacctggggcaggggagagctcagaggacaactcc gaccgctcctctgtcaatatcatgtccgcctttgaaagccccttcgggcctaactccaat ggaagtcagagcagcaactcatggaaaattgattccctgtctttgaacggggagtttggg taccagaagcttgatgtgaaaagcatcgatgatgaagatgtggatgaaaacgaagatgac gtgtatggaaactcatcaggaaggaagcacaggggccactcggagtcgcccgagaagcca ctggaagggaacacctgcctctcccagatgcacagctgggctccgctgaaggtgctgcac aatgactccgacatcctcttccctgtcagtggcgtgggctcctacagcccagcagtggcc cgttga >gi568815582f:11865325_12067844|GENSCAN_predicted_peptide_5|64_aa XSAFDISWSSGPISSEKMELTLRADSRVWDHRKSARTLEADGELRSLDTLATESPEEQEE LWVL >gi568815582f:11865325_12067844|GENSCAN_predicted_CDS_5|195_bp nagagtgcattcgacatcagctggagctcgggtcccatcagcagtgagaagatggagctg actctgcgagctgacagccgggtgtgggaccacaggaaatctgcaaggactctagaggct gatggggagctacgttccctggacacacttgctacagaaagcccagaagagcaggaagag ctctgggttctctaa