GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:44:28 Sequence gi568815592f:96503614_96715769 : 212156 bp : 36.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6218 6827 610 1 1 88 14 252 0.193 13.33 1.02 Intr + 7796 7922 127 0 1 111 87 -3 0.114 0.72 1.03 Intr + 18230 18337 108 2 0 80 34 127 0.345 4.98 1.04 Intr + 19533 19678 146 0 2 109 66 204 0.986 19.31 1.05 Intr + 20769 20797 29 1 2 86 96 18 0.993 -0.68 1.06 Intr + 21684 21781 98 2 2 108 80 74 0.991 6.49 1.07 Intr + 22708 22822 115 1 1 75 68 131 0.996 9.33 1.08 Intr + 24889 25019 131 0 2 99 131 178 0.970 21.77 1.09 Intr + 27254 27327 74 2 2 85 -9 72 0.416 -4.67 1.10 Intr + 32631 32777 147 1 0 37 61 141 0.742 5.59 1.11 Intr + 33761 33936 176 0 2 54 39 180 0.958 8.44 1.12 Intr + 35018 35197 180 1 0 68 61 106 0.912 5.04 1.13 Intr + 36922 37042 121 0 1 25 90 103 0.955 3.35 1.14 Intr + 39281 39403 123 0 0 63 82 116 0.487 8.14 1.15 Intr + 39989 40135 147 0 0 -1 48 149 0.182 1.29 1.16 Intr + 45837 45965 129 1 0 10 74 97 0.029 0.25 1.17 Intr + 46056 46186 131 1 2 81 47 95 0.965 4.19 1.18 Intr + 47820 47900 81 2 0 82 115 4 0.724 1.42 1.19 Term + 49672 49890 219 0 0 134 39 223 0.998 18.06 1.20 PlyA + 50358 50363 6 1.05 2.04 PlyA - 50377 50372 6 1.05 2.03 Term - 53465 53073 393 2 0 13 49 250 0.632 7.65 2.02 Intr - 53677 53490 188 2 2 -20 36 116 0.415 -5.91 2.01 Init - 53970 53766 205 0 1 72 57 189 0.320 13.26 2.00 Prom - 63268 63229 40 -5.15 3.00 Prom + 76952 76991 40 -4.35 3.01 Init + 100001 100159 159 1 0 74 115 63 0.947 7.47 3.02 Intr + 101137 101311 175 0 1 78 58 76 0.773 2.19 3.03 Intr + 102289 102458 170 2 2 27 86 71 0.601 -0.46 3.04 Intr + 106959 107145 187 2 1 94 89 66 0.944 5.64 3.05 Term + 111996 112159 164 1 2 100 49 119 0.992 6.32 3.06 PlyA + 112935 112940 6 1.05 4.00 Prom + 130053 130092 40 -2.45 4.01 Sngl + 135530 136144 615 1 0 59 44 279 0.785 16.54 4.02 PlyA + 136316 136321 6 1.05 5.04 PlyA - 139414 139409 6 1.05 5.03 Term - 145169 144876 294 2 0 26 36 281 0.565 11.02 5.02 Intr - 145483 145223 261 1 0 42 -27 363 0.287 16.96 5.01 Init - 146544 146488 57 0 0 60 33 68 0.589 -0.14 5.00 Prom - 147391 147352 40 -4.25 6.05 PlyA - 147566 147561 6 1.05 6.04 Term - 153901 153757 145 0 1 45 50 145 0.123 2.80 6.03 Intr - 161133 161044 90 2 0 71 81 63 0.090 2.09 6.02 Intr - 163759 163693 67 2 1 50 78 68 0.497 -0.96 6.01 Init - 169477 169342 136 1 1 52 111 165 0.751 15.55 6.00 Prom - 169819 169780 40 -5.85 7.00 Prom + 169943 169982 40 -6.55 7.01 Init + 171041 171121 81 1 0 84 90 47 0.361 5.52 7.02 Term + 182230 182928 699 0 0 39 36 337 0.240 16.15 7.03 PlyA + 183379 183384 6 1.05 8.03 PlyA - 183539 183534 6 -0.45 8.02 Term - 185779 185668 112 0 1 55 40 129 0.029 1.85 8.01 Init - 196921 196764 158 1 2 53 52 152 0.309 7.53 8.00 Prom - 198834 198795 40 -7.55 9.03 PlyA - 199044 199039 6 1.05 9.02 Term - 201137 200986 152 0 2 58 33 124 0.017 0.99 9.01 Intr - 206597 206521 77 1 2 77 63 123 0.081 6.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_1|963_aa MAKRAPDMDQAAASEGASCKPWWLPCGAKTVGAQRVRVKALEPLPRFHRIYENTSISRQK SAAWMEPSWRISTTAVQRRNVALEAPHKSLHWALPSGAVRRGPPSSRPQNGISTNSLYYV PGKATGTQHHPVKAAEGTVPSKATEVEMSKALGVHPLYQCGLDVRYGAKGDYFGALRFND CPSGYWTCMGPVVPSFWLISPSWSVCVLSTPFYKDNRHIGLEYPNDLILTYLLFKNAIYK YSHILSSSASTASQAVMADAWEEIRRLAADFQRAQFAEATQRLSERNCIEIVNKLIAQKQ LEVVHTLDGKEYITPAQISKEMRDELHVRGGRVNIVDLQQVINVDLIHIENRIGDIIKSE KHVQLVLGQLIDENYLDRLAEEVNDKLQESGQVTISELCKTYDLPGNFLTQALTQRLGRI ISGHIDLDNRGVIFTEAFVARHKARIRGLFSAITRPSQSRQASTRALNPQAANQYQSVAS VLEELVNSGRLRGTVVGGRQDKAVFVPDIYSRTQSTWVDSFFRQNGYLEFDALSRLGIPD AVSYIKKRYKTTQLLFLKAACVGQGLVDQVEASVEEAISSGTWVDIAPLLPTSLSVEDAA ILLQQVMRAFSKQASTVVFSDTVVVSEKFINDCTELFRELMHQKAEKEMKNNPVHLITEE DLKQISTLESVSTSKKDKKDERRRKATEGSGSMRGGGGGNAREYKIKKVKKKGRKDDDSD DESQSSHTENEESGYDKKDENILEDVKWGKTSQFLEIFQDIEKARDKTLEADSELESVFM SSTTSASGTGRKRTIKDLQEEVSNLYNNIRLFEKGMKFFADDTQAALTKHLLKSVCTDIT NLIFNFLASDLMMAVDDPAAITSEIRKKILSKLSEETKVALTKLHNSLNEKDQHALLVKY QGLVVKQLVSQSKKTGQGDYPLNNELDKEQEDVASTTRKELQELSSSIKDLVLKSRKSSV TEE >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_1|2892_bp atggctaaaagggccccagatatggatcaggctgctgcttcagagggtgcaagctgtaag ccttggtggcttccatgtggtgctaagactgtgggtgcacagagggtaagagttaaggct ttggagcctctgcccagatttcatagaatatatgaaaacacctcaatatccaggcagaag tctgctgcctggatggaaccctcatggagaatctctactacagcagtgcagaggagaaat gtagcgttggaggccccacacaaaagtctccactgggcactgcctagtggagctgtgaga agagggccaccgtcctccagaccccagaatggtatatccaccaacagcttgtactatgtg cctggaaaagccacaggcactcaacaccatcctgtaaaagcagctgaggggactgtaccc agcaaagctacagaggtggagatgtccaaggccttgggagtccaccccttgtatcagtgt ggcctggatgtgagatatggagccaaaggagattatttcggagctttaagatttaatgac tgcccttctgggtattggacttgcatggggcctgtagtgccttcgttttggctgatttct ccctcttggagtgtctgtgtcttaagcacacctttttataaggacaacaggcatattgga ttagagtaccctaatgacctcattttaacctacttactgtttaaaaacgctatctacaaa tacagtcacattttaagttcctccgcgtctactgcgagtcaggccgtgatggcggacgcc tgggaagagattaggcggttggcggccgacttccagcgggcgcagttcgccgaggccacg cagaggttgtccgagcggaactgcattgagattgttaataaattgattgctcagaaacag ctagaagtagttcatacactcgatggaaaggaatatattactccagcccaaattagtaaa gaaatgagagatgagctacatgtccgaggtggtcgagtaaacattgttgatctacaacag gtaattaatgtggacctgattcatattgaaaatagaattggtgacattattaaatcagaa aagcatgttcagttagtgttgggacaactgatagatgagaattatttggatcggttggca gaagaggtcaatgataaattgcaagaaagtggtcaggtcaccatatcagaactgtgtaaa acttatgatcttcctgggaactttctgacacaggcactaactcagcgacttggtagaatt atcagtggacatattgatcttgataatagaggagtaatttttacggaagcttttgtagct cgacataaagcacgtatccgtggactattcagtgctattacccggccctctcaaagtaga caagctagcacaagggccctcaatccccaggctgccaatcagtaccagtctgtggcctct gtgcttgaggaacttgttaatagcggacgcttacgaggcactgtggttggtgggagacag gataaagctgtgtttgtccctgacatctactccaggacacagagtacttgggtggattcc tttttcaggcagaatggctatctagaatttgatgctttgtccagacttggaatcccagat gctgtaagctacataaagaaaagatataagactacacaactcttgtttttgaaagcagct tgtgttggtcaaggacttgtggatcaagtggaagcatcagtagaagaagccatcagctct ggaacatgggttgatattgcacctctgctacccacttctttatcagttgaagatgctgcc atattgcttcagcaggtgatgagggcattcagcaaacaggcctcaactgtagtctttagc gacactgttgtagtcagtgaaaaatttataaatgactgtacagaactgttccgtgagctg atgcaccagaaagctgaaaaggaaatgaaaaataatcctgtgcatttaatcactgaagaa gatctgaaacaaatctccactttagaaagcgttagtacaagtaaaaaggataaaaaagat gagcgaagaaggaaagcaacagagggcagtggaagcatgagaggaggaggtgggggcaat gccagagagtacaaaattaaaaaagtcaagaagaaaggaagaaaagatgatgatagtgat gatgaatctcaatcatcccacactgaaaatgaggaaagtggttatgacaaaaaggatgaa aacatcttagaggatgtaaagtggggaaaaacttcacaattcttggagatattccaagac attgaaaaggcaagggataaaacattggaagctgattcagagttagaaagtgtattcatg tcttcaacaacttctgcttctgggacgggcagaaaacgcacaatcaaggacttgcaagaa gaagtttcaaacctgtacaataacattaggttatttgaaaaagggatgaagttttttgca gatgacacacaggctgctcttaccaaacacttgctgaagtcagtgtgtactgatatcact aacctcattttcaacttcttagcttcggatttaatgatggcagtagacgatcctgcagcc attacaagtgaaataagaaagaaaattttaagtaaattatcagaagaaaccaaagtagct cttacaaaactccataactctctgaatgaaaaggatcagcatgctcttttggtaaagtat caaggtttggttgtaaagcagctagtcagtcaaagtaagaagactgggcagggagattat cccttgaataatgaattagacaaagaacaagaagatgttgccagtactactcgtaaagag cttcaagaactttcttcatccattaaagaccttgttctcaaatctaggaaatcatctgtg acggaagagtaa >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_2|261_aa MDNKVQVEVISDGDEELVGNWSKGDSRYVLAKKLVASLPCPRDLWNFELERDDLGYLVEE ISKQQSIQEKKNPFSEEKFKPAAETCISNREPNVKSPRQWGKMSPWRIRDLHGSLSHHRP RGLGENGFVGQGLGALHPNHSNHGFRGCKPKPWQLPRDVESASARKSRIEVWKPPPIFQK MYGNVWMPLQTFAAGAGSSWRTSARAVQKGNVGWEPPHRVPTRAPPSGALRRGPLSSRPQ NGSSTDSLHHVPGNATLNASP >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_2|786_bp atggacaataaagtccaggttgaggtgatctcagatggagatgaggaacttgttgggaac tggagcaaaggtgactctcgttatgttttagcaaagaaactggtggcatctctcccctgc cctagagatctgtggaactttgaacttgagagagatgatttagggtatctggtggaagaa atttctaagcagcaaagcattcaagaaaagaaaaacccattttctgaggagaaattcaag ccagctgcagaaacttgcataagtaacagggaaccaaatgttaaatcaccaagacaatgg gggaaaatgtctccatggcgtatcagagatcttcacggcagcctctcccatcacaggccc agaggcctaggagaaaatggtttcgtgggccagggacttggtgccctgcatcccaaccac tctaatcatggcttcagagggtgcaagcccaagccttggcagcttccacgtgatgttgag tctgcgagtgcacggaagtcaagaattgaggtttggaaacctccgcctatatttcagaag atgtatggaaacgtctggatgcccctgcagacgtttgctgcaggggcggggtcctcatgg agaacctctgctagagcagtgcagaagggaaatgtggggtgggagcccccacacagagtc cctactagggcaccgcctagtggagctctgagaagagggcctctgtcctccagaccccag aatggtagctccactgacagcttgcaccatgtgcctggaaatgccacactcaatgccagc ccatga >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_3|284_aa MTTAHFYCQYCTASLLGKKYVLKDDSPYCVTCYDRVFSNYCEECKKPIESDSKDLCYKDR HWHEGCFKCTKCNHSLVEKPFAAKDERLLCTECYSNECSSKCFHCKRTIMPGSRKMEFKG NYWHETCFVCENCRQPIGTKPLISKESGNYCVPCFEKEFAHYCNFCKKVITSGGITFCDQ LWHKECFLCSGCRKDLCEEQFMSRDDYPFCVDCYNHLYANKCVACSKPISGLTGAKFICF QDSQWHSECFNCGKCSVSLVGKGFLTQNKEIFCQKCGSGMDTDI >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_3|855_bp atgacaactgctcacttttactgtcaatactgcacagcatcacttcttgggaagaaatat gtactaaaggatgacagtccatactgtgttacatgttatgatcgtgtattttctaactat tgcgaggaatgcaaaaaaccaattgaatctgattctaaggatctttgttacaaagaccgg cactggcatgaaggatgcttcaagtgcaccaaatgcaatcactctttggtggaaaagcct tttgctgccaaggatgagcgcctgctgtgcacggagtgctattctaacgagtgctcctcc aagtgcttccactgcaagaggaccatcatgcctggttcccgcaaaatggaatttaaggga aactactggcatgaaacctgttttgtgtgtgagaattgccgacaacctatagggacaaag cctttgatctccaaagagagtggcaattattgtgtgccatgttttgagaaggagtttgct cactactgcaacttttgtaagaaggtgataacttcaggtgggataacattttgtgaccag ctatggcataaagagtgttttctgtgtagtggctgtaggaaagatctctgtgaagaacag ttcatgtccagagacgactatccattctgcgtggactgctacaaccatctttatgccaac aagtgtgtagcctgttccaaacccattagtggtctcacaggtgccaagtttatctgcttt caagacagccagtggcatagcgaatgctttaactgcgggaaatgctctgtctccttggtg ggtaaaggcttcctgacccagaacaaggaaatcttctgccaaaaatgtggctccggaatg gacactgacatctag >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_4|204_aa MEDEMNEMKREENFREKRIKRREQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIRENFPNLARQANIQIQEIQRMPQRYSSRRATPRHIIVKFTKVEMKEKMLRATREKDQV THKWKPIRLTADLLAETLQARREWGPIFNILKGKNFQPRISYPAKLSFISEGEIKSFTDK QMLRDFVTIKARKKLHQLTSKITS >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_4|615_bp atggaagatgaaatgaatgaaatgaagcgagaagagaattttagagaaaaaagaataaaa agaagggaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctc attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccgggagaacttccccaatctagcaaggcaggccaacattcaaattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaaa ttcaccaaagttgaaatgaaggaaaaaatgttaagggcaaccagagagaaagaccaggtt acccacaaatggaagcccatcagactaacagctgatctcttggcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaaggaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatcctttacagacaag caaatgctgagagattttgtcaccatcaaagctaggaagaaactgcatcaactaacgagc aaaataaccagctaa >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_5|203_aa METTHTRTLHSETAEHRHKLALSKEKAIFSSSAKIMKPNDEKPDELKPSISQALLQLEMS SDLKAQLRELNITAAKETEVGSGRKAIIIFVPVPKLKSFQKIQVQLRRILPKPTRKSCMK NKQKHPRSRALTAVHDAILEDLVFPSEIVGKRICMKLDGGLLVKVHLDKAQQNNVEHKVE IFSGVYKKLTGKDVNFEFPEFQL >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_5|612_bp atggaaacgacccacacgaggacacttcatagtgaaactgcagagcacagacataaactg gcgctcagcaaggaaaaggccatattcagttccagcgccaagatcatgaagcccaatgac gagaagccggacgagctcaagcccagcatctcccaggctcttctgcagctggagatgagc tcagacctcaaggctcagctcagggagctgaatattacggcagccaaagaaactgaagtt ggtagtggtcggaaagctatcataatctttgttcccgttcctaaactgaaatctttccag aaaatccaagtccagctaaggagaattctgcctaagccaactcgaaaaagctgtatgaaa aataagcaaaagcatcccaggagccgtgccctgacagctgtgcacgacgcaatccttgag gacttggtcttcccaagtgaaattgtgggcaagagaatctgcatgaaactggacggtggc ctgctcgtaaaggttcatttggacaaagcacagcagaacaatgtggaacacaaagttgaa attttttctggtgtctataagaagctcacgggcaaggatgttaattttgaattcccagag tttcaattgtaa >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_6|145_aa MSGRTLEMSGRTLEISGSWVMEDQDKLSNVSSSIPWLLEVKGPKPEPSIKSAGKEIKEFA EFRSSTTNYWLSVQEITGPRPNTWHSAAIAALFTITKRGQLRSAPPGGEWRALGDGPLSS LSAGEPSVQKLPGSMPSNGLSLHAL >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_6|438_bp atgagtggcagaaccttggagatgagtggcagaaccttggagataagtggcagttgggtg atggaagatcaggataaactcagcaatgtcagcagcagcatcccttggctgctggaagtg aaggggccaaagcccgagcctagtatcaaatcagctggcaaagaaataaaggagtttgca gaatttagatctagcaccacaaattattggctttctgtgcaggaaataacaggacctagg ccaaatacctggcattcggcagcaatagcagcattattcacaataaccaaaagaggtcag ctgcggtctgcacctcctggaggggaatggcgagccctgggggatgggcctctgtcctca ctctctgctggagagcctagtgtacaaaagctcccaggctccatgccgtccaatggcctg tctctgcacgctctctag >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_7|259_aa MKNMKEGFPEKLIYLKDKLELAKNRLEPRDLVPCIPATLAVAERGKHRAQAVASEGVSLR HWWLSHGFEPVSAQKSRTGVWGPPPRFQKIYRNAWMPRQKFASGARFSWRTSARAMQKGN VGSEPPHRVPTGELPSRATVSRPPSSRPQNGRSTDSLYHALGKATDTQCQPMKAARRETV PCKATGVELPKTIGNQLLHQCDLDMRCGVKGDYFGALRFDCPSGFLTCMGPVAPFVLANL SHLEWLYLPNVCTPIVSRK >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_7|780_bp atgaagaacatgaaagaaggcttccctgagaaactcatttatctaaaggataaattggag ttagccaagaacaggttggagcctagggacttggtgccctgcatcccagccactctagct gtggctgaaaggggaaaacatagagctcaggctgtggcttcagagggtgtaagcctcagg cattggtggctttcacatggttttgagcctgtgagtgcacagaagtcaagaactggggtt tggggacctccacctagatttcagaagatctatagaaacgcctggatgcccaggcagaag tttgcttcaggggcaaggttctcatggagaacctctgctagggcaatgcagaagggaaat gtgggttcagagcccccacacagagtccctactggggaactgcctagtagagcaactgtc tccagaccaccgtcctccagaccccagaatggtagatccactgacagtttgtaccatgca cttggaaaagccacagacactcaatgccagcccatgaaggcagctcgaagagagactgta ccctgcaaagccacaggggtggagctgcccaagaccattggaaaccagcttttacatcag tgtgacctggatatgagatgtggagtcaaaggagattattttggagctttaagatttgac tgcccctctggatttctgacttgcatggggcctgtagccccatttgttttagccaatttg tcccatttggaatggctgtatttacccaatgtctgtacccccattgtatctaggaagtaa >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_8|89_aa MRKVFHIPSFEDGERWPQAKECGQALEAGKGKGTYFLLDLPKRNAVTDKSLAKDCAGSYL KPAQYWVSLSAHGGHYLVTAYICSMPSIP >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_8|270_bp atgagaaaggtgtttcatattcccagttttgaagatggagaaagatggccacaagccaag gaatgtggacaggctctagaagctggaaaagggaaggggacctattttcttctagacctt ccaaaaaggaatgcagtcactgacaaatctttagccaaagactgtgctgggtcatacctg aagccagcccagtactgggtctcactcagtgcccacggcggccactacctggtgactgcc tatatttgctcaatgccctcaataccctag >gi568815592f:96503614_96715769|GENSCAN_predicted_peptide_9|76_aa XEDKGAHAQKEVDWECLRTQRLTIKEGAAKSSKQNPCLPLTAQVPLDLLDQVAGQLVQQP GPAAEPSFVLGTTLNE >gi568815592f:96503614_96715769|GENSCAN_predicted_CDS_9|231_bp nnagaggacaaaggagcacatgctcaaaaagaagtagattgggagtgcctgcggacccag agactgaccatcaaagaaggggctgctaaaagctccaaacagaatccctgtttgccactg acagctcaagtaccattggatctgctggatcaagtggcagggcagcttgtacagcagcct ggacctgctgcagagccttcttttgttttgggcaccactttaaatgagtag