GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:06:01 Sequence gi568815589f:27424337_27624957 : 200621 bp : 39.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 682 677 6 1.05 1.06 Term - 5838 5629 210 0 0 48 49 158 0.004 4.21 1.05 Intr - 14381 14268 114 2 0 82 7 95 0.058 0.52 1.04 Intr - 14666 14590 77 0 2 51 110 47 0.192 1.52 1.03 Intr - 21355 21171 185 0 2 33 66 102 0.042 1.01 1.02 Intr - 28051 27868 184 2 1 81 63 66 0.181 1.32 1.01 Init - 31214 30797 418 2 1 68 106 566 0.963 53.04 1.00 Prom - 36822 36783 40 -3.85 2.04 PlyA - 36888 36883 6 1.05 2.03 Term - 41641 41531 111 1 0 33 49 139 0.064 2.08 2.02 Intr - 46919 46809 111 2 0 95 81 59 0.416 5.56 2.01 Init - 47268 47074 195 0 0 66 41 124 0.302 4.48 2.00 Prom - 47651 47612 40 -8.45 3.06 PlyA - 47855 47850 6 1.05 3.05 Term - 48217 48013 205 0 1 113 55 109 0.631 5.96 3.04 Intr - 81571 81451 121 2 1 90 67 77 0.063 4.43 3.03 Intr - 86633 86460 174 0 0 24 18 153 0.015 0.89 3.02 Intr - 94815 94684 132 1 0 48 45 89 0.095 0.30 3.01 Init - 96491 96326 166 2 1 81 26 119 0.188 4.84 3.00 Prom - 97394 97355 40 -7.85 4.00 Prom + 99918 99957 40 -1.95 4.01 Sngl + 100001 100624 624 1 0 55 45 312 0.885 19.54 4.02 PlyA + 101411 101416 6 1.05 5.16 PlyA - 102684 102679 6 1.05 5.15 Term - 103716 103678 39 0 0 82 43 25 0.260 -6.39 5.14 Intr - 104126 103930 197 0 2 68 62 139 0.471 7.51 5.13 Intr - 104402 104185 218 1 2 120 17 121 0.636 5.42 5.12 Intr - 105705 105219 487 1 1 78 113 243 0.565 16.94 5.11 Intr - 111950 111830 121 2 1 103 16 39 0.018 -2.65 5.10 Intr - 123085 123049 37 0 1 111 65 28 0.074 0.15 5.09 Intr - 124330 124221 110 1 2 114 83 27 0.643 2.96 5.08 Intr - 126371 126314 58 1 1 115 99 46 0.832 6.37 5.07 Intr - 132460 132225 236 1 2 82 40 246 0.730 14.76 5.06 Intr - 134271 134155 117 0 0 90 75 17 0.554 0.34 5.05 Intr - 135963 135891 73 2 1 93 63 17 0.733 -1.81 5.04 Intr - 137313 137249 65 0 2 86 71 91 0.809 3.80 5.03 Intr - 138140 138045 96 2 0 79 95 70 0.939 6.09 5.02 Intr - 141254 141195 60 2 0 80 76 66 0.859 2.61 5.01 Init - 142784 142341 444 2 0 73 115 177 0.991 14.96 5.00 Prom - 144480 144441 40 -7.25 6.03 PlyA - 144912 144907 6 1.05 6.02 Term - 149037 148571 467 1 2 -22 45 366 0.395 15.49 6.01 Init - 152670 152616 55 0 1 92 115 48 0.750 9.40 6.00 Prom - 153360 153321 40 -7.45 7.00 Prom + 163918 163957 40 -2.95 7.01 Sngl + 166118 166432 315 1 0 50 41 150 0.197 2.20 7.02 PlyA + 166826 166831 6 1.05 8.03 PlyA - 168748 168743 6 1.05 8.02 Term - 172164 172011 154 2 1 28 41 232 0.224 8.91 8.01 Init - 182716 182661 56 1 2 12 89 73 0.034 0.61 8.00 Prom - 183571 183532 40 -7.25 9.03 PlyA - 183670 183665 6 1.05 9.02 Term - 184699 184145 555 1 0 45 40 332 0.785 17.44 9.01 Init - 185859 185728 132 0 0 90 89 167 0.856 17.19 9.00 Prom - 186118 186079 40 -10.35 10.03 PlyA - 186184 186179 6 -0.45 10.02 Term - 186451 186218 234 1 0 89 36 170 0.262 7.24 10.01 Init - 187054 186743 312 1 0 43 51 173 0.251 6.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 310 436 127 2 1 74 92 80 0.863 5.82 S.002 Intr + 5930 6125 196 2 1 83 53 117 0.840 6.20 S.003 Intr + 7316 7492 177 1 0 121 86 35 0.884 5.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_1|395_aa MSIALKQVFNKDKTFRPKRKFEPGTQRFELHKRAQASLNSGVDLKAAVQLPSGEDQNDWV AVHVVDFFNRINLIYGTICEFCTERTCPVMSGGPKYEYRWQDDLKYKKPTALPAPQYMNL LMDWIEVQINNEEIFPTCVVNSHVSFEVQLRYHVLCEGFHPLEYQLFKLCNMSNSSDYHP LTLVLVTFKLLDGWMDGWMDGLGILLQLTGRKEVAKYCGLTYYGPQISHPYLASQSGKEK ACSQGHLSSHVFEEKCEQMCQRVEATHINKPLRLPEVKTPEALSLFQQAPQVYVSSLAAP CPPNGAIQQLPFEECPLTNMTKTGAQVCLAEDVENESTRSRSSDVTLNPERLIRQGEMFK QQLWEALRRQVAPSADHQGTLLEQRASIELRCIRS >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_1|1188_bp atgtccatagccctgaagcaggtattcaacaaggacaagaccttccgacccaagaggaaa tttgaacctggcacacagaggtttgagctgcacaaacgggctcaggcatccctcaactcg ggtgtggacctgaaggcggctgtgcagttgcccagtggggaggaccagaatgactgggtg gcagtacatgtggtggacttcttcaatcggatcaacctcatctatggcaccatctgtgag ttctgcaccgagcggacctgtcctgtgatgtcagggggccccaaatatgagtatcggtgg caggatgatctcaagtataagaagccaacagcgctgccagctccccagtacatgaacctt cttatggattggattgaggttcagatcaacaacgaggaaatatttccaacatgcgtggtg aattctcatgtatcctttgaggttcagctcagataccatgtcctctgtgaaggctttcat ccattagaataccagctctttaaattatgcaatatgtctaattcatctgactatcaccca ctaactctagtcctggtaactttcaagttattggatggatggatggatggatggatggac ggcctgggaattctactacagttaacaggaagaaaagaagttgctaaatattgcggactt acctattatggccctcagataagccatccatatttggcatctcaaagtggaaaagaaaaa gcctgcagccaggggcatttgtcttcccatgtttttgaagagaaatgtgagcaaatgtgt caacgagtagaagctacacatatcaataaaccgttgaggctcccagaagtaaagacccca gaggccctaagtctctttcaacaggcgccgcaagtgtacgtttcatctcttgcagctcct tgtcctcccaacggtgctatccagcagctgccctttgaggaatgtcctttaacaaatatg accaaaactggagctcaggtgtgcctggcagaggatgtggagaatgaaagtaccaggtca aggagtagtgatgtcaccttaaatccagagagattgatcagacagggtgaaatgtttaag caacagctctgggaagcccttagaaggcaagtggcaccctcagcagaccatcagggaact ctgttggagcaaagggcctccatagaacttcgttgcattcgctcctga >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_2|138_aa MEEEEANVEQCDIVSSRAHTLEMVDLGVKIGQSDLRSIYLTTSLYNTVTKSHKAQGWFIE LASPQEMTFGPWTTGYARPGMSDSLAQGALKSWRAAESTHPLAMSEIFMAAPPITGPEAQ QEKVVLWAGPRIPVLYAA >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_2|417_bp atggaagaagaggaagctaacgttgagcaatgtgatatagtaagctccagagcccacact ctagaaatggtagatttgggagtcaaaatcgggcagtctgatctcagatccatttactta acgaccagtctatacaacacagtgaccaagagccacaaagcgcaggggtggttcattgaa ctagcctctccacaggaaatgacttttggcccatggaccactggatatgccaggccaggc atgtcagactccctggctcagggagctcttaaatcctggagagcagcagagtccacacat ccactggccatgtcagagatcttcatggctgccccacccatcacaggcccagaggcccag caggaaaaagtggttttgtgggccggtcccaggatccccgtgctgtatgcagcctag >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_3|265_aa MQAPTDLSQQSSDVTSPAESHSSATLPPPAELHLIAFLYAGFTCAHTRNKCTWQRGIWRG QSQMSGKIKGINRVVDGNLGVHLMSYTTSGAFSLGFPEAGCGWNSFLTSVAKGGKGTNAP VRRGKSQGPAKVSAVDFREYLLTPTVWNLVDPGGETGFLRARVKEKEERGGKNGSTGNEF PGRAETPEECVTSYTLHVFLRPNRGNPLEAYLTRRIDECRILNEKEVSGCGFLYCQLGPK IKRALAAGVWLSPAPVADVVFHGFA >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_3|798_bp atgcaagctcccactgacctctcacagcagagctcagacgtgacatcccctgcagagagc cattcctctgctactcttcctcctcctgcagagttacatttgattgcatttctctatgct ggctttacatgtgcacatacccgtaacaagtgtacatggcaaagagggatatggagagga caaagtcagatgtcggggaaaattaagggaattaacagagtggtagatggcaacttaggt gtccatttaatgtcctatactacttctggtgcattctccttgggattccccgaggctggg tgtggctggaattcatttctgacttctgtggcaaaaggaggaaaaggcactaatgcccct gtgagaagaggcaagagccagggacctgccaaggtctctgcagtagacttcagggagtat ctcctgacccccactgtgtggaacttggtggatcctggtggagagacaggctttttgaga gccagagtcaaagagaaggaagagagagggggaaaaaatggctctacgggaaatgagttt ccaggcagagctgagacacctgaagaatgtgtcacttcatatactttgcacgtatttctt agaccaaatagaggaaatccactggaggcttacctaacaagacgtattgatgaatgtaga atcttaaatgaaaaagaagtcagtggatgtggcttcctgtactgtcagcttggaccaaaa ataaagcgagccctcgccgcaggggtttggctcagccctgcgcctgtggcagatgttgtt ttccatggctttgcttga >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_4|207_aa MSTKPDMIQKCLWLEILMGIFIAGTLSLDCNLLNVHLRRVTWQNLRHLSSMSNSFPVECL RENIAFELPQEFLQYTQPMKRDIKKAFYEMSLQAFNIFSQHTFKYWKERHLKQIQIGLDQ QAEYLNQCLEEDKNENEDMKEMKENEMKPSEARVPQLSSLELRRYFHRIDNFLKEKKYSD CAWEIVRVEIRRCLYYFYKFTALFRRK >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_4|624_bp atgagcaccaaacctgatatgattcaaaagtgtttgtggcttgagatccttatgggtata ttcattgctggcaccctatccctggactgtaacttactgaacgttcacctgagaagagtc acctggcaaaatctgagacatctgagtagtatgagcaattcatttcctgtagaatgtcta cgagaaaacatagcttttgagttgccccaagagtttctgcaatacacccaacctatgaag agggacatcaagaaggccttctatgaaatgtccctacaggccttcaacatcttcagccaa cacaccttcaaatattggaaagagagacacctcaaacaaatccaaataggacttgatcag caagcagagtacctgaaccaatgcttggaggaagacaagaatgaaaatgaagacatgaaa gaaatgaaagagaatgagatgaaaccctcagaagccagggtcccccagctgagcagcctg gaactgaggagatatttccacaggatagacaatttcctgaaagaaaagaaatacagtgac tgtgcctgggagattgtccgagtggaaatcagaagatgtttgtattacttttacaaattt acagctctattcaggaggaaataa >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_5|785_aa MSTLCPPPSPAVAKTEIALSGKSPLLAATFAYWDNILGPRVRHIWAPKTEQVLLSDGEIT FLANHTLNGEILRNAESGAIDVKFFVLSEKGVIIVSLIFDGNWNGDRSTYGLSIILPQTE LSFYLPLHRVCVDRLTHIIRKGRIWMHKERQENVQKIILEGTERMEDQGQSIIPMLTGEV IPVMELLSSMKSHSVPEEIDIADTVLNDDDIGDSCHEGFLLNAISSHLQTCGCSVVVGSS AEKVNKIVRTLCLFLTPAERKCSRLCEAESSFKYESGLFVQGLLKDSTGSFVLPFRQVMY APYPTTHIDVDVNTVKQMPPCHEHIYNQRRYMRSELTAFWRATSEEDMAQDTIIYTDESF TPDLNIFQDVLHRDTLVKAFLDQVFQLKPGLSLRSTFLAQFLLVLHRKALTLIKYIEDDT KSNYLEQVQISLISSSVTERGEKGMAAENSGLYHPCLLTSRSMYKFSEKKPSGALEVAVV DPEIILSDLTPSSLPWPSVFSARLEAPLDWSCPISFRREDDPKDDFSLHGAAIPQAAARG PRSASPRDSRAPAGCAARWVHEGAAPERRRLQLEPGARPSAGWLAGTSRTGGRVPCVDWQ ATPHLPRAHHFLSRWLQLTRRREPTGPWAGGKTPRDLSTQPTEIVANGLLFLRVSPIVCL SNIWRGLGREGSPSGPPPRPPRQLRQGIPAGKVEEPDPSEAAPPRPCGRISQISAQARRK GSNPRRCCFRDSGSAFRPNPERPVVGSRGFQSLAKGVGAGPLLGQWEGIGAPLKKRTQTT QHRYG >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_5|2358_bp atgtcgactctttgcccaccgccatctccagctgttgccaagacagagattgctttaagt ggcaaatcacctttattagcagctacttttgcttactgggacaatattcttggtcctaga gtaaggcacatttgggctccaaagacagaacaggtacttctcagtgatggagaaataact tttcttgccaaccacactctaaatggagaaatccttcgaaatgcagagagtggtgctata gatgtaaagttttttgtcttgtctgaaaagggagtgattattgtttcattaatctttgat ggaaactggaatggggatcgcagcacatatggactatcaattatacttccacagacagaa cttagtttctacctcccacttcatagagtgtgtgttgatagattaacacatataatccgg aaaggaagaatatggatgcataaggaaagacaagaaaatgtccagaagattatcttagaa ggcacagagagaatggaagatcagggtcagagtattattccaatgcttactggagaagtg attcctgtaatggaactgctttcatctatgaaatcacacagtgttcctgaagaaatagat atagctgatacagtactcaatgatgatgatattggtgacagctgtcatgaaggctttctt ctcaatgccatcagctcacacttgcaaacctgtggctgttccgttgtagtaggtagcagt gcagagaaagtaaataagatagtcagaacattatgcctttttctgactccagcagagaga aaatgctccaggttatgtgaagcagaatcatcatttaaatatgagtcagggctctttgta caaggcctgctaaaggattcaactggaagctttgtgctgcctttccggcaagtcatgtat gctccatatcccaccacacacatagatgtggatgtcaatactgtgaagcagatgccaccc tgtcatgaacatatttataatcagcgtagatacatgagatccgagctgacagccttctgg agagccacttcagaagaagacatggctcaggatacgatcatctacactgacgaaagcttt actcctgatttgaatatttttcaagatgtcttacacagagacactctagtgaaagccttc ctggatcaggtctttcagctgaaacctggcttatctctcagaagtactttccttgcacag tttctacttgtccttcacagaaaagccttgacactaataaaatatatagaagacgataca aaatctaattacttggaacaagttcagatttcactgataagttcatccgtaactgagaga ggtgaaaaggggatggctgcagagaactctggcttatatcatccttgcttgctgacctca aggtccatgtataaattctcagagaagaagccctctggtgccttggaagtggccgttgtg gacccagagatcatcctttctgatctgacaccttcttcactgccctggcccagtgtcttt tctgcaaggctggaagcccccttagactggtcatgtcccatctctttccggagggaagat gatcccaaagacgacttttctctccacggtgctgccataccgcaggcggccgccaggggt ccccgctcggcgtccccgcgagacagtcgagccccggccggctgcgcggcgcgctgggtg catgagggggctgctccggagcgacggcggctgcagctggagccaggcgctcgcccgtcc gccggttggctcgccgggacctcgcgcaccggcggcagagtcccttgcgtggattggcaa gcgacgccccacctgccccgagctcaccattttctttcgcgctggctgcagctgacccgg cgaagggagccgaccgggccctgggctggaggtaaaaccccacgagatctctctacgcag ccgactgagatcgtggcgaatggccttttgtttctccgcgtttcccctattgtttgcctt tccaacatctggcggggcttggggagagaaggaagcccctctggtccccctccccggccc ccacgccagctccggcaggggatcccagctgggaaagtggaggagcccgaccccagcgag gccgccccaccccgcccttgtgggcgcatttctcagatctcagcccaggcgcgccgcaaa ggctcaaatccgagaaggtgctgctttcgagacagtggaagcgcgttccgccccaatcca gagcgtccagtggttggttccagaggatttcaatctctagccaaaggcgttggggctggg ccgctgctagggcagtgggaggggatcggggcacctttgaaaaagaggactcagacgacg caacacagatacggctag >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_6|173_aa MVKYDLWGHSTGKGKKASEARVEAGALLRARRIFTFPLISLTEAGCRAFASSDWWNCLHP GPGLPGGGGGGGGAGTRDGDLASSLLSRPQYPSCLLPGDPLGALPLRAREKGASGTERPR LGEGRRVGGARLLRTKSGFARNPRRSLPARRSCGMRWGCGDACTISAQASREW >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_6|522_bp atggtgaagtatgacctttgggggcattccaccggaaaagggaagaaagcctcagaggcg cgggtagaagcgggggctctcctcagagctcgacgcatttttactttccctctcatttct ctgaccgaagctgggtgtcgggctttcgcctctagcgactggtggaattgcctgcatccg ggccccgggcttcccggcggcggcggcggcggcggcggcgcagggacaagggatggggat ctggcctcttccttgctttcccgccctcagtacccgagctgtctccttcccggggacccg ctgggagcgctgccgctgcgggctcgagaaaagggagcctcgggtactgagaggcctcgc ctgggggaaggccggagggtgggcggcgcgcggcttctgcggaccaagtcggggttcgct aggaacccgagacggtccctgccggcgaggagatcatgcgggatgagatgggggtgtgga gacgcctgcacaatttcagcccaagcttctagagagtggtga >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_7|104_aa MTQMTYIEFRIWMARKYIKIQEIIETQYKELKESRKMIKELKDEIAILRKNKTELQKLRD QLQKFHNIVRGINSRTDQAEERISEFEDQLFESTWLDKNKEKRI >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_7|315_bp atgactcaaatgacatacatagaattcagaatctggatggcaaggaagtacatcaagatc caggagatcattgaaacccaatacaaggaactcaaggaatccagaaaaatgatcaaagag cttaaagatgaaatagctattttaagaaagaacaaaactgaacttcagaaattgagggat caacttcaaaaatttcataatatagtcagaggcattaacagcagaacagaccaagctgag gaaagaatctcagagtttgaagaccagttatttgaatcaacttggttggacaaaaataaa gaaaaaagaatttaa >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_8|69_aa MKEHIEASNYLVSAQMRNRRGEGDVRFGAAEIGVMWPPIKETKDFQQPPEAGRGKEGLFP KAFKGAQPY >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_8|210_bp atgaaagaacatattgaagcatccaattacctggtgtctgctcaaatgaggaatcgaaga ggagaaggtgatgtgagatttggagcagcagagattggagtgatgtggccaccaatcaag gaaaccaaggacttccagcagccaccagaagctggaagaggcaaggaaggactcttccct aaagcctttaaaggagcacagccctactaa >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_9|228_aa MNLTIFQMNEERLKIAIKDALNENSQLQENERQLLQEAEVWKEQGFSDTGSLSPPWEQDR RMMFLPPGQSYPDSALPPQRQDRFYSNSGTLSGPAELRRFNMTSLDKVDGSMLSEMESSR NDTKDDLGNLNVPDSSLPAENEATGPYFSPPPLAPIRGPLFPGDTRSLFMRRGPPFPPPP PGTMFGASQDYFPPRDFPDPPHAPFAMRNVYPARRFLLTFPQNLDFSP >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_9|687_bp atgaacttgacgatatttcaaatgaatgaagaacgactgaagatagcaataaaagatgct ttgaatgaaaattctcaactccaggaaaacgagagacagcttttgcaagaagctgaggta tggaaagaacaaggcttctctgacactgggtccctgtcacctccatgggaacaggaccgt aggatgatgtttcttccaccaggacaatcatatcctgattcagctcttcctccacaaagg caagacagattttattctaattctggcacactgtctggaccagcagaactcagaaggttt aatatgacttctttggataaagtggatgggtcaatgctttcagaaatggaatccagcaga aatgataccaaagatgaccttggtaatttaaatgtgcctgattcatctctccctgctgaa aatgaagcaactggcccttacttttctcctccacctcttgctccaatcagaggtccattg tttccgggggatacaaggagcctgttcatgagaagaggacctcctttccccccacctcct ccaggaaccatgtttggagcttctcaagattattttccaccaagggatttcccagatcca ccacatgctccatttgcaatgagaaatgtctatccagcgaggcgtttcctccttaccttc ccccaaaacctggatttttccccataa >gi568815589f:27424337_27624957|GENSCAN_predicted_peptide_10|181_aa MNIDAKILNKILASGIQQRIKKLIRQDQVGFISGMQGWFNIRKSINHHSQQIITRTENQT PHVLTHKRELNNENTWTQGGEHHILRPVEGFGVEGVIALGEIPNPKFIVATAGAAFPAVE EPGATPQRYLGLVLGELSRVVAALPESVRPDSNPYGFPWELVICAAVHGFFAVLFFCVEK F >gi568815589f:27424337_27624957|GENSCAN_predicted_CDS_10|546_bp atgaacattgatgcgaaaatcctcaataaaatactggcaagcggaatccagcagcgcatc aaaaagcttatccgccaggatcaagtcggcttcatctctgggatgcaaggctggttcaac atacgcaaatcaataaaccatcattctcagcaaattatcacaagaacagaaaaccaaaca ccgcatgttctcactcataagagggagttgaacaatgagaacacgtggacccaaggaggg gaacatcacatactgcggcctgtcgagggatttggggttgagggagtgatagcattagga gaaatacctaatcccaagtttattgtggcaaccgccggagcagccttccccgctgtggag gagcctggggctacccctcagcggtatttggggctggtcctgggggagctaagcagggtt gtggcagcactgcctgaaagtgtgagaccagactctaatccttatggttttccatgggag ttggtgatatgtgcagctgtacatggattttttgctgttctctttttttgtgtggagaag ttttag