GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:48:27 Sequence gi568815585f:31639613_31902402 : 262790 bp : 38.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 8138 8177 40 -2.45 1.01 Init + 33858 33932 75 2 0 58 50 63 0.797 0.64 1.02 Intr + 33977 34161 185 1 2 95 56 160 0.256 11.16 1.03 Intr + 49339 49653 315 1 0 62 71 237 0.100 13.66 1.04 Intr + 52488 52632 145 0 1 26 41 99 0.006 -1.64 1.05 Term + 68722 68874 153 0 0 93 48 73 0.126 0.74 1.06 PlyA + 70794 70799 6 1.05 2.03 PlyA - 71954 71949 6 1.05 2.02 Term - 83285 83158 128 0 2 59 39 128 0.018 2.46 2.01 Init - 104271 104193 79 0 1 57 105 98 0.394 9.67 2.00 Prom - 105190 105151 40 -6.55 3.00 Prom + 107210 107249 40 -3.75 3.01 Init + 111078 111153 76 2 1 21 85 104 0.282 4.70 3.02 Intr + 118646 118792 147 0 0 77 89 65 0.483 4.79 3.03 Intr + 122112 122189 78 1 0 70 87 69 0.869 3.60 3.04 Intr + 125425 125530 106 2 1 48 74 47 0.002 -2.35 3.05 Intr + 135706 135777 72 1 0 78 87 53 0.000 1.80 3.06 Intr + 152194 152423 230 1 2 51 81 127 0.949 4.89 3.07 Intr + 153066 153476 411 1 0 114 67 271 0.980 20.83 3.08 Term + 160251 160363 113 1 2 96 43 70 0.855 1.04 3.09 PlyA + 160805 160810 6 -0.45 4.00 Prom + 162215 162254 40 -5.15 4.01 Init + 169987 170060 74 0 2 80 103 81 0.871 9.39 4.02 Term + 170960 171395 436 2 1 57 44 267 0.963 12.97 4.03 PlyA + 171674 171679 6 1.05 5.00 Prom + 183372 183411 40 -3.05 5.01 Init + 193056 193131 76 2 1 33 62 77 0.024 0.90 5.02 Intr + 207148 207327 180 2 0 47 80 107 0.151 4.72 5.03 Intr + 212389 212597 209 2 2 47 98 120 0.577 6.77 5.04 Term + 220570 220671 102 0 0 36 47 101 0.185 -1.90 5.05 PlyA + 221022 221027 6 1.05 6.00 Prom + 224151 224190 40 -3.65 6.01 Init + 234355 234406 52 0 1 66 70 35 0.100 0.87 6.02 Intr + 238605 238740 136 1 1 73 72 77 0.276 3.31 6.03 Intr + 248678 248796 119 2 2 59 34 153 0.056 6.19 6.04 Intr + 259000 259292 293 2 2 4 53 213 0.133 5.33 6.05 Term + 259336 259536 201 0 0 45 49 119 0.245 0.11 6.06 PlyA + 260759 260764 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 123986 124092 107 0 2 61 47 93 0.892 0.09 S.002 Term - 197840 197738 103 1 1 116 49 98 0.887 5.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_1|290_aa MRKHTGPLNPQKVASAGGEMDAVLKLHDQKQQSVMRTQIPDIWRTTSFMPALAPTSCMQA APGTCAQLPATRLGSGNEHIGTLTHCWRYLLAGVERHLIQESSGWHQAGAPLGQSFQKKE QAAIFAVLQSALMIPRQKGSGVDLQQTAADLQKKGLTVRRKTNKQKAITPKSTKRTPTQQ PHPKVISLKDQRLNQEEVKTLIRQITSSEIEAVINSLPTKKRPGPDGFTAKSYQRYKEEL ISSERLSPFSIAVTNKTDYKICKEKIFLTFLETGKSKSMAPIPGEGLLAA >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_1|873_bp atgagaaaacacactggccctttaaatccacagaaagttgcttcagctggaggggaaatg gatgcagtactaaaactccatgaccagaagcagcaatcagtaatgagaacacagatccct gatatttggaggacaacatcctttatgcccgctctggctcccacaagttgcatgcaagct gctcctggaacctgtgcacagctgcctgccacaaggcttgggagtgggaatgagcacatt ggaacactcacacattgctggagatacctcctagcaggggttgagagacacctcatacag gagagctctggctggcatcaggctggtgcccctctagggcaaagcttccagaagaaggaa caggcagcaatctttgctgttctgcagtctgcgctgatgatacccaggcaaaaagggtct ggagtggacctccagcaaactgcagcagacctgcagaagaagggcttgactgttagaaga aaaactaacaaacagaaagcaataacaccaaaatcaacaaaaagaacccccacacaacaa ccccatccaaaggtcatcagcctcaaagatcaaagactaaaccaggaagaagtcaaaacc ctgattagacaaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaaa aaacgcccaggaccagatggattcacagccaaatcctaccagaggtacaaagaggagctg atctcctctgagagacttagtccattttctattgctgtaacaaataagacagactacaaa atttgtaaagaaaagatatttcttacatttctggagactgggaagtccaaaagcatggca ccgatacctggtgaaggccttcttgctgcatga >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_2|68_aa MGCSQDKGRVGGYGNSESKRAEEQAQALVHLILGKYPVYECGPGKSFCSPQCDFITGMIP SQEESVKV >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_2|207_bp atgggatgtagccaggacaaaggcagagttggaggatatgggaactcagagagcaagaga gctgaagagcaggcccaagcccttgttcatctgattcttggcaaatatcccgtatatgag tgtgggcctggcaagagcttctgcagcccccagtgtgattttatcaccggaatgattcca tctcaggaagagtctgtgaaggtttaa >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_3|410_aa MQKTKQDEDYERAIGFSVKMDDSDSDFALTQGSMITPSCQKGYFPCGNLTKCLPRAFHCD GKDDCGNGADEENCGDTSGWATIFGTVHGNANSVALTQECFLKQYPQCCDCKETELECVN GDLKSVPMISNNVTLLYLNHNCITTLRPGIFKDLHQLTWLYFKNFRYCSYAPHVRICMPL TDGISSFEDLLANNILRIFVWVIAFITCFGNLFVIGMRSFIKAENTTHAMSIKILCCADC LMGVYLFFVGIFDIKYRGQYQKYALLWMESVQCRLMGFLAMLSTEVSVLLLTYLTLEKFL VIVFPFSNIRPGKRQTSVILICIWMAGFLIAVIPFWNKDYFGNFYGKNGVCFPLYYDQTE DIGSKGYSLGIFLGGVRSAVSSHGSRTHHPQRVYDTSDHDKLSKYPHEGN >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_3|1233_bp atgcagaaaaccaagcaagatgaggactatgaaagagccattggatttagtgtcaaaatg gatgacagtgattctgattttgcactgactcaaggtagcatgatcactccttcatgccaa aaaggatattttccctgtgggaatcttaccaagtgcttaccccgagcttttcactgtgat ggcaaggatgactgtgggaacggggcggacgaagagaactgtggtgacactagtggatgg gcgaccatatttggcacagtgcatggaaatgctaacagcgtggccttaacacaggagtgc tttctaaaacagtatccacaatgctgtgactgcaaagaaactgaattggaatgtgtaaat ggtgacttaaagtctgtgccgatgatttctaacaatgtgacattactatatctcaaccac aactgcatcacaaccctcagacctggaatattcaaagacttacatcagctaacttggctt tatttcaaaaactttcgatactgctcctatgctccccatgtccgaatatgtatgcccttg acggacggcatttcttcatttgaggacctcttggctaacaatatcctcagaatatttgtc tgggttatagctttcattacctgctttggaaatctttttgtcattggcatgagatctttc attaaagctgaaaatacaactcacgctatgtccatcaaaatcctttgttgtgctgattgc ctgatgggtgtttacttgttctttgttggcattttcgatataaaataccgagggcagtat cagaagtatgccttgctgtggatggagagcgtgcagtgccgcctcatggggttcctggcc atgctgtccaccgaagtctctgttctgctactgacctacttgactttggagaagttcctg gtcattgtcttccccttcagtaacattcgacctggaaaacggcagacctcagtcatcctc atttgcatctggatggcgggatttttaatagctgtaattccattttggaataaggattat tttggaaacttttatgggaaaaatggagtatgtttcccactttattatgaccaaacagaa gatattggaagcaaagggtattctcttggaattttcctaggaggcgtccgctctgctgtg tcctcacatggcagcagaactcatcatcctcagcgagtctacgacacctcagatcatgat aagcttagtaaatacccccatgaagggaactag >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_4|169_aa MTGSRLSSSPEADAAAMLLVQPVECWKPWHLPRAVKSEGVQNVRVGEAWHLPPRFQRMYL KVWMLRQKPTAGVEPPQSDSTKAMQSINVELEPCREHFTKALPNGAMKAGPQHSRPQNYG VTSMMQSQPKKATSIKLQPVKAATWALPSKALGMRLPKALGTHPSHHCA >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_4|510_bp atgactggaagcagactgagttcctcaccagaagcagatgctgctgccatgcttcttgta cagcctgtagaatgctggaagccttggcatcttccacgtgctgttaagtctgaaggtgtg cagaatgtaagagttggggaggcttggcaccttccacctagatttcagaggatgtatcta aaagtatggatgctcaggcagaagcctactgcaggtgtggagcccccacagagcgactca accaaggcaatgcagagcataaatgtggagttggagccctgcagagagcactttaccaag gcactgcctaatggtgccatgaaagcagggccacagcactccagaccccagaattatgga gtcaccagcatgatgcaatctcagcctaaaaaagcaacaagcattaaactccaacctgtg aaagcagccacatgggctctgcccagcaaagccttggggatgcggctgcccaaggccttg ggaacccacccctcacaccactgtgcctaa >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_5|188_aa MTSFGRSKEVAIDLVDQDNQLKFLKAEPGRSERSAAQGAQAVAAPGSRGSSARRSEVASG GGNYGGGAGAMLWVSLGAGVLASSSGMVKHADTEMTVIKEVYYTQESPRNRRYSMACHAG GTQGSSRIRQEAGGTRRNVGKSLTVVSMGRNEQSRCPRPTDPARERRQLDPGLENKKHNS KPTRAWGK >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_5|567_bp atgacatcatttggaagaagtaaagaagttgctatagatttagtggaccaagacaatcaa ttgaagtttcttaaagcagagcccgggcgcagcgagcgcagcgcagcgcagggagcgcag gcggtggcggcaccaggttcccggggctccagcgctcggcgcagcgaagtagcctccggc ggtggcaactacggcggcggcgccggggcgatgctgtgggtctcgttgggcgccggtgtc ctggcctcttcctcaggtatggtaaaacatgcagacacagaaatgactgtcataaaggaa gtttattatactcaagagtcccctagaaaccggaggtacagcatggcatgccatgcaggg ggcacacagggaagcagcaggatcaggcaggaggcagggggaacaagaagaaatgtgggc aagagccttacggtggtttccatgggaaggaatgagcaaagcaggtgtcccaggcccact gacccagcaagggagcgacgccagctggaccctggactagaaaacaagaaacacaattca aagccaacaagagcctggggaaaatga >gi568815585f:31639613_31902402|GENSCAN_predicted_peptide_6|266_aa MRKVSAKKIHPEFDEDKELGLINLVPSFHLSCYPFHKGCPFVSHDQDGNTSNESFTVSGM SPWAEDRGPVLPLVETQQDYREVAQYGHFNCQCRFTGTSTGPDGVESSRKNVQGRELPWS WGRDSRWHPFCDKAILELLGDRHILTVPKGAACGEEVEIQNPKQSLCTEADGRTHCRSFT LECAAKGLEVEAPAKRGSRKLLDAAGNSHSPHTGSACARAGTLRPTSLHPILLLPATWMV ERAQLPMFLSQWTVRKPLTNCVLSNE >gi568815585f:31639613_31902402|GENSCAN_predicted_CDS_6|801_bp atgaggaaagtctcagctaaaaaaatccacccagagtttgatgaggataaagagctaggc ctcattaaccttgtgccatcatttcatctgtcctgttacccgtttcataaaggatgtcct tttgtgtcccatgatcaagatggaaacacctctaatgagagcttcacagttagtggaatg agcccctgggctgaagacagaggtccagtcttgccacttgttgaaactcaacaagactac agagaagtggcacaatacggccacttcaactgccaatgccgcttcactggcactagcact ggaccagatggtgtggagagcagcaggaagaatgtccagggtagagagctgccctggagt tggggccgagatagcagatggcaccctttctgtgacaaggctattctagagttgctgggt gaccgacacatcctgacagtgcccaaaggggctgcttgtggagaggaggttgaaattcaa aatccaaagcagagtctctgcactgaggcagatggaagaactcactgtagaagcttcacg ctggagtgtgcagccaagggcttagaggtcgaggcacctgcgaaacgtggaagccggaag ctccttgatgctgctgggaactcacattctcctcacactggaagtgcttgtgccagggca gggaccctaagacctacctctctccatcccatcctacttctgccagccacatggatggtg gagagagcccagctgcccatgttcctctctcagtggactgtgagaaaaccactcacaaat tgtgtgctctccaatgaataa