GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:31:53 Sequence gi568815577f:39665537_39901354 : 235818 bp : 43.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9552 9763 212 2 2 60 32 113 0.352 0.61 1.02 Term + 15062 15233 172 2 1 72 47 112 0.679 2.80 1.03 PlyA + 15563 15568 6 1.05 2.00 Prom + 20058 20097 40 -4.16 2.01 Init + 30787 30853 67 0 1 69 75 49 0.412 3.02 2.02 Term + 35881 36047 167 2 2 53 48 125 0.209 3.08 2.03 PlyA + 36079 36084 6 -0.45 3.00 Prom + 37138 37177 40 -5.46 3.01 Init + 43737 43750 14 2 2 92 101 5 0.929 1.95 3.02 Term + 43955 44075 121 2 1 107 39 123 0.926 7.25 3.03 PlyA + 45930 45935 6 1.05 4.00 Prom + 54770 54809 40 -1.86 4.01 Init + 80616 80762 147 2 0 75 113 106 0.536 11.89 4.02 Intr + 98684 98720 37 1 1 112 72 13 0.559 -0.06 4.03 Intr + 99999 100316 318 1 0 61 93 390 0.906 32.73 4.04 Intr + 105380 105679 300 0 0 104 115 125 0.864 13.51 4.05 Term + 111583 111740 158 2 2 82 33 89 0.254 0.90 4.06 PlyA + 113212 113217 6 1.05 5.03 PlyA - 113543 113538 6 -0.45 5.02 Term - 113752 113683 70 0 1 77 48 168 0.113 9.11 5.01 Init - 120305 119902 404 2 2 60 39 184 0.060 6.80 5.00 Prom - 121856 121817 40 -4.56 6.00 Prom + 124132 124171 40 -4.86 6.01 Init + 126446 126563 118 1 1 64 109 74 0.360 7.46 6.02 Intr + 127933 128077 145 2 1 38 68 31 0.225 -4.56 6.03 Intr + 130660 130827 168 1 0 63 79 87 0.017 4.36 6.04 Intr + 140098 140210 113 1 2 89 81 21 0.022 1.52 6.05 Intr + 153589 153696 108 2 0 68 48 62 0.311 0.46 6.06 Term + 154705 154778 74 2 2 104 50 35 0.435 -0.63 6.07 PlyA + 155232 155237 6 1.05 7.00 Prom + 159114 159153 40 -3.96 7.01 Sngl + 175144 175491 348 0 0 59 43 214 0.699 8.06 7.02 PlyA + 178678 178683 6 1.05 8.05 PlyA - 180159 180154 6 1.05 8.04 Term - 184347 184282 66 0 0 127 32 26 0.253 -1.16 8.03 Intr - 191068 190941 128 2 2 53 80 111 0.385 7.20 8.02 Intr - 194511 194396 116 2 2 71 98 8 0.366 0.19 8.01 Init - 194689 194547 143 1 2 101 29 66 0.323 1.61 8.00 Prom - 198409 198370 40 -7.76 9.04 PlyA - 198633 198628 6 1.05 9.03 Term - 199157 198986 172 1 1 111 53 223 0.988 18.40 9.02 Intr - 200836 200695 142 2 1 85 -12 101 0.009 -0.79 9.01 Intr - 217912 217826 87 2 0 64 93 49 0.100 2.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 74308 74116 193 0 1 86 43 151 0.974 7.29 S.002 Intr - 200836 200716 121 2 1 85 53 102 0.949 5.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_1|127_aa GTTDESASTTDLGSLTVLEAGVPDQAVTGLVPPETPLLGLQMTSSPCVLMGPSFFVCVLI SCPYKDTCLIGVYSFTYIKKGNDKTASGRSFRRYPEEGIVIIGDGCSMPVGQDVQVEDSD MDDSDLV >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_1|384_bp ggcaccacagacgagtcggcttcaacaacagacctcggttctctcacagtcctggaggct ggcgtcccagatcaagccgtcacagggctggttcctcctgagacccctctcctgggcttg cagatgacatcttccccttgtgtcctcatggggccatccttctttgtctgtgtcctcatc tcctgtccttacaaggacacctgtctgataggagtgtactcttttacttatattaagaaa ggtaacgataaaacagcctcaggcaggtccttcaggaggtatccagaagaaggcattgtt atcatcggagatggctgctccatgccagtgggacaagatgtgcaggtggaagacagtgat atggatgattctgaccttgtgtag >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_2|77_aa MNGVIFHLAGCFHTSHPLQAAVDVIAGGAAVIFSPLESHDDDDDEAEDSKHLMSSFVGFG LLPLNSMLCEKINLVTC >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_2|234_bp atgaatggagtaattttccacctggctggctgcttccacacctctcacccgctgcaggca gcagtagatgtaatagctggaggagcagcagtcatcttttcgcctttagaatcacatgac gatgatgatgatgaagctgaagacagtaaacaccttatgagcagctttgttggctttgga cttttacccttaaactcgatgttatgtgagaaaattaaccttgttacttgttga >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_3|44_aa MAYTWESTLGKQQSKMLLERSPNPDPKRRFLDLVQEGIQGESMQ >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_3|135_bp atggcttacacatgggaatccactctggggaagcaacagtccaagatgttactggaaagg agtcccaatccagatcccaagagaaggttcttggatctcgtgcaagaaggaattcagggc gagtccatgcagtaa >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_4|319_aa MQEQWRAFSLIGSGNGRLTGSGAQQILCRIWRNGSQRRVCDGGKQQWWTGQFSPHNVPGY TGSGSGNEVIEGPQNARVLKGSQARFNCTVSQGWKLIMWALSDMVVLSVRPMEPIITNDR FTSQRYDQGGNFTSEMIIHNVEPSDSGNIRCSLQNSRLHGSAYLTVQVMGELFIPSVNLV VAENEPCEVTCLPSHWTRLPDISWELGLLVSHSSYYFVPEPSDLQSAVSILALTPQSNGT LTCVATWKSLKARKSATVNLTVIRCPQGNLKFSLSHKAPTQWSNLQLKTVKIMDPTLDFT ILLSHLWEGECESILSCRG >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_4|960_bp atgcaggaacaatggcgagcctttagcctgattggaagtggcaatgggcgcctcactgga tcaggagcacagcagatactctgccggatctggaggaatggaagtcagcggcgggtctgc gatggtggcaaacagcagtggtggacggggcagtttagcccccacaatgtgcctgggtac acaggttctgggtctggtaatgaagtcatagaaggcccccaaaatgcaagagtcctgaag ggctcccaggctcgcttcaactgcaccgtctcccagggctggaagctcatcatgtgggct ctcagtgacatggtggtgctaagcgtcaggcccatggagcccatcatcaccaatgaccgc ttcacctctcagaggtacgaccagggcgggaacttcacctcggagatgatcatccacaat gtggagcccagtgattcggggaacatcagatgcagcctccagaacagtcgcctgcatgga tctgcttaccttaccgtccaagttatgggagagctgttcattcccagtgttaatcttgta gtcgctgagaatgaaccttgtgaagttacttgtctaccctcacactggacccggctcccg gatatttcctgggagctcggtctcctggtcagccattcaagctattattttgttccggag cccagcgaccttcaaagtgcagtgagcatcctggctctgaccccacagagcaatgggact ttgacttgcgtggctacctggaagagcctgaaggcccgcaagtctgcaactgtaaatctc actgtgattcggtgtccccaaggaaacctaaagttctcgctcagtcacaaagctcccaca cagtggtccaaccttcaactcaagactgtgaagatcatggatccaactctggacttcacc attttattatcacatctgtgggagggggagtgtgagagcatcctgtcatgtcgtgggtag >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_5|157_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIM KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKGGGITLP DFKLYYKATVTKTACRNNNGSSSCSHNNDGGSSSSVL >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_5|474_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccagctt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgactttcttcactgaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccacatcgccaaatcaatcctaagccaaaagaacaaaggtggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatgcagaaacaacaacggc agcagcagttgcagccacaacaacgacggcggcagcagcagcagcgtattgtaa >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_6|241_aa MNIKWSNCRKSEKEKTNKETETESGNENSGYNSDEQKTTELLEHTAWFLPLNDSKIVLLL QTPLLSLPNPVNPVILNKETVAVALLTREDIDLVVVNWPLESCQNSEATACRSSQGDAGT EVPFKNFPRPGPVALQPCQAYVTSLLLVPPPDHAQPEPEVLETISVVRMGQSPGRTVLDT EGGSSHSLRGLNLPALPYQADTSTVSSGISTIISKTPCLSSSSGYFSLLAKGYNIHVILT S >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_6|726_bp atgaacataaaatggtctaattgtaggaaatctgaaaaagagaagacaaacaaagaaact gagacagaaagtggaaatgaaaactccggctacaattcagatgaacaaaagaccacagaa ttgttagagcacacagcttggtttttgccacttaatgactctaaaattgttcttctgttg cagacaccgcttctctccctcccaaatcctgtgaatccagtgatcctgaacaaagaaaca gtagctgtggccctcctcaccagagaagacatagaccttgttgtggtcaactggccactt gagagctgccagaatagtgaagctacagcttgtagaagcagtcagggtgacgctggtaca gaggttccttttaagaacttccccagaccaggccccgtagccctgcagccctgccaggcc tatgtcaccagtctcttgcttgtgcctccccctgatcacgcccaaccagagccagaggtc ctggagaccatcagtgtagtccgtatgggtcagtcccctgggaggactgtgctggataca gaaggaggttcttcacattccttaagaggactcaatcttccagcccttccttaccaagca gatacttccactgtgtcctctggaatcagcactatcatcagcaaaaccccgtgcctcagc tcatcttctggttacttctccttgctcgctaaaggttataacattcatgtcattcttact agctag >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_7|115_aa MPHLQFHLASSLMGSCLLPVTCKAGTPCQGEHCFFSEATQSASHNEAKRFSLLLHGIGVN PRKQALRRSLELLASSQDGLPLQMPPHWLQMSLSLPHREEDAALRRFYLQTFPSN >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_7|348_bp atgcctcacctccagttccatctcgcaagcagcctgatgggcagctgcctgcttcctgtc acctgcaaagcaggtacgccttgccagggtgagcactgtttcttctcggaggcaacacag tcagcctcacacaacgaagcgaaaagattctcactcctgctgcacggcattggtgtcaac cctcgcaaacaagccctgcgtcgttccctggagcttttggcatcttcccaggacgggctt cccttacaaatgcctcctcactggctgcagatgagcctgtctttgccacacagggaggag gacgctgctctaagacgattttacctccaaaccttccccagcaattag >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_8|150_aa MATSGCGNQGPRVESMPQWIHHESLEKCDEGIAFTKATRDVLIEGPQKTGLGIGEATTEP GSLIATGMAGSRNNRGWVATITHQKQVWALQKPDSLKDVSGVLQTQSSGSPNCICCIFAR GEQHGSRYMDPCNYSGPTEIIQKIFSSQNP >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_8|453_bp atggcaacaagtggatgtgggaaccaagggcctagagttgagagcatgccacagtggata caccatgaaagtctggaaaagtgcgatgagggcattgcctttaccaaagcaacaagggac gtgctaatagagggacctcaaaagacaggactgggcataggggaagccactacagaacca ggctccctgatagcaacaggcatggcaggatctagaaataatagaggctgggtggcaaca atcacccatcaaaagcaagtgtgggctctacagaaaccagacagcctgaaggatgtcagt ggagtgcttcaaactcagtcgagtggcagccccaactgcatctgctgcatctttgctaga ggagagcaacatggctccaggtacatggacccttgtaactacagtgggcccaccgagata attcagaaaatcttctcatcacaaaatccctaa >gi568815577f:39665537_39901354|GENSCAN_predicted_peptide_9|133_aa XRENTGGGNADDKEEMWMEDWENSVALLQGHRCDHILRSTLDSKQKAMRPQMGTGEHRLP SEALARSTSWSAPSPTAWALGRHLLPFILVGQYYPNKTSYQIKNAKDYLDLFIAYPGTRT VNEQSFNLFLWFR >gi568815577f:39665537_39901354|GENSCAN_predicted_CDS_9|402_bp naaagggaaaatacaggaggagggaacgctgatgacaaagaagaaatgtggatggaagac tgggagaactcagtggccctgttacaaggtcaccgatgcgaccacatcctcaggtccacc ttggacagcaagcagaaggccatgcgcccccaaatgggcaccggcgagcacaggctgccc tcagaggccttggccaggagcaccagctggtcagcaccctcgcccacagcatgggccctg ggacgacatctcctcccattcatcctcgtgggacagtactaccccaataagaccagttac cagatcaagaacgccaaagactacctggacctcttcatagcatatccagggacaaggaca gtcaatgagcaatctttcaaccttttcctgtggttccgatga