GENSCAN 1.0    Date run: 3-Nov-116    Time: 19:54:27

Sequence gi568815581f:49476220_49679297 : 203078 bp : 47.53% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   3363   3301   63  0  0  102   99    57 0.478   7.38
 1.00 Prom -   8875   8836   40                              -1.36

 2.02 PlyA -  11861  11856    6                              -0.45
 2.01 Sngl -  15926  15732  195  2  0   95   47   224 0.860  14.16
 2.00 Prom -  16954  16915   40                              -7.36

 3.00 Prom +  17058  17097   40                              -5.66
 3.01 Init +  19226  19264   39  1  0  101   84   152 0.237  14.29
 3.02 Intr +  25844  25985  142  1  1  120   79    46 0.908   6.83
 3.03 Intr +  30080  30439  360  0  0   45   96   836 0.990  75.29
 3.04 Intr +  34193  34445  253  0  1   77   94   293 0.983  25.29
 3.05 Intr +  35673  35833  161  0  2  101   84   262 0.999  26.73
 3.06 Term +  36489  36790  302  1  2  110   52   421 0.648  36.08
 3.07 PlyA +  41845  41850    6                               1.05

 4.07 PlyA -  42913  42908    6                               1.05
 4.06 Term -  43661  43517  145  1  1   98   45    26 0.759  -3.32
 4.05 Intr -  44665  44548  118  2  1   70   96    89 0.628   7.42
 4.04 Intr -  45497  45419   79  2  1  105   43    46 0.730   0.92
 4.03 Intr -  48837  48779   59  1  2   97   60    47 0.494   1.40
 4.02 Intr -  50256  50149  108  1  0   69   28   120 0.806   4.36
 4.01 Init -  54804  54753   52  0  1   58   48    69 0.569   1.22
 4.00 Prom -  62224  62185   40                              -1.76

 5.00 Prom +  63818  63857   40                              -5.36
 5.01 Init +  72774  72835   62  2  2   84   84    55 0.332   5.42
 5.02 Intr +  78569  78749  181  2  1   87   15    92 0.212   1.67
 5.03 Intr +  78788  78964  177  1  0   52   48    88 0.159   1.32
 5.04 Intr +  91599  91640   42  2  0   77   76    39 0.356   0.04
 5.05 Intr +  92243  92476  234  1  0    1   77   174 0.746   5.49
 5.06 Intr +  94192  94505  314  0  2   52   76   177 0.832   7.88
 5.07 Intr +  95374  95425   52  1  1   85   99     3 0.829   0.01
 5.08 Intr +  95559  95650   92  2  2  102  111    27 0.895   5.19
 5.09 Intr +  99646 100054  409  1  1   12  105   146 0.388   2.97
 5.10 Term + 102377 103081  705  1  0  104   46   933 0.955  84.22
 5.11 PlyA + 104119 104124    6                               1.05

 6.10 PlyA - 107477 107472    6                               1.05
 6.09 Term - 124303 124159  145  0  1   72   44    55 0.381  -3.12
 6.08 Intr - 125788 125646  143  1  2   70   75   186 0.730  14.65
 6.07 Intr - 131153 131031  123  2  0   35  116   164 0.999  14.68
 6.06 Intr - 131710 131655   56  2  2   74   95    44 0.333   2.40
 6.05 Intr - 135238 135061  178  1  1  103   77   111 0.978  10.99
 6.04 Intr - 142889 142762  128  0  2  115  105   147 0.993  19.50
 6.03 Intr - 143166 143015  152  2  2   97   34   135 0.999   8.81
 6.02 Intr - 145893 145727  167  0  2  117   77    92 0.869   9.76
 6.01 Init - 146555 146514   42  2  0   75  111     9 0.655   2.33
 6.00 Prom - 177206 177167   40                              -2.76

 7.03 PlyA - 177241 177236    6                               1.05
 7.02 Term - 182742 182635  108  0  0   76   39    77 0.888   0.11
 7.01 Init - 186952 186890   63  1  0   59   99    79 0.807   5.26
 7.00 Prom - 192553 192514   40                              -4.46

 8.04 PlyA - 193611 193606    6                               1.05
 8.03 Term - 195152 195070   83  0  2   97   42    54 0.702  -0.44
 8.02 Intr - 201786 201714   73  0  1   82  109    54 0.949   5.88
 8.01 Init - 201946 201866   81  1  0   77   70    78 0.662   3.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  69010  68906  105  1  0  153   48    46 0.974   5.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_1|21_aa
MGWGLLRAVEAGSHGLTAGPQ

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_1|63_bp
atgggctggggcctcctcagagccgtggaggcagggagtcacggactgacagccgggcct
cag

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_2|64_aa
MTRSIYTKTSTTMTIIIMTTTSTITIITTTATITTTSNNTSTIMTIIIMPLPSVSLPESQ
LSPP

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_2|195_bp
atgaccagaagcatctataccaaaactagtaccactatgaccatcatcattatgaccacc
accagtaccatcaccatcatcaccaccactgccaccatcaccaccacttctaacaacacc
agcaccattatgaccatcatcattatgccactgccatcagtgtcactaccagaatcacaa
ctatcaccaccatga

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_3|418_aa
MDGPRLLLLLLLGVSLGGAKEACPTGLYTHSGECCKACNLGEGVAQPCGANQTVCEPCLD
SVTFSDVVSATEPCKPCTECVGLQSMSAPCVEADDAVCRCAYGYYQDETTGRCEACRVCE
AGSGLVFSCQDKQNTVCEECPDGTYSDEANHVDPCLPCTVCEDTERQLRECTRWADAECE
EIPGRWITRSTPPEGSDSTAPSTQEPEAPPEQDLIASTVAGVVTTVMGSSQPVVTRGTTD
NLIPVYCSILAAVVVGLVAYIAFKRWNSCKQNKQGANSRPVNQTPPPEGEKLHSDSGISV
DSQSLHDQQPHTQTASGQALKGDGGLYSSLPPAKREEVEKLLNGSAGDTWRHLAGELGYQ
PEHIDSFTHEACPVRALLASWATQDSATLDALLAALRRIQRADLVESLCSESTATSPV

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_3|1257_bp
atggacgggccgcgcctgctgctgttgctgcttctgggggtgtcccttggaggtgccaag
gaggcatgccccacaggcctgtacacacacagcggtgagtgctgcaaagcctgcaacctg
ggcgagggtgtggcccagccttgtggagccaaccagaccgtgtgtgagccctgcctggac
agcgtgacgttctccgacgtggtgagcgcgaccgagccgtgcaagccgtgcaccgagtgc
gtggggctccagagcatgtcggcgccgtgcgtggaggccgacgacgccgtgtgccgctgc
gcctacggctactaccaggatgagacgactgggcgctgcgaggcgtgccgcgtgtgcgag
gcgggctcgggcctcgtgttctcctgccaggacaagcagaacaccgtgtgcgaggagtgc
cccgacggcacgtattccgacgaggccaaccacgtggacccgtgcctgccctgcaccgtg
tgcgaggacaccgagcgccagctccgcgagtgcacacgctgggccgacgccgagtgcgag
gagatccctggccgttggattacacggtccacacccccagagggctcggacagcacagcc
cccagcacccaggagcctgaggcacctccagaacaagacctcatagccagcacggtggca
ggtgtggtgaccacagtgatgggcagctcccagcccgtggtgacccgaggcaccaccgac
aacctcatccctgtctattgctccatcctggctgctgtggttgtgggccttgtggcctac
atagccttcaagaggtggaacagctgcaagcagaacaagcaaggagccaacagccggcca
gtgaaccagacgcccccaccagagggagaaaaactccacagcgacagtggcatctccgtg
gacagccagagcctgcatgaccagcagccccacacgcagacagcctcgggccaggccctc
aagggtgacggaggcctctacagcagcctgcccccagccaagcgggaggaggtggagaag
cttctcaacggctctgcgggggacacctggcggcacctggcgggcgagctgggctaccag
cccgagcacatagactcctttacccatgaggcctgccccgttcgcgccctgcttgcaagc
tgggccacccaggacagcgccacactggacgccctcctggccgccctgcgccgcatccag
cgagccgacctcgtggagagtctgtgcagtgagtccactgccacatccccggtgtga

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_4|186_aa
MPNTDEDVEQQELSFTAVGFCVLRDGRALSSLADTARMVRSLRVEVDLSGQEGGLAYVAL
CQAPGGTSLLQPGAGFKRLPCRILPGSLQPENFLPLRNPNCGHPKVRRVEGEPSGLLLKA
LGPGVELGLQNVDSKGFSGAQGYRLSGPPRPSPLDEEAFHIVPSLIAASLPWMKEQREAL
GEEMGP

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_4|561_bp
atgccaaacactgatgaggatgtggaacagcaggaactctcattcactgctgttgggttc
tgcgttctccgggatggccgtgctctgtcctcacttgctgacacagcccgcatggtccga
agcctccgtgtggaagtggatctgagtgggcaggagggaggcctagcctatgtggccctg
tgccaggctcctggcggcaccagcctcctgcagcctggggctggattcaaacggctcccc
tgcagaattcttcctgggtccctgcagccagaaaacttcctccctcttcggaaccccaac
tgcgggcatcccaaggtccgcagggtagaaggtgaaccatctggactactgttgaaagcc
ttaggccctggggtggagctggggcttcagaatgtggactccaagggcttctcaggagca
cagggttacaggctcagcggacccccacgcccctccccgctggatgaagaggcatttcac
attgttccaagtctcatcgctgcttcactgccctggatgaaagagcagagggaggcacta
ggggaagagatgggaccctga

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_5|755_aa
MPATQGDCSAAASGYLYDNSSMQPVLQVDQQLMTVWTPEKELPRGPIPMANRFICTQEQE
ATRGRGDRRIWGQGLFPQGPRHPGPEHLAGETFPLHTPQESEHHVTPGKPALECTSGEPY
SPRPSHYRPRANSLMGALSPNQKILDSIPPVEAQHDSDTWKSVAERVNECEYNNSRHLLS
TYCVPDAVLSSLKCGYSYPHEETEAQRGDVICQTTLWQIQDGDPGDLVPKIKTLKCSEKE
PRTTPSRQCFTGDRALSTGYVPSHTRHVLTPGGGQDDPPLQMRRLPPKKGRDCPQITLAT
ESEAPNPGCRTPAATPSPLHQSTSRSAHSQEKVPSGGSQLKGHILAEAFPDPSQPKFSMQ
IGLQSPLSTLYLHTSPGPDIKVDTWSPAHLGESGAAGAEGGAERRSGAAEPRRSWGPSRR
SGRTSGTSSGGRAARSWTRGGGRRLHSWTEGGGVGPGRPAEGRAAGRRGLEHGTARLKEQ
EGEGGLGPRKEKGRARGRERRRKMQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDD
HEGQPRPRVPRKRGHISPKSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVK
KIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEF
HQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFK
VVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_5|2268_bp
atgcctgccacacagggtgactgctcggcggctgcctctggctatctttatgacaatagc
agcatgcagcctgtgctccaggtggaccagcaactgatgacagtgtggacacctgagaag
gagctgccccgtggccccatccccatggcaaacaggttcatctgtacccaggaacaggag
gccacaaggggaaggggagacaggaggatctggggccaaggtctttttccacaagggccc
aggcatcccggcccggagcacctggctggggagacctttccactccacacgccacaggag
agtgagcaccacgtgactcctgggaagcctgccctggaatgcacgtccggggagccctat
tccccccgaccttcccactacagacctcgggccaacagcctaatgggggccctcagtcct
aatcagaagatcctagattccatccccccagtagaagctcagcacgacagcgacacttgg
aagtctgtggctgaacgagtaaatgaatgcgaatataacaacagccggcacttactgagc
acctactgtgtgccagacgctgttctcagcagtctaaaatgtggatactcttatccccac
gaagaaactgaggcacagagaggtgatgtaatttgccagacgacactgtggcaaatccag
gatggggacccaggcgatctggttccaaagatcaagacactgaagtgtagcgagaaggag
ccccgcacgaccccttcccgccagtgcttcactggggacagagcactttctaccggctac
gtgccgtctcacactcgccacgtcctcaccccaggaggcgggcaggatgatccccctttg
cagatgaggaggctaccgcccaagaaggggagagactgcccccagatcaccctggccaca
gagagcgaggcccccaatccaggctgccggactccagctgccacgccctccccactccac
cagagcacaagcaggagcgctcacagccaggagaaggtcccttctggcggatcccagctg
aaaggtcacattcttgcagaggcattccctgacccctcccagcccaaattctccatgcaa
ataggactgcagtctcccctgtccacactgtacctccacaccagcccaggaccggacata
aaagtagacacttggtccccagcacatctgggcgagagcggcgccgctggagccgagggg
ggcgccgagcgcagatctggagcagcagagccacggcgcagctggggcccttcgaggcgc
tcggggcgcacatctgggacctcgagcgggggccgtgccgcgcgcagctggaccagggga
ggggggcggcggctgcacagctggaccgaagggggcggggtcggccctgggcgacccgct
gaggggagggccgcgggccgccggggactggagcatgggacggcgcgcctgaaggagcag
gaaggggaaggaggcctgggaccccgaaaagagaaggggagagcgaggggacgagagcgg
aggaggaagatgcaactgactcgctgctgcttcgtgttcctggtgcagggtagcctctat
ctggtcatctgtggccaggatgatggtcctcccggctcagaggaccctgagcgtgatgac
cacgagggccagccccggccccgggtgcctcggaagcggggccacatctcacctaagtcc
cgccccatggccaattccactctcctagggctgctggccccgcctggggaggcttggggc
attcttgggcagccccccaaccgcccgaaccacagccccccaccctcagccaaggtgaag
aaaatctttggctggggcgacttctactccaacatcaagacggtggccctgaacctgctc
gtcacagggaagattgtggaccatggcaatgggaccttcagcgtccacttccaacacaat
gccacaggccagggaaacatctccatcagcctcgtgccccccagtaaagctgtagagttc
caccaggaacagcagatcttcatcgaagccaaggcctccaaaatcttcaactgccggatg
gagtgggagaaggtagaacggggccgccggacctcgctttgcacccacgacccagccaag
atctgctcccgagaccacgctcagagctcagccacctggagctgctcccagcccttcaaa
gtcgtctgtgtctacatcgccttctacagcacggactatcggctggtccagaaggtgtgc
ccagattacaactaccatagtgataccccctactacccatctgggtga

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_6|377_aa
MSSGPVAESWCYTQVVHFLFNCFLFFYQQIKVVKFSYMWTINNFSFCREEMGEVIKSSTF
SSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAM
ESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMN
MVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESK
KNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNL
SVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVSHPHLVAEAYRSLA
SAQCPFLGPPRKRLKQS

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_6|1134_bp
atgtcgagtggccccgtagctgagagttggtgctacacacaggttgtccacttcctattt
aattgcttcctgtttttctatcaacagatcaaggtagtgaaattctcctacatgtggacc
atcaataactttagcttttgccgggaggaaatgggtgaagtcattaaaagttctacattt
tcatcaggagcaaatgataaactgaaatggtgtttgcgagtaaaccccaaagggttagat
gaagaaagcaaagattacctgtcactttacctgttactggtcagctgtccaaagagtgaa
gttcgggcaaaattcaaattctccatcctgaatgccaagggagaagaaaccaaagctatg
gagagtcaacgggcatataggtttgtgcaaggcaaagactggggattcaagaaattcatc
cgtagagattttcttttggatgaggccaacgggcttctccctgatgacaagcttaccctc
ttctgcgaggtgagtgttgtgcaagattctgtcaacatttctggccagaataccatgaac
atggtaaaggttcctgagtgccggctggcagatgagttaggaggactgtgggagaattcc
cggttcacagactgctgcttgtgtgttgccggccaggaattccaggctcacaaggctatc
ttagcagctcgttctccggtttttagtgccatgtttgaacatgaaatggaggagagcaaa
aagaatcgagttgaaatcaatgatgtggagcctgaagtttttaaggaaatgatgtgcttc
atttacacggggaaggctccaaacctcgacaaaatggctgatgatttgctggcagctgct
gacaagtatgccctggagcgcttaaaggtcatgtgtgaggatgccctctgcagtaacctg
tccgtggagaacgctgcagaaattctcatcctggccgacctccacagtgcagatcagttg
aaaactcaggcagtggatttcatcaactatcatgcttcggatgtcttggagacctctggg
tggaagtcaatggtggtgtcacatccccacttggtggctgaggcataccgctctctggct
tcagcacagtgcccttttctgggacccccacgcaaacgcctgaagcaatcctaa

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_7|56_aa
MLARPSTSHLLCAWFLTGHGSRDKRAEKNSEVGVWEVEKVASRSLLRCGSMDEHMK

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_7|171_bp
atgcttgctcgcccgtccacctctcacctcctgtgcgcctggttcctaacaggccatggg
tcgagggataagagggctgagaaaaattcagaagttggtgtctgggaagtggagaaagtt
gcaagcagatctttgctaaggtgtgggagcatggatgagcacatgaagtag

>gi568815581f:49476220_49679297|GENSCAN_predicted_peptide_8|78_aa
MGRRPRGVGSGGTRWLRRRADAMTPRLVPDVADSMRQAQGDGDQQLSPPLSVNYEVGAKV
IAVLSFNGKNLNYFCTNL

>gi568815581f:49476220_49679297|GENSCAN_predicted_CDS_8|237_bp
atggggaggaggccgcgcggggtggggtctggcggtacgcgctggctgcgtcgacgtgct
gacgccatgacgccccggctggtcccggatgttgcggacagtatgaggcaagcgcagggg
gacggggaccagcagctgtcgccgccgctctcagtaaattatgaggttggtgcaaaagta
attgcggttttgtcatttaatggtaaaaacctcaattacttttgcaccaacctctaa