GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:20:58 Sequence gi568815581f:49395418_49613006 : 217589 bp : 50.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1107 1102 6 1.05 1.06 Term - 9787 9575 213 1 0 93 55 434 0.919 37.83 1.05 Intr - 11437 11342 96 1 0 99 105 95 0.999 12.51 1.04 Intr - 13741 13625 117 1 0 88 91 120 0.998 12.96 1.03 Intr - 14056 13914 143 2 2 55 89 288 0.997 25.67 1.02 Intr - 16422 16261 162 1 0 85 113 116 0.991 13.75 1.01 Init - 24040 23851 190 1 1 59 4 207 0.007 6.84 1.00 Prom - 28599 28560 40 -6.06 2.00 Prom + 33233 33272 40 -6.06 2.01 Init + 43623 43766 144 2 0 84 49 130 0.711 8.82 2.02 Intr + 50214 50300 87 2 0 51 105 76 0.626 5.77 2.03 Term + 50740 50781 42 0 0 107 43 61 0.848 0.66 2.04 PlyA + 51553 51558 6 1.05 3.05 PlyA - 57093 57088 6 1.05 3.04 Term - 58748 58569 180 2 0 86 50 51 0.133 -1.29 3.03 Intr - 63662 63631 32 0 2 131 89 24 0.394 4.65 3.02 Intr - 77166 77022 145 0 1 122 20 40 0.042 0.46 3.01 Init - 84165 84103 63 0 0 102 99 57 0.586 7.38 3.00 Prom - 89677 89638 40 -1.36 4.02 PlyA - 92663 92658 6 -0.45 4.01 Sngl - 96728 96534 195 2 0 95 47 224 0.861 14.16 4.00 Prom - 97756 97717 40 -7.36 5.00 Prom + 97860 97899 40 -5.66 5.01 Init + 100028 100066 39 1 0 101 84 152 0.237 14.29 5.02 Intr + 106646 106787 142 1 1 120 79 46 0.908 6.83 5.03 Intr + 110882 111241 360 0 0 45 96 836 0.990 75.29 5.04 Intr + 114995 115247 253 0 1 77 94 293 0.983 25.29 5.05 Intr + 116475 116635 161 0 2 101 84 262 0.999 26.73 5.06 Term + 117291 117592 302 1 2 110 52 421 0.648 36.08 5.07 PlyA + 122647 122652 6 1.05 6.07 PlyA - 123715 123710 6 1.05 6.06 Term - 124463 124319 145 1 1 98 45 26 0.759 -3.32 6.05 Intr - 125467 125350 118 2 1 70 96 89 0.628 7.42 6.04 Intr - 126299 126221 79 2 1 105 43 46 0.730 0.92 6.03 Intr - 129639 129581 59 1 2 97 60 47 0.494 1.40 6.02 Intr - 131058 130951 108 1 0 69 28 120 0.806 4.36 6.01 Init - 135606 135555 52 0 1 58 48 69 0.569 1.22 6.00 Prom - 143026 142987 40 -1.76 7.00 Prom + 144620 144659 40 -5.36 7.01 Init + 153576 153637 62 2 2 84 84 55 0.332 5.42 7.02 Intr + 159371 159551 181 2 1 87 15 92 0.212 1.67 7.03 Intr + 159590 159766 177 1 0 52 48 88 0.159 1.32 7.04 Intr + 172401 172442 42 2 0 77 76 39 0.356 0.04 7.05 Intr + 173045 173278 234 1 0 1 77 174 0.746 5.49 7.06 Intr + 174994 175307 314 0 2 52 76 177 0.832 7.88 7.07 Intr + 176176 176227 52 1 1 85 99 3 0.829 0.01 7.08 Intr + 176361 176452 92 2 2 102 111 27 0.895 5.19 7.09 Intr + 180448 180856 409 1 1 12 105 146 0.388 2.97 7.10 Term + 183179 183883 705 1 0 104 46 933 0.955 84.22 7.11 PlyA + 184921 184926 6 1.05 8.06 PlyA - 188279 188274 6 1.05 8.05 Term - 205105 204961 145 0 1 72 44 55 0.381 -3.12 8.04 Intr - 206590 206448 143 1 2 70 75 186 0.730 14.65 8.03 Intr - 211955 211833 123 2 0 35 116 164 0.999 14.68 8.02 Intr - 212512 212457 56 2 2 74 95 44 0.333 2.40 8.01 Intr - 216040 215863 178 1 1 103 77 111 0.944 10.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 17847 17760 88 0 1 104 77 50 0.983 6.34 S.002 Sngl - 24040 23828 213 1 0 59 44 214 0.969 7.55 S.003 Sngl - 29505 28852 654 0 0 78 48 161 0.843 7.28 S.004 Term - 149812 149708 105 1 0 153 48 46 0.974 5.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_1|306_aa MRSASARLPCLGSEKRLCLAAVQSSKCEVTAFLQVYPTAPNRQRPSRTGHDDDGSFVKKK RGKLDAGHRAVIFDRFRGVQDIVVGEGTHFLIPWVQKPIIFDCRSRPRNVPVITGSKDLQ NVNITLRILFRPVASQLPRIFTSIGEDYDERVLPSITTEILKSVVARFDAGELITQRELV SRQVSDDLTERAATFGLILDDVSLTHLTFGKEFTEAVEAKQVAQQEAERARFVVEKAEQQ KKAAIISAEGDSKAAELIANSLATAGDGLIELRKLEAAEDIAYQLSRSRNITYLPAGQSV LLQLPQ >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_1|921_bp atgaggagcgcctctgcccggctgccctgtctgggaagtgagaagcgcctctgcctggcc gctgtgcaatcttccaagtgtgaagtgacagcctttctgcaggtgtacccaacagctccg aatagacagcgaccatcgagaacgggccatgatgacgatggcagttttgtcaaaaagaaa agggggaaattggatgctgggcacagagctgtcatctttgaccgattccgtggagtgcag gacattgtggtaggggaagggactcattttctcatcccgtgggtacagaaaccaattatc tttgactgccgttctcgaccacgtaatgtgccagtcatcactggtagcaaagatttacag aatgtcaacatcacactgcgcatcctcttccggcctgtcgccagccagcttcctcgcatc ttcaccagcatcggagaggactatgatgagcgtgtgctgccgtccatcacaactgagatc ctcaagtcagtggtggctcgctttgatgctggagaactaatcacccagagagagctggtc tccaggcaggtgagcgacgaccttacagagcgagccgccacctttgggctcatcctggat gacgtgtccttgacacatctgaccttcgggaaggagttcacagaagcggtggaagccaaa caggtggctcagcaggaagcagagagggccagatttgtggtggaaaaggctgagcaacag aaaaaggcggccatcatctctgctgagggcgactccaaggcagctgagctgattgccaac tcactggccactgcaggggatggcctgatcgagctgcgcaagctggaagctgcagaggac atcgcgtaccagctctcacgctctcggaacatcacctacctgccagcggggcagtccgtg ctcctccagctgccccagtga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_2|90_aa MAILGPESNQIRCSPQSPQQLRRTAVRLRALSGWNGTSCPPLISGCPRYPRPDNGGFRGA WMAGHSPLCEAELTAKLEVFVDLSGRSMAD >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_2|273_bp atggctattttggggcctgaatcaaatcaaatccgctgcagcccccagagcccccagcag ctgcgcagaacagctgtccggctgagggcgctgagcggctggaatggcacttcctgcccc ccactgatctctggctgcccaaggtacccccggcccgacaatgggggcttcagaggagcc tggatggcaggacacagccccctctgtgaagcggagttgaccgccaagctggaggtcttc gtggatttatcagggagaagcatggctgactaa >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_3|139_aa MGWGLLRAVEAGSHGLTAGPQGQRLGMEMWAAARLLDAAGSSSLPASEPKSHLSSYKATE TLSKEGPPSGRPLKSSPPPQSQGDTKAESRQPPLILCPQTPEHYPIPHCPHEGIFQGHRK EHLKGRRLVRQALVEVKKV >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_3|420_bp atgggctggggcctcctcagagccgtggaggcagggagtcacggactgacagccgggcct caggggcagaggttgggcatggagatgtgggcagctgctagactcctggatgctgcaggg tcctcctccctgccagcttcagagcccaaaagccacctgtcctcttacaaagccacagag accctgagcaaagaggggccaccttctggaaggcccctcaagtcatctccacccccacag agccaaggagacaccaaggctgagtccaggcagcctcccctgatcctctgtccccaaacc cctgagcactaccccatccctcactgtccccatgaagggattttccagggccacaggaag gaacacttgaagggaaggagactggtgcggcaggcactggtggaggtgaagaaagtatga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_4|64_aa MTRSIYTKTSTTMTIIIMTTTSTITIITTTATITTTSNNTSTIMTIIIMPLPSVSLPESQ LSPP >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_4|195_bp atgaccagaagcatctataccaaaactagtaccactatgaccatcatcattatgaccacc accagtaccatcaccatcatcaccaccactgccaccatcaccaccacttctaacaacacc agcaccattatgaccatcatcattatgccactgccatcagtgtcactaccagaatcacaa ctatcaccaccatga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_5|418_aa MDGPRLLLLLLLGVSLGGAKEACPTGLYTHSGECCKACNLGEGVAQPCGANQTVCEPCLD SVTFSDVVSATEPCKPCTECVGLQSMSAPCVEADDAVCRCAYGYYQDETTGRCEACRVCE AGSGLVFSCQDKQNTVCEECPDGTYSDEANHVDPCLPCTVCEDTERQLRECTRWADAECE EIPGRWITRSTPPEGSDSTAPSTQEPEAPPEQDLIASTVAGVVTTVMGSSQPVVTRGTTD NLIPVYCSILAAVVVGLVAYIAFKRWNSCKQNKQGANSRPVNQTPPPEGEKLHSDSGISV DSQSLHDQQPHTQTASGQALKGDGGLYSSLPPAKREEVEKLLNGSAGDTWRHLAGELGYQ PEHIDSFTHEACPVRALLASWATQDSATLDALLAALRRIQRADLVESLCSESTATSPV >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_5|1257_bp atggacgggccgcgcctgctgctgttgctgcttctgggggtgtcccttggaggtgccaag gaggcatgccccacaggcctgtacacacacagcggtgagtgctgcaaagcctgcaacctg ggcgagggtgtggcccagccttgtggagccaaccagaccgtgtgtgagccctgcctggac agcgtgacgttctccgacgtggtgagcgcgaccgagccgtgcaagccgtgcaccgagtgc gtggggctccagagcatgtcggcgccgtgcgtggaggccgacgacgccgtgtgccgctgc gcctacggctactaccaggatgagacgactgggcgctgcgaggcgtgccgcgtgtgcgag gcgggctcgggcctcgtgttctcctgccaggacaagcagaacaccgtgtgcgaggagtgc cccgacggcacgtattccgacgaggccaaccacgtggacccgtgcctgccctgcaccgtg tgcgaggacaccgagcgccagctccgcgagtgcacacgctgggccgacgccgagtgcgag gagatccctggccgttggattacacggtccacacccccagagggctcggacagcacagcc cccagcacccaggagcctgaggcacctccagaacaagacctcatagccagcacggtggca ggtgtggtgaccacagtgatgggcagctcccagcccgtggtgacccgaggcaccaccgac aacctcatccctgtctattgctccatcctggctgctgtggttgtgggccttgtggcctac atagccttcaagaggtggaacagctgcaagcagaacaagcaaggagccaacagccggcca gtgaaccagacgcccccaccagagggagaaaaactccacagcgacagtggcatctccgtg gacagccagagcctgcatgaccagcagccccacacgcagacagcctcgggccaggccctc aagggtgacggaggcctctacagcagcctgcccccagccaagcgggaggaggtggagaag cttctcaacggctctgcgggggacacctggcggcacctggcgggcgagctgggctaccag cccgagcacatagactcctttacccatgaggcctgccccgttcgcgccctgcttgcaagc tgggccacccaggacagcgccacactggacgccctcctggccgccctgcgccgcatccag cgagccgacctcgtggagagtctgtgcagtgagtccactgccacatccccggtgtga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_6|186_aa MPNTDEDVEQQELSFTAVGFCVLRDGRALSSLADTARMVRSLRVEVDLSGQEGGLAYVAL CQAPGGTSLLQPGAGFKRLPCRILPGSLQPENFLPLRNPNCGHPKVRRVEGEPSGLLLKA LGPGVELGLQNVDSKGFSGAQGYRLSGPPRPSPLDEEAFHIVPSLIAASLPWMKEQREAL GEEMGP >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_6|561_bp atgccaaacactgatgaggatgtggaacagcaggaactctcattcactgctgttgggttc tgcgttctccgggatggccgtgctctgtcctcacttgctgacacagcccgcatggtccga agcctccgtgtggaagtggatctgagtgggcaggagggaggcctagcctatgtggccctg tgccaggctcctggcggcaccagcctcctgcagcctggggctggattcaaacggctcccc tgcagaattcttcctgggtccctgcagccagaaaacttcctccctcttcggaaccccaac tgcgggcatcccaaggtccgcagggtagaaggtgaaccatctggactactgttgaaagcc ttaggccctggggtggagctggggcttcagaatgtggactccaagggcttctcaggagca cagggttacaggctcagcggacccccacgcccctccccgctggatgaagaggcatttcac attgttccaagtctcatcgctgcttcactgccctggatgaaagagcagagggaggcacta ggggaagagatgggaccctga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_7|755_aa MPATQGDCSAAASGYLYDNSSMQPVLQVDQQLMTVWTPEKELPRGPIPMANRFICTQEQE ATRGRGDRRIWGQGLFPQGPRHPGPEHLAGETFPLHTPQESEHHVTPGKPALECTSGEPY SPRPSHYRPRANSLMGALSPNQKILDSIPPVEAQHDSDTWKSVAERVNECEYNNSRHLLS TYCVPDAVLSSLKCGYSYPHEETEAQRGDVICQTTLWQIQDGDPGDLVPKIKTLKCSEKE PRTTPSRQCFTGDRALSTGYVPSHTRHVLTPGGGQDDPPLQMRRLPPKKGRDCPQITLAT ESEAPNPGCRTPAATPSPLHQSTSRSAHSQEKVPSGGSQLKGHILAEAFPDPSQPKFSMQ IGLQSPLSTLYLHTSPGPDIKVDTWSPAHLGESGAAGAEGGAERRSGAAEPRRSWGPSRR SGRTSGTSSGGRAARSWTRGGGRRLHSWTEGGGVGPGRPAEGRAAGRRGLEHGTARLKEQ EGEGGLGPRKEKGRARGRERRRKMQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDD HEGQPRPRVPRKRGHISPKSRPMANSTLLGLLAPPGEAWGILGQPPNRPNHSPPPSAKVK KIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNISISLVPPSKAVEF HQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDPAKICSRDHAQSSATWSCSQPFK VVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_7|2268_bp atgcctgccacacagggtgactgctcggcggctgcctctggctatctttatgacaatagc agcatgcagcctgtgctccaggtggaccagcaactgatgacagtgtggacacctgagaag gagctgccccgtggccccatccccatggcaaacaggttcatctgtacccaggaacaggag gccacaaggggaaggggagacaggaggatctggggccaaggtctttttccacaagggccc aggcatcccggcccggagcacctggctggggagacctttccactccacacgccacaggag agtgagcaccacgtgactcctgggaagcctgccctggaatgcacgtccggggagccctat tccccccgaccttcccactacagacctcgggccaacagcctaatgggggccctcagtcct aatcagaagatcctagattccatccccccagtagaagctcagcacgacagcgacacttgg aagtctgtggctgaacgagtaaatgaatgcgaatataacaacagccggcacttactgagc acctactgtgtgccagacgctgttctcagcagtctaaaatgtggatactcttatccccac gaagaaactgaggcacagagaggtgatgtaatttgccagacgacactgtggcaaatccag gatggggacccaggcgatctggttccaaagatcaagacactgaagtgtagcgagaaggag ccccgcacgaccccttcccgccagtgcttcactggggacagagcactttctaccggctac gtgccgtctcacactcgccacgtcctcaccccaggaggcgggcaggatgatccccctttg cagatgaggaggctaccgcccaagaaggggagagactgcccccagatcaccctggccaca gagagcgaggcccccaatccaggctgccggactccagctgccacgccctccccactccac cagagcacaagcaggagcgctcacagccaggagaaggtcccttctggcggatcccagctg aaaggtcacattcttgcagaggcattccctgacccctcccagcccaaattctccatgcaa ataggactgcagtctcccctgtccacactgtacctccacaccagcccaggaccggacata aaagtagacacttggtccccagcacatctgggcgagagcggcgccgctggagccgagggg ggcgccgagcgcagatctggagcagcagagccacggcgcagctggggcccttcgaggcgc tcggggcgcacatctgggacctcgagcgggggccgtgccgcgcgcagctggaccagggga ggggggcggcggctgcacagctggaccgaagggggcggggtcggccctgggcgacccgct gaggggagggccgcgggccgccggggactggagcatgggacggcgcgcctgaaggagcag gaaggggaaggaggcctgggaccccgaaaagagaaggggagagcgaggggacgagagcgg aggaggaagatgcaactgactcgctgctgcttcgtgttcctggtgcagggtagcctctat ctggtcatctgtggccaggatgatggtcctcccggctcagaggaccctgagcgtgatgac cacgagggccagccccggccccgggtgcctcggaagcggggccacatctcacctaagtcc cgccccatggccaattccactctcctagggctgctggccccgcctggggaggcttggggc attcttgggcagccccccaaccgcccgaaccacagccccccaccctcagccaaggtgaag aaaatctttggctggggcgacttctactccaacatcaagacggtggccctgaacctgctc gtcacagggaagattgtggaccatggcaatgggaccttcagcgtccacttccaacacaat gccacaggccagggaaacatctccatcagcctcgtgccccccagtaaagctgtagagttc caccaggaacagcagatcttcatcgaagccaaggcctccaaaatcttcaactgccggatg gagtgggagaaggtagaacggggccgccggacctcgctttgcacccacgacccagccaag atctgctcccgagaccacgctcagagctcagccacctggagctgctcccagcccttcaaa gtcgtctgtgtctacatcgccttctacagcacggactatcggctggtccagaaggtgtgc ccagattacaactaccatagtgataccccctactacccatctgggtga >gi568815581f:49395418_49613006|GENSCAN_predicted_peptide_8|214_aa VSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAA RSPVFSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKY ALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKS MVVSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS >gi568815581f:49395418_49613006|GENSCAN_predicted_CDS_8|645_bp gtgagtgttgtgcaagattctgtcaacatttctggccagaataccatgaacatggtaaag gttcctgagtgccggctggcagatgagttaggaggactgtgggagaattcccggttcaca gactgctgcttgtgtgttgccggccaggaattccaggctcacaaggctatcttagcagct cgttctccggtttttagtgccatgtttgaacatgaaatggaggagagcaaaaagaatcga gttgaaatcaatgatgtggagcctgaagtttttaaggaaatgatgtgcttcatttacacg gggaaggctccaaacctcgacaaaatggctgatgatttgctggcagctgctgacaagtat gccctggagcgcttaaaggtcatgtgtgaggatgccctctgcagtaacctgtccgtggag aacgctgcagaaattctcatcctggccgacctccacagtgcagatcagttgaaaactcag gcagtggatttcatcaactatcatgcttcggatgtcttggagacctctgggtggaagtca atggtggtgtcacatccccacttggtggctgaggcataccgctctctggcttcagcacag tgcccttttctgggacccccacgcaaacgcctgaagcaatcctaa