GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:26:48 Sequence gi568815581r:49324272_49524922 : 200651 bp : 49.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1742 1865 124 1 1 70 92 74 0.746 6.33 1.02 Term + 28361 28488 128 0 2 86 39 71 0.233 0.44 1.03 PlyA + 28525 28530 6 1.05 2.03 PlyA - 29806 29801 6 1.05 2.02 Term - 37072 37063 10 0 1 100 49 6 0.374 -4.43 2.01 Init - 37690 37638 53 1 2 84 100 119 0.774 11.33 2.00 Prom - 66023 65984 40 -1.96 3.03 PlyA - 66464 66459 6 1.05 3.02 Term - 70931 70846 86 0 2 74 49 63 0.304 -1.18 3.01 Init - 76499 76208 292 2 1 49 91 141 0.409 7.81 3.00 Prom - 78820 78781 40 -7.66 4.07 PlyA - 79809 79804 6 1.05 4.06 Term - 80933 80721 213 2 0 93 55 434 0.916 37.83 4.05 Intr - 82583 82488 96 2 0 99 105 95 0.999 12.51 4.04 Intr - 84887 84771 117 2 0 88 91 120 0.998 12.96 4.03 Intr - 85202 85060 143 0 2 55 89 288 0.997 25.67 4.02 Intr - 87568 87407 162 2 0 85 113 116 0.991 13.75 4.01 Init - 95186 94997 190 2 1 59 4 207 0.007 6.84 4.00 Prom - 99745 99706 40 -6.06 5.00 Prom + 104379 104418 40 -6.06 5.01 Init + 114769 114912 144 0 0 84 49 130 0.711 8.82 5.02 Intr + 121360 121446 87 0 0 51 105 76 0.626 5.77 5.03 Term + 121886 121927 42 1 0 107 43 61 0.848 0.66 5.04 PlyA + 122699 122704 6 1.05 6.05 PlyA - 128239 128234 6 1.05 6.04 Term - 129894 129715 180 0 0 86 50 51 0.133 -1.29 6.03 Intr - 134808 134777 32 1 2 131 89 24 0.394 4.65 6.02 Intr - 148312 148168 145 1 1 122 20 40 0.042 0.46 6.01 Init - 155311 155249 63 1 0 102 99 57 0.586 7.38 6.00 Prom - 160823 160784 40 -1.36 7.02 PlyA - 163809 163804 6 -0.45 7.01 Sngl - 167874 167680 195 0 0 95 47 224 0.861 14.16 7.00 Prom - 168902 168863 40 -7.36 8.00 Prom + 169006 169045 40 -5.66 8.01 Init + 171174 171212 39 2 0 101 84 152 0.237 14.29 8.02 Intr + 177792 177933 142 2 1 120 79 46 0.908 6.83 8.03 Intr + 182028 182387 360 1 0 45 96 836 0.990 75.29 8.04 Intr + 186141 186393 253 1 1 77 94 293 0.983 25.29 8.05 Intr + 187621 187781 161 1 2 101 84 262 0.999 26.73 8.06 Term + 188437 188738 302 2 2 110 52 421 0.648 36.08 8.07 PlyA + 193793 193798 6 1.05 9.04 PlyA - 194861 194856 6 1.05 9.03 Term - 195609 195465 145 2 1 98 45 26 0.777 -3.32 9.02 Intr - 196613 196496 118 0 1 70 96 89 0.462 7.42 9.01 Intr - 197445 197367 79 0 1 105 43 46 0.331 0.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 3250 3323 74 2 2 50 43 94 0.821 -0.83 S.002 Init - 88993 88906 88 1 1 104 77 50 0.983 6.34 S.003 Sngl - 95186 94974 213 2 0 59 44 214 0.969 7.55 S.004 Sngl - 100651 99998 654 1 0 78 48 161 0.843 7.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_1|83_aa MGAFRKSIYGSIAETSVLKIKSRKKYGTNKAELCSKHKYGPGIYPQEMKLIFTPKPAMNI YSGSVHNHPKLEAKCPSNGYECT >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_1|252_bp atgggggcttttcggaaaagcatatatggcagtatagcagaaacatctgtactcaaaatt aagtcaaggaagaagtatgggaccaataaagcagagctatgtagcaaacataagtatggg ccaggtatttacccacaagaaatgaaacttatattcaccccaaaacctgccatgaacatt tacagtggctctgttcataatcacccgaaactggaagccaaatgtccctcaaatggttac gaatgtacgtga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_2|20_aa MDRRGGGRGGGGGGGRPGLC >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_2|63_bp atggaccgccgaggcggcggccggggcggcggaggcggaggcggccggcccgggctctgc tga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_3|125_aa MGAVQKGIPHKHFHGKTQKSLQCYQQAVGIVGNKGKNLVKRMNVLIEHIKHSESWNRFLK HIKINDQKKKDAKEKSMWVQLKRQPAPPREAHIVKTNDGSNLPVTSRASLCQPAQDVQKN IPGKD >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_3|378_bp atgggtgctgttcaaaaaggaattccccacaaacatttccatggcaaaactcaaaagagt ctacagtgttaccagcaagctgttggcattgttggaaacaagggcaagaatcttgtcaag agaatgaatgtacttattgagcacattaagcactctgagagctggaatcgcttcctgaaa catatcaaaataaatgatcagaaaaagaaagacgccaaagagaaaagtatgtgggttcaa ctgaagcgccagcctgctccacccagagaagcacacattgtgaaaaccaatgatggcagc aacctgcctgttactagccgtgccagcctgtgtcaaccagcacaagatgttcagaaaaat atcccggggaaagactag >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_4|306_aa MRSASARLPCLGSEKRLCLAAVQSSKCEVTAFLQVYPTAPNRQRPSRTGHDDDGSFVKKK RGKLDAGHRAVIFDRFRGVQDIVVGEGTHFLIPWVQKPIIFDCRSRPRNVPVITGSKDLQ NVNITLRILFRPVASQLPRIFTSIGEDYDERVLPSITTEILKSVVARFDAGELITQRELV SRQVSDDLTERAATFGLILDDVSLTHLTFGKEFTEAVEAKQVAQQEAERARFVVEKAEQQ KKAAIISAEGDSKAAELIANSLATAGDGLIELRKLEAAEDIAYQLSRSRNITYLPAGQSV LLQLPQ >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_4|921_bp atgaggagcgcctctgcccggctgccctgtctgggaagtgagaagcgcctctgcctggcc gctgtgcaatcttccaagtgtgaagtgacagcctttctgcaggtgtacccaacagctccg aatagacagcgaccatcgagaacgggccatgatgacgatggcagttttgtcaaaaagaaa agggggaaattggatgctgggcacagagctgtcatctttgaccgattccgtggagtgcag gacattgtggtaggggaagggactcattttctcatcccgtgggtacagaaaccaattatc tttgactgccgttctcgaccacgtaatgtgccagtcatcactggtagcaaagatttacag aatgtcaacatcacactgcgcatcctcttccggcctgtcgccagccagcttcctcgcatc ttcaccagcatcggagaggactatgatgagcgtgtgctgccgtccatcacaactgagatc ctcaagtcagtggtggctcgctttgatgctggagaactaatcacccagagagagctggtc tccaggcaggtgagcgacgaccttacagagcgagccgccacctttgggctcatcctggat gacgtgtccttgacacatctgaccttcgggaaggagttcacagaagcggtggaagccaaa caggtggctcagcaggaagcagagagggccagatttgtggtggaaaaggctgagcaacag aaaaaggcggccatcatctctgctgagggcgactccaaggcagctgagctgattgccaac tcactggccactgcaggggatggcctgatcgagctgcgcaagctggaagctgcagaggac atcgcgtaccagctctcacgctctcggaacatcacctacctgccagcggggcagtccgtg ctcctccagctgccccagtga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_5|90_aa MAILGPESNQIRCSPQSPQQLRRTAVRLRALSGWNGTSCPPLISGCPRYPRPDNGGFRGA WMAGHSPLCEAELTAKLEVFVDLSGRSMAD >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_5|273_bp atggctattttggggcctgaatcaaatcaaatccgctgcagcccccagagcccccagcag ctgcgcagaacagctgtccggctgagggcgctgagcggctggaatggcacttcctgcccc ccactgatctctggctgcccaaggtacccccggcccgacaatgggggcttcagaggagcc tggatggcaggacacagccccctctgtgaagcggagttgaccgccaagctggaggtcttc gtggatttatcagggagaagcatggctgactaa >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_6|139_aa MGWGLLRAVEAGSHGLTAGPQGQRLGMEMWAAARLLDAAGSSSLPASEPKSHLSSYKATE TLSKEGPPSGRPLKSSPPPQSQGDTKAESRQPPLILCPQTPEHYPIPHCPHEGIFQGHRK EHLKGRRLVRQALVEVKKV >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_6|420_bp atgggctggggcctcctcagagccgtggaggcagggagtcacggactgacagccgggcct caggggcagaggttgggcatggagatgtgggcagctgctagactcctggatgctgcaggg tcctcctccctgccagcttcagagcccaaaagccacctgtcctcttacaaagccacagag accctgagcaaagaggggccaccttctggaaggcccctcaagtcatctccacccccacag agccaaggagacaccaaggctgagtccaggcagcctcccctgatcctctgtccccaaacc cctgagcactaccccatccctcactgtccccatgaagggattttccagggccacaggaag gaacacttgaagggaaggagactggtgcggcaggcactggtggaggtgaagaaagtatga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_7|64_aa MTRSIYTKTSTTMTIIIMTTTSTITIITTTATITTTSNNTSTIMTIIIMPLPSVSLPESQ LSPP >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_7|195_bp atgaccagaagcatctataccaaaactagtaccactatgaccatcatcattatgaccacc accagtaccatcaccatcatcaccaccactgccaccatcaccaccacttctaacaacacc agcaccattatgaccatcatcattatgccactgccatcagtgtcactaccagaatcacaa ctatcaccaccatga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_8|418_aa MDGPRLLLLLLLGVSLGGAKEACPTGLYTHSGECCKACNLGEGVAQPCGANQTVCEPCLD SVTFSDVVSATEPCKPCTECVGLQSMSAPCVEADDAVCRCAYGYYQDETTGRCEACRVCE AGSGLVFSCQDKQNTVCEECPDGTYSDEANHVDPCLPCTVCEDTERQLRECTRWADAECE EIPGRWITRSTPPEGSDSTAPSTQEPEAPPEQDLIASTVAGVVTTVMGSSQPVVTRGTTD NLIPVYCSILAAVVVGLVAYIAFKRWNSCKQNKQGANSRPVNQTPPPEGEKLHSDSGISV DSQSLHDQQPHTQTASGQALKGDGGLYSSLPPAKREEVEKLLNGSAGDTWRHLAGELGYQ PEHIDSFTHEACPVRALLASWATQDSATLDALLAALRRIQRADLVESLCSESTATSPV >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_8|1257_bp atggacgggccgcgcctgctgctgttgctgcttctgggggtgtcccttggaggtgccaag gaggcatgccccacaggcctgtacacacacagcggtgagtgctgcaaagcctgcaacctg ggcgagggtgtggcccagccttgtggagccaaccagaccgtgtgtgagccctgcctggac agcgtgacgttctccgacgtggtgagcgcgaccgagccgtgcaagccgtgcaccgagtgc gtggggctccagagcatgtcggcgccgtgcgtggaggccgacgacgccgtgtgccgctgc gcctacggctactaccaggatgagacgactgggcgctgcgaggcgtgccgcgtgtgcgag gcgggctcgggcctcgtgttctcctgccaggacaagcagaacaccgtgtgcgaggagtgc cccgacggcacgtattccgacgaggccaaccacgtggacccgtgcctgccctgcaccgtg tgcgaggacaccgagcgccagctccgcgagtgcacacgctgggccgacgccgagtgcgag gagatccctggccgttggattacacggtccacacccccagagggctcggacagcacagcc cccagcacccaggagcctgaggcacctccagaacaagacctcatagccagcacggtggca ggtgtggtgaccacagtgatgggcagctcccagcccgtggtgacccgaggcaccaccgac aacctcatccctgtctattgctccatcctggctgctgtggttgtgggccttgtggcctac atagccttcaagaggtggaacagctgcaagcagaacaagcaaggagccaacagccggcca gtgaaccagacgcccccaccagagggagaaaaactccacagcgacagtggcatctccgtg gacagccagagcctgcatgaccagcagccccacacgcagacagcctcgggccaggccctc aagggtgacggaggcctctacagcagcctgcccccagccaagcgggaggaggtggagaag cttctcaacggctctgcgggggacacctggcggcacctggcgggcgagctgggctaccag cccgagcacatagactcctttacccatgaggcctgccccgttcgcgccctgcttgcaagc tgggccacccaggacagcgccacactggacgccctcctggccgccctgcgccgcatccag cgagccgacctcgtggagagtctgtgcagtgagtccactgccacatccccggtgtga >gi568815581r:49324272_49524922|GENSCAN_predicted_peptide_9|113_aa AGFKRLPCRILPGSLQPENFLPLRNPNCGHPKVRRVEGEPSGLLLKALGPGVELGLQNVD SKGFSGAQGYRLSGPPRPSPLDEEAFHIVPSLIAASLPWMKEQREALGEEMGP >gi568815581r:49324272_49524922|GENSCAN_predicted_CDS_9|342_bp gctggattcaaacggctcccctgcagaattcttcctgggtccctgcagccagaaaacttc ctccctcttcggaaccccaactgcgggcatcccaaggtccgcagggtagaaggtgaacca tctggactactgttgaaagccttaggccctggggtggagctggggcttcagaatgtggac tccaagggcttctcaggagcacagggttacaggctcagcggacccccacgcccctccccg ctggatgaagaggcatttcacattgttccaagtctcatcgctgcttcactgccctggatg aaagagcagagggaggcactaggggaagagatgggaccctga