GENSCAN 1.0 Date run: 1-Aug-119 Time: 17:01:32

Sequence gi568815580f:65662843_65980891 : 318049 bp : 35.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8833   9031  199  0  1   13   39   225 0.224   8.10
 1.02 Term +   9346   9788  443  2  2   32   35   262 0.562   9.73
 1.03 PlyA +   9984   9989    6                               1.05

 2.04 PlyA -  10680  10675    6                               1.05
 2.03 Term -  29411  29209  203  0  2   47   42   153 0.697   3.17
 2.02 Intr -  30062  29804  259  2  1  -60   58   224 0.023   0.71
 2.01 Init -  43694  43587  108  2  0   83   66   110 0.900   8.57
 2.00 Prom -  61614  61575   40                              -2.85

 3.04 PlyA -  62931  62926    6                               1.05
 3.03 Term -  78407  77833  575  0  2   55   38   329 0.048  18.23
 3.02 Intr -  87373  87250  124  1  1  100  -22    82 0.001  -2.36
 3.01 Init - 111817 111761   57  1  0   34   87    97 0.697   5.56
 3.00 Prom - 116946 116907   40                              -3.65

 4.00 Prom + 124788 124827   40                              -3.15
 4.01 Init + 141952 141996   45  0  0   72  103    25 0.437   3.14
 4.02 Intr + 146862 147156  295  2  1   45  106   431 0.968  36.26
 4.03 Intr + 151643 151762  120  0  0   21   98   156 0.912   9.45
 4.04 Intr + 159239 159406  168  0  0   59  100   189 0.190  16.20
 4.05 Intr + 161802 161989  188  1  2   84   90   215 0.961  19.79
 4.06 Intr + 180970 181223  254  0  2  103   83   425 0.624  38.81
 4.07 Intr + 194974 195110  137  1  2   52   97   163 0.956  12.89
 4.08 Intr + 196083 196204  122  1  2   53  110   146 0.924  12.59
 4.09 Intr + 196866 196983  118  2  1  117   90   104 0.933  12.62
 4.10 Intr + 199824 200075  252  1  0   68   69   292 0.788  21.78
 4.11 Term + 217559 218052  494  0  2   99   38   612 0.832  51.08
 4.12 PlyA + 218101 218106    6                               1.05

 5.00 Prom + 219757 219796   40                              -5.35
 5.01 Init + 230563 230636   74  0  2   93  103    28 0.089   5.39
 5.02 Intr + 240628 240805  178  1  1    9   75   123 0.582   2.10
 5.03 Intr + 263699 263832  134  1  2   71   86    60 0.060   2.62
 5.04 Intr + 268818 269111  294  0  0   25   71   154 0.043   2.60
 5.05 Intr + 272200 272267   68  1  2   80   56    57 0.046  -0.67
 5.06 Term + 274792 274841   50  2  2  110   47    68 0.757   1.39
 5.07 PlyA + 277548 277553    6                               1.05

 6.03 PlyA - 277673 277668    6                               1.05
 6.02 Term - 293178 293033  146  1  2   48   32   187 0.731   6.19
 6.01 Init - 295649 295601   49  2  1   89   95     1 0.735   1.51
 6.00 Prom - 306005 305966   40                              -6.45

 7.00 Prom + 312557 312596   40                              -3.75
 7.01 Sngl + 313616 313993  378  1  0   68   45   304 0.985  20.01
 7.02 PlyA + 314694 314699    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_1|213_aa
GQPDRFLKQVPDPVAPNWVRSPNKRLHTHPTGAFGPASGQCPPGTELPEEGAGRHVCCFA
AITGDTSRAQNLAEAEMDEMTEVGFRRWVITNFTELKEHVLTQCKEAKNYEKTLQELITR
RAILERNINDLTELKNTTQKLHNAITSSNSWIDQVEERISEREDYLSEIRQRGIEKKRMK
RNEQNTVAISQRPKNRTTIQPSNPITEYILKGI

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_1|642_bp
gggcagccagaccgcttccttaagcaggtccctgatcctgttgctcctaactgggtgaga
tctcccaataagcgtctccatacacatcctacaggagcatttgggccggcatcaggtcag
tgccctcctggaacagagctcccagaggaaggagcaggccgccatgtttgctgttttgca
gccatcactggtgatacttcaagggcacagaacttggcggaggctgagatggatgaaatg
acagaagtaggcttccgaaggtgggtaataacgaacttcactgagctgaaagaacatgtt
ttaacccaatgcaaggaagctaagaattatgagaaaacattacaggagctgataaccaga
agagccattttggaaaggaacataaatgacctgacagagctgaaaaacacaacacaaaaa
cttcacaatgcaatcacaagtagcaacagctggatagaccaagtggaggaaagaatctca
gagcgtgaagactatctttctgaaataagacagagaggaatagagaaaaaaagaatgaaa
aggaatgagcaaaacactgtggcaatttctcaaagacctaaaaacagaactaccattcaa
ccaagcaatcccattactgagtatatactcaaaggaatataa

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_2|189_aa
MGLFFGITQLYEFQATLYTELRDCEDCRALQQQRLQCSSAPTPGTCSAEAKAAFTTEVTN
SHSNYRHPGERDTVAQPPPPPEGSAVPSHWEHGCQLPTTISMLQIPRPQLHCVCSAPDPE
SAGNRDAWRLHCQAILRAVSATFHLVGGLIPPEVKGLYPPKLSLSLKEVTVSSASNMQAS
VEDYQKHKK

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_2|570_bp
atgggcctcttctttggaataacacagctgtatgaattccaggctactctctacactgag
ctcagggactgtgaagactgcagggctcttcaacagcaaagactacagtgttcatcggca
ccaactccagggacctgctctgcagaggcaaaagcagcgttcaccactgaagtcaccaac
agccatagcaattatagacaccctggagaaagagacactgttgcacaaccaccaccacct
ccagaaggatctgctgtcccatcacattgggagcatggctgccaacttcctacaactatt
tctatgctccagatcccaagaccacagctgcattgtgtatgttcagctccagaccctgaa
tctgcaggtaacagagatgcatggagactacactgtcaggccatcctcagagctgtaagt
gccacatttcatctcgttggtggcctcatcccacctgaagttaaaggcctttatccacca
aaattgtccttaagtctaaaagaggtgactgtttcttcagcttcaaatatgcaggcatca
gtggaagactaccagaaacataaaaagtaa

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_3|251_aa
MRVAVDTPNDGPEEIVLTQTIFSPAGLSNAKLPKCMIYSQCGTEDPCKVWKEGKKEGRGS
AQNLLELISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT
RDVKDLKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAVPIKLPMT
FFTELEKTTLKFTWNQKRACIAKTILSQKNKAGGITLPDFKLYYKATVTETAWYWYQNRD
IDKWNRTEPQK

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_3|756_bp
atgagagtagctgtagacactcccaatgacggccccgaggaaatcgtgctgactcagaca
attttctcccctgctgggctctctaatgctaaacttccaaagtgtatgatttactcgcag
tgcgggacagaggacccttgcaaggtgtggaaggaaggaaagaaggaaggaagaggatct
gcccaaaatctccttgagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggatgtgaaggacctcaaggagaactacaaaccactgctcaatgaaataaaagaggat
acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg
gccatactgcccaaggtaatttacagattcaatgccgtccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactactttaaagttcacgtggaaccaaaaaagagcctgc
attgccaagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttc
aaactatactacaaggctacagtaaccgaaacagcatggtactggtaccaaaacagagat
atagacaaatggaacagaacagagccccagaaataa

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_4|730_aa
MPVSTRALSPGEHKHLHSDVDKGDGSIKYILSGEGASSIFIIDENTGDIHATKRLDREEQ
AYYTLRAQALDRLTNKPVEPESEFVIKIQDINDNEPKFLDGPYTAGVPEMSPVGTSVVQV
TATDADDPTYGNSARVVYSILQGQPYFSVEPKTGVIKTALPNMDREAKDQYLLVIQAKDM
VGQNGGLSGTTSVTVTLTDVNDNPPRFPRRSYQYNVPESLPVASVVARIKAADADIGANA
EMEYKIVDGDGLGIFKISVDKETQEGIITIQKELDFEAKTSYTLRIEAANKDADPRFLSL
GPFSDTTTVKIIVEDVDEPPVFSSPLYPMEVSEATQVGNIIGTVAAHDPDSSNSPVRYSI
DRNTDLERYFNIDANSGVITTAKSLDRETNAIHNITVLAMESQNPSQVGRGYVAITILDI
NDNAPEFAMDYETTVCENAQPGQVIQKISAVDKDEPSNGHQFYFSLTTDATNNHNFSLKD
NKDNTASILTRRNGFRRQEQSVYYLPIFIVDSGSPSLSSTNTLTIRVCDCDADGVAQTCN
AEAYVLPAGLSTGALIAILACVLTLLVLILLIVTMRRRKKEPLIFDEERDIRENIVRYDD
EGGGEEDTEAFDMAALRNLNVIRDTKTRRDVTPEIQFLSRPAFKSIPDNVIFREFIWERL
KEADVDPGAPPYDSLQTYAFEGNGSVAESLSSLDSISSNSDQNYDYLSDWGPRFKRLADM
YGTGQESLYS

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_4|2193_bp
atgcctgtttctaccagggctctgtctcctggggaacataaacatcttcactctgatgtt
gataaaggagatggttccatcaaatacatcttgtcaggcgaaggggcaagttccattttc
attattgatgagaacactggggatattcatgccaccaagagactggatcgtgaggagcag
gcctactacacgctccgagctcaagcgctggataggctcaccaacaaacccgtggagccc
gagtcggagtttgtcatcaaaattcaggatatcaacgacaatgaacccaaatttttggat
ggcccatacacggcaggagttcccgaaatgtctcccgtggggacctcagtggtacaagtg
acagcgacggatgctgatgatcctacatatggcaacagtgccagagtggtctacagtatt
ctgcaaggacagccgtacttctcagtggagccaaagacaggagtcatcaagactgccctt
ccaaacatggatagagaggctaaagaccagtatttgcttgtcattcaggcaaaggatatg
gttggtcaaaatggaggactgtcaggaactacatcagtcactgtgaccctaactgatgtc
aacgataatccacctcgctttcctcgaaggtcttatcaatataacgtcccagagtcatta
cctgtagcctcagttgtggccagaattaaagctgctgatgcagatattggagctaatgct
gaaatggagtacaagattgtggatggtgatggtttgggcatttttaagatttctgttgac
aaagaaacccaggaaggaatcattactatacagaaggagctggattttgaagccaaaaca
agttacacgctacggatagaagctgcaaataaagatgccgaccctcgctttctgagcttg
ggtccgttcagtgacacgacaactgtgaagataattgtggaagatgtagatgagccccct
gtgttctcttcacccttgtaccctatggaggtgtcggaagctacccaggttgggaatatc
attggcactgtagcagctcatgacccagattcttccaatagccctgtgaggtactcaatt
gacagaaacacagacttggagagatacttcaatattgatgccaacagtggggtcatcaca
actgccaagtctttggatcgagagacaaatgctattcacaatatcacagtccttgcaatg
gagagccagaatccatctcaagtaggaagaggctatgtggccatcactatacttgacatc
aatgataacgcccctgaatttgccatggactatgagaccaccgtctgtgaaaatgcccag
ccggggcaggttatccagaaaatcagtgctgtggataaagatgagccatccaatggacac
cagttttacttcagcttaacaacggatgcaacaaataaccacaacttttcattgaaagat
aacaaagacaacacagcctcaatactgaccaggagaaacggcttccggagacaggaacaa
tcagtttactatctgccaattttcattgtggacagtggatctccctcacttagcagcacc
aacaccctcaccatccgcgtgtgtgactgtgatgctgacggcgtagcccagacctgcaat
gcagaggcctatgtcctacctgctggcctcagtacaggagccctgatagccatactcgcc
tgtgtcttgacattattggtgttgatcctccttatcgtcactatgagaagacggaaaaaa
gagccccttatttttgacgaagaaagagacatcagagaaaatattgtgagatacgatgac
gagggcgggggagaggaggacacggaagcgtttgacatggctgcactgagaaacctcaac
gtcatccgagacaccaagacccggagggatgtgactccagaaattcaattcctgagtcga
ccagcttttaaaagcatcccagataatgtcatctttagggaatttatttgggaaagatta
aaagaagccgatgttgatcctggtgctcctccttatgactccctgcagacatatgctttt
gaaggaaatggctcagttgctgaatcactcagctctttagattccatcagctcaaactct
gatcagaactatgactacctaagtgactggggacctcgctttaaacgactcgcggacatg
tatgggactggccaagagagtttgtactcatag

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_5|265_aa
MEAPSSLFFVTMHTVKEAGNMAPASGSYAVRMQGHNNDTIIFGESEGKVERGVRDKTLRI
GYSVHCFDGRFTKISEIPSKELLNKSLKAENLLWLELEKCNRRQNQKDWKHKRVSPLTDG
GGLMESLRRVRQTLHTGELWLASGRCPSGTNLPEEGAGSNLYCSAASASDTQANRVWSGP
PANSSRLAEKGSTRRKTNKQKAVTSTSTKRMPMQKLHPKVTNIKHQRTRPSPEANQVWPI
DLGLSSLQECRPTNRQIKELSDKGA

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_5|798_bp
atggaggctccatcttcccttttctttgtcaccatgcatacagtaaaagaagcaggcaac
atggcaccagccagtgggagctacgctgtgaggatgcaagggcataacaatgatacaatc
atttttggggagtcggaaggcaaggttgagagaggagtgagggataaaacactacgtatt
gggtacagtgtacactgctttgacggcaggttcactaaaatctcagaaatccccagtaaa
gaacttcttaataaatccttaaaagcagagaaccttctctggctggagttagagaaatgc
aacagaaggcaaaatcagaaagattggaaacataaaagggtctcacctctcactgatgga
gggggcctcatggaaagcctgagaagggttcgacagacacttcatactggagagctctgg
ctggcatctggcaggtgcccctctgggacgaaccttccagaggaaggggcaggcagcaat
ctttactgttctgcagcctctgctagtgatacgcaggcaaacagggtctggagtggacct
ccagcaaactccagcagacttgcagaaaaggggtctactagaaggaaaactaacaaacag
aaagcagtaacatcaacatcaacaaaaaggatgcccatgcaaaaactccatccaaaggtc
accaacatcaaacaccaaaggacaaggccctcaccagaagccaaccaggtatggcccatt
gatcttggactttccagcctccaggaatgcagaccaacaaataggcagatcaaggagctg
tctgacaaaggagcctaa
>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_6|64_aa
MSFMLLQFLRSSQISSVHCCSDELLLQYTIEAVTGMMYEKGILMMFYAKDGAAQGTKGLG
EDST

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_6|195_bp
atgagcttcatgttgcttcagttcctccgctcctcccagatctcatcagtgcactgctgc
tctgatgagctgttgcttcagtacaccattgaagctgtcacagggatgatgtatgagaaa
ggcatactgatgatgttttatgcaaaagatggtgctgcccagggaacaaaaggtttagga
gaagattcaacataa

>gi568815580f:65662843_65980891|GENSCAN_predicted_peptide_7|125_aa
MNALGTGRSITPCNPAAASRGVPVTLQSPRGHVLTVHYFSFAVHGWLMCLTAQCDSPLYP
EPLLRIQEESGHTNELKMVNVGDFIANESGFQHDGELKRKWNGKVAGVWLPQPDSLRGPT
IKLSV

>gi568815580f:65662843_65980891|GENSCAN_predicted_CDS_7|378_bp
atgaatgctctgggcactggcagaagcataactccatgcaatcctgcggcagcgtctagg
ggggtgcccgtgaccctccagagcccgagaggacatgtgctaacagtgcactattttagc
ttcgctgtccatggatggcttatgtgtttaacagctcagtgtgacagccctctgtatcct
gagcccttgctcagaatccaggaagaatcaggtcacacgaatgaattgaagatggtaaat
gtgggggattttattgccaatgaaagtggttttcagcacgacggagagctgaaaaggaaa
tggaatgggaaggtggcaggagtttggctgccccaaccagactctctccgaggtcccacc
atcaagctgtccgtctga