GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:38:44 Sequence gi568815596f:72817952_73033951 : 216000 bp : 47.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 7944 7847 98 0 2 89 80 177 0.735 16.79 1.00 Prom - 18880 18841 40 -4.56 2.00 Prom + 29497 29536 40 -2.26 2.01 Init + 33078 33219 142 2 1 46 50 111 0.225 3.40 2.02 Term + 57242 57426 185 0 2 71 51 109 0.674 3.21 2.03 PlyA + 58571 58576 6 1.05 3.00 Prom + 66571 66610 40 -4.26 3.01 Init + 69482 69785 304 1 1 84 94 591 0.975 54.74 3.02 Intr + 70363 70653 291 2 0 44 96 344 0.968 27.91 3.03 Term + 73396 73586 191 2 2 129 46 347 0.999 32.21 3.04 PlyA + 74185 74190 6 1.05 4.00 Prom + 79741 79780 40 -3.86 4.01 Init + 99902 100421 520 1 1 26 96 799 0.396 67.53 4.02 Intr + 102128 102559 432 0 0 39 -68 363 0.036 9.42 4.03 Intr + 103074 103172 99 1 0 98 16 91 0.042 2.98 4.04 Intr + 106358 106542 185 0 2 1 89 413 0.067 32.21 4.05 Intr + 107657 107752 96 1 0 119 2 63 0.404 1.01 4.06 Term + 115836 116003 168 2 0 94 46 352 0.999 29.48 4.07 PlyA + 116375 116380 6 1.05 5.07 PlyA - 116555 116550 6 -3.64 5.06 Term - 119434 119311 124 0 1 124 55 91 0.742 7.26 5.05 Intr - 127148 127075 74 2 2 144 62 84 0.907 9.60 5.04 Intr - 143297 143180 118 1 1 136 78 97 0.947 13.97 5.03 Intr - 150582 150497 86 0 2 116 81 176 0.926 18.32 5.02 Intr - 153734 153619 116 0 2 121 44 215 0.904 20.57 5.01 Init - 154124 154001 124 2 1 74 123 34 0.692 6.05 5.00 Prom - 159584 159545 40 -3.56 6.11 PlyA - 163995 163990 6 1.05 6.10 Term - 165169 165039 131 2 2 102 44 69 0.119 2.24 6.09 Intr - 170397 170307 91 0 1 80 82 73 0.648 5.47 6.08 Intr - 181063 180998 66 1 0 95 68 84 0.774 6.20 6.07 Intr - 182536 182480 57 1 0 97 105 87 0.952 10.38 6.06 Intr - 183627 183574 54 0 0 121 92 24 0.786 5.28 6.05 Intr - 185513 185388 126 2 0 51 98 36 0.616 1.78 6.04 Intr - 188228 188029 200 0 2 86 51 95 0.827 4.67 6.03 Intr - 204625 204571 55 1 1 96 115 58 0.983 7.85 6.02 Intr - 205263 205237 27 0 0 108 89 8 0.690 1.21 6.01 Init - 206532 206446 87 0 0 86 64 74 0.929 3.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 106213 106542 330 0 0 93 89 380 0.819 33.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_1|33_aa MAEAESLETAAEHERILREIESTDTACIGPTLS >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_1|99_bp atggcggaggcggagagcctggagacagcggcagagcacgagcggatcctgcgagagatc gagagcactgacacggcctgcatcgggcccacgctcagn >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_2|108_aa MHGCLERPSSIHSCTNGGFLGHSLDVEAEYTGSSMVCPQDEKEETHTATLAADYMVPTQI EGGSASPSPLTQTLISFGNTLTDTQEQYFASFDPIKLTLSINHHNICI >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_2|327_bp atgcatggatgccttgaacgtccaagctccatccacagctgcacaaacggcggcttcctt ggccactctttagatgtagaggctgaatataccggcagcagcatggtctgccctcaggat gagaaggaagagacccacacagccacactggcagctgattacatggtgcccacccagatt gagggtgggtcagcctctcccagtccactgactcaaacgttaatctcctttggcaacacc ctcacagacacccaggaacaatactttgcatccttcgatccaatcaagttgacactcagt attaaccatcacaacatctgcatctga >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_3|261_aa MEGGLGRAVCLLTGASRGFGRTLAPLLASLLSPGSVLVLSARNDEALRQLEAELGAERSG LRVVRVPADLGAEAGLQQLLGALRELPRPKGLQRLLLINNAGSLGDVSKGFVDLSDSTQV NNYWALNLTSMLCLTSSVLKAFPDSPGLNRTVVNISSLCALQPFKGWALYCAGKAARDML FQVLALEEPNVRVLNYAPGPLDTDMQQLARETSVDPDMRKGLQELKAKGKLVDCKVSAQK LLSLLEKDEFKSGAHVDFYDK >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_3|786_bp atggagggcgggctggggcgtgctgtgtgcttgctgaccggggcctcccgcggcttcggc cggacgctggccccgctcctggcctcgctgctgtcgcccggctccgtgcttgtccttagc gcccgcaacgacgaggcactgcgccagctggaggccgagctgggcgccgagcggtctggc ctgcgcgtggtgcgggtgcccgccgacctgggcgccgaggccggcttgcagcagctgctc ggcgccctgcgcgagctcccccggcccaaggggctgcagcgactgctgcttatcaacaac gcgggctctcttggggatgtgtccaaaggcttcgtggacctgagtgactccactcaagtg aacaactactgggcactgaacttgacctccatgctctgcctgacttccagcgtcctgaag gccttcccggacagtcctggcctcaacagaaccgtggttaacatctcgtccctctgtgcc ctgcaacctttcaaaggctgggcgctgtactgtgcaggaaaggctgctcgtgatatgctg ttccaggtcctggcgctggaggaacctaatgtgagggtgctgaactatgccccaggtcct ctggacacagacatgcagcagttggcccgggagacctccgtggacccagacatgcgaaaa gggctgcaggagctgaaggcaaaggggaagctggtggattgcaaggtgtcagcccagaaa ctgctgagcttactggaaaaggacgagttcaagtctggagcccacgtggacttctatgac aaataa >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_4|499_aa MCLAGCTPRKAAAPGRGALPRARLPRTAPAAATMFQPAAKRGFTIESLVAKDGGTGGGTG GGGAGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFP EAMNHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQAVQLVRS RHPGPHPRLRIQPLTPSAPRFLPPGAEGRGRLERNPAPALSVSSQARRINPSQNPLGNCR RFLSHQSQRVRYRVAQTVGFFPQEPSLDAEMEPSVAAFSDPEIRWALKDFSNPVALLLRG PSSPLQTDALPGGSTPSTLGPSAQPSGPEPLAGGGLSSEDCGSKTAARTPASDVPQDGLL LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQSRPYYTTV PSQVPGSEKMPGMGGSRSLPLAARVKVWFQNRRTKYKRQKLEEEGPESEQKKKGSHHINR WRIATKQANGEDIDVTSND >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_4|1500_bp atgtgcctggctgggtgcacaccccgcaaggcggcggcgccaggacgcggagcgctcccc agagcccggctgcctcgcacagctcccgcggctgcgaccatgttccagcccgcggccaag cgcggctttaccatagagtccttggtggccaaggacggcggcaccggcgggggcactggc ggcgggggcgcgggctcccatctcctggcggcggccgcctccgaggaaccgctccggccc acggcgctcaactaccctcaccccagcgcggccgaggcggccttcgtgagtggcttccct gccgcggccgccgcgggcgcgggccgctcgctctacggtgggcccgagctcgtgttcccc gaggccatgaaccaccccgcgctgaccgtgcatccggcgcaccagctgggcgcctccccg ctgcagcccccgcactccttcttcggcgcccagcaccgggaccctctccatttctacccc tgggtcctgcggaaccgcttcttcggccaccgcttccaggccgttcagcttgtccggagt cggcatcctgggccgcaccctcggcttcgaatccagcccctgacgccctccgcaccgcgg ttcctgcctccgggcgccgagggccgggggcgcctggagagaaatccagctccggctctg agcgtctccagtcaggcgaggcggataaatccttcgcaaaaccctcttggaaattgccgc cgcttcctgagccatcagtcccagcgggtacgttatcgagtagcacaaacagttggattt ttccctcaagaaccgagtctggacgcggagatggagccaagtgtggctgcattttcggac ccggaaatccgttgggcactgaaggacttttcgaaccctgtagcgctgttgcttcgcggt ccatcgtcgccgctgcagacggatgcgctccccggcggctctacgccctccaccctaggg ccctccgcccagccgtccggccctgagcccctggccggcggcggcctctccagcgaagac tgcggctcgaagactgcagctcggaccccggccagcgacgtgccccaggacgggctgctt ctgcacggccccttcgcacgcaagcccaagcggatccgcacggccttctcgccctcgcag ctgctgcggctggagcgcgccttcgagaagaaccactacgtggtgggcgccgagcggaag cagctggccggcagtctcagcctctccgagacgcagtctcgaccctactacaccaccgtc ccctcccaagtcccgggcagtgagaagatgcccggcatggggggcagccggagcctccct ttagcagccagagtgaaggtgtggttccagaaccggaggacaaagtacaaacggcagaag ctggaggaggaagggcctgagtccgagcagaagaagaagggctcccatcacatcaaccgg tggcgcattgccacgaagcaggccaatggggaggacatcgatgtcacctccaatgactag >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_5|213_aa MVEHQASGSGGMRLQVREPSQMDTHFACLVGLPWRRGKQPLASANICNVVLMRYGELEEG IDVLDSDGNLVGSSKIAARHALLETALTRVVLPMPILVLPPIVMSMLEKTALLQARPRLL LPVQSLVCLAAFGLALPLAISLFPQMSEIETSQLEPEIAQATSSRTVVYNKGLDALEQGY SRGLWALAGSLWAEPGSLSQSALMPPGKRSQHV >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_5|642_bp atggtggaacatcaggcctcggggtcaggggggatgcgacttcaggtcagagagccctcc cagatggacacccactttgcctgcctagtgggcttgccctggaggagaggaaagcagccc ctcgccagtgccaatatctgcaatgtggtcctgatgcggtacggggagctggaggaaggg attgatgtcctggacagcgatggcaacctcgtgggctcctccaagatcgcagcccgacac gccctgctggagacggcgctgacgcgagtggtcctgcccatgcccatcctggtgctaccc ccgatcgtcatgtccatgctggagaagacggctctcctgcaggcacgcccccggctgctc ctccctgtgcaaagcctcgtgtgcctggcagccttcggcctggccctgccgctggccatc agcctcttcccgcaaatgtcagagattgaaacatcccaattagagccggagatagcccag gccacgagcagccggacagtggtgtacaacaaggggttggatgccctggagcagggctac agcagaggcctgtgggccctggccggctccctctgggcagagcccggttcacttagccag agcgccttgatgcctcctgggaagcggtcccagcacgtgtga >gi568815596f:72817952_73033951|GENSCAN_predicted_peptide_6|297_aa MGSPFVAQGGLILGSAILLLWPPKVLGLQLWSAQKIKQAILHPDTNEKIFMPFRMSGICQ GCLLVATSDAQQGKQEGMRPLSQGSELTRCHVLPRAVSQSKLDDQAEPKSEEINSFCDEA VARRIIKGTLWCHGQCPGQQLSGKCSECYWAVSYDVPGGPSTVRGVVGLLLPNQTLASTV FWQWLNQSHNACVNYANRNATKPSPASKFIQGYLGAVISAVSIAVGLNVLVQKANKFTPA TRLLIQRFVPFPAVGVAMAVLPKSGGPTPRSEHSSPGLIPCEQEAAVCTCGSLSWAL >gi568815596f:72817952_73033951|GENSCAN_predicted_CDS_6|894_bp atggggtctccctttgttgcccagggtggactgatcctgggctcagcaatcctcctgctt tggcctcctaaagtgttgggactacagctctggagtgcacagaaaatcaagcaggctatt ctacatccggacaccaatgagaagatcttcatgccatttagaatgtcagggatttgtcag ggatgtcttctcgtggcaacctcagatgctcagcagggcaagcaggaaggcatgaggcct ctgagccagggctcagaactgacacgctgccacgtcctcccacgtgctgtcagtcagagc aagttagatgaccaagcagagccaaaaagtgaggaaataaattccttctgtgatgaggcc gtggcaaggaggattataaagggaaccctgtggtgtcacgggcagtgtcctggacagcag ttgtctggtaaatgcagtgagtgctattgggctgtatcttacgatgttcctggtggtcca agtacagttcgaggggtagtcggtcttctcttgcccaaccagacactggcatccactgtc ttctggcagtggctgaaccagagccacaatgcctgtgtcaactatgcaaaccgcaatgcg accaagccttcacctgcatccaagttcatccagggatacctgggagctgtcatcagcgcc gtctccattgctgtgggccttaatgtcctggttcagaaagccaacaagttcaccccagcc acccgccttctcatccagaggtttgtgccgttccctgctgtaggagtggccatggcagtg ctgcccaagtctggagggcctacccccaggtcagagcacagctccccaggcctcatcccc tgtgagcaggaggcagcagtgtgcacctgtggcagcttgtcctgggccctgtga