GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:25:28 Sequence gi568815596f:72787433_72991534 : 204102 bp : 46.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 38463 38366 98 0 2 89 80 177 0.815 16.79 1.00 Prom - 49399 49360 40 -4.56 2.00 Prom + 60016 60055 40 -2.26 2.01 Init + 63597 63738 142 2 1 46 50 111 0.225 3.40 2.02 Term + 87761 87945 185 0 2 71 51 109 0.674 3.21 2.03 PlyA + 89090 89095 6 1.05 3.00 Prom + 97090 97129 40 -4.26 3.01 Init + 100001 100304 304 1 1 84 94 591 0.975 54.74 3.02 Intr + 100882 101172 291 2 0 44 96 344 0.968 27.91 3.03 Term + 103915 104105 191 2 2 129 46 347 0.999 32.21 3.04 PlyA + 104704 104709 6 1.05 4.00 Prom + 110260 110299 40 -3.86 4.01 Init + 130421 130940 520 1 1 26 96 799 0.396 67.53 4.02 Intr + 132647 133078 432 0 0 39 -68 363 0.036 9.42 4.03 Intr + 133593 133691 99 1 0 98 16 91 0.042 2.98 4.04 Intr + 136877 137061 185 0 2 1 89 413 0.067 32.21 4.05 Intr + 138176 138271 96 1 0 119 2 63 0.404 1.01 4.06 Term + 146355 146522 168 2 0 94 46 352 0.999 29.48 4.07 PlyA + 146894 146899 6 1.05 5.07 PlyA - 147074 147069 6 -3.64 5.06 Term - 149953 149830 124 0 1 124 55 91 0.742 7.26 5.05 Intr - 157667 157594 74 2 2 144 62 84 0.907 9.60 5.04 Intr - 173816 173699 118 1 1 136 78 97 0.947 13.97 5.03 Intr - 181101 181016 86 0 2 116 81 176 0.926 18.32 5.02 Intr - 184253 184138 116 0 2 121 44 215 0.904 20.57 5.01 Init - 184643 184520 124 2 1 74 123 34 0.692 6.05 5.00 Prom - 190103 190064 40 -3.56 6.03 PlyA - 194514 194509 6 1.05 6.02 Term - 195688 195558 131 2 2 102 44 69 0.120 2.24 6.01 Intr - 200916 200826 91 0 1 80 82 73 0.649 5.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 136732 137061 330 0 0 93 89 380 0.819 33.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_1|33_aa MAEAESLETAAEHERILREIESTDTACIGPTLS >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_1|99_bp atggcggaggcggagagcctggagacagcggcagagcacgagcggatcctgcgagagatc gagagcactgacacggcctgcatcgggcccacgctcagn >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_2|108_aa MHGCLERPSSIHSCTNGGFLGHSLDVEAEYTGSSMVCPQDEKEETHTATLAADYMVPTQI EGGSASPSPLTQTLISFGNTLTDTQEQYFASFDPIKLTLSINHHNICI >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_2|327_bp atgcatggatgccttgaacgtccaagctccatccacagctgcacaaacggcggcttcctt ggccactctttagatgtagaggctgaatataccggcagcagcatggtctgccctcaggat gagaaggaagagacccacacagccacactggcagctgattacatggtgcccacccagatt gagggtgggtcagcctctcccagtccactgactcaaacgttaatctcctttggcaacacc ctcacagacacccaggaacaatactttgcatccttcgatccaatcaagttgacactcagt attaaccatcacaacatctgcatctga >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_3|261_aa MEGGLGRAVCLLTGASRGFGRTLAPLLASLLSPGSVLVLSARNDEALRQLEAELGAERSG LRVVRVPADLGAEAGLQQLLGALRELPRPKGLQRLLLINNAGSLGDVSKGFVDLSDSTQV NNYWALNLTSMLCLTSSVLKAFPDSPGLNRTVVNISSLCALQPFKGWALYCAGKAARDML FQVLALEEPNVRVLNYAPGPLDTDMQQLARETSVDPDMRKGLQELKAKGKLVDCKVSAQK LLSLLEKDEFKSGAHVDFYDK >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_3|786_bp atggagggcgggctggggcgtgctgtgtgcttgctgaccggggcctcccgcggcttcggc cggacgctggccccgctcctggcctcgctgctgtcgcccggctccgtgcttgtccttagc gcccgcaacgacgaggcactgcgccagctggaggccgagctgggcgccgagcggtctggc ctgcgcgtggtgcgggtgcccgccgacctgggcgccgaggccggcttgcagcagctgctc ggcgccctgcgcgagctcccccggcccaaggggctgcagcgactgctgcttatcaacaac gcgggctctcttggggatgtgtccaaaggcttcgtggacctgagtgactccactcaagtg aacaactactgggcactgaacttgacctccatgctctgcctgacttccagcgtcctgaag gccttcccggacagtcctggcctcaacagaaccgtggttaacatctcgtccctctgtgcc ctgcaacctttcaaaggctgggcgctgtactgtgcaggaaaggctgctcgtgatatgctg ttccaggtcctggcgctggaggaacctaatgtgagggtgctgaactatgccccaggtcct ctggacacagacatgcagcagttggcccgggagacctccgtggacccagacatgcgaaaa gggctgcaggagctgaaggcaaaggggaagctggtggattgcaaggtgtcagcccagaaa ctgctgagcttactggaaaaggacgagttcaagtctggagcccacgtggacttctatgac aaataa >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_4|499_aa MCLAGCTPRKAAAPGRGALPRARLPRTAPAAATMFQPAAKRGFTIESLVAKDGGTGGGTG GGGAGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFP EAMNHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQAVQLVRS RHPGPHPRLRIQPLTPSAPRFLPPGAEGRGRLERNPAPALSVSSQARRINPSQNPLGNCR RFLSHQSQRVRYRVAQTVGFFPQEPSLDAEMEPSVAAFSDPEIRWALKDFSNPVALLLRG PSSPLQTDALPGGSTPSTLGPSAQPSGPEPLAGGGLSSEDCGSKTAARTPASDVPQDGLL LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQSRPYYTTV PSQVPGSEKMPGMGGSRSLPLAARVKVWFQNRRTKYKRQKLEEEGPESEQKKKGSHHINR WRIATKQANGEDIDVTSND >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_4|1500_bp atgtgcctggctgggtgcacaccccgcaaggcggcggcgccaggacgcggagcgctcccc agagcccggctgcctcgcacagctcccgcggctgcgaccatgttccagcccgcggccaag cgcggctttaccatagagtccttggtggccaaggacggcggcaccggcgggggcactggc ggcgggggcgcgggctcccatctcctggcggcggccgcctccgaggaaccgctccggccc acggcgctcaactaccctcaccccagcgcggccgaggcggccttcgtgagtggcttccct gccgcggccgccgcgggcgcgggccgctcgctctacggtgggcccgagctcgtgttcccc gaggccatgaaccaccccgcgctgaccgtgcatccggcgcaccagctgggcgcctccccg ctgcagcccccgcactccttcttcggcgcccagcaccgggaccctctccatttctacccc tgggtcctgcggaaccgcttcttcggccaccgcttccaggccgttcagcttgtccggagt cggcatcctgggccgcaccctcggcttcgaatccagcccctgacgccctccgcaccgcgg ttcctgcctccgggcgccgagggccgggggcgcctggagagaaatccagctccggctctg agcgtctccagtcaggcgaggcggataaatccttcgcaaaaccctcttggaaattgccgc cgcttcctgagccatcagtcccagcgggtacgttatcgagtagcacaaacagttggattt ttccctcaagaaccgagtctggacgcggagatggagccaagtgtggctgcattttcggac ccggaaatccgttgggcactgaaggacttttcgaaccctgtagcgctgttgcttcgcggt ccatcgtcgccgctgcagacggatgcgctccccggcggctctacgccctccaccctaggg ccctccgcccagccgtccggccctgagcccctggccggcggcggcctctccagcgaagac tgcggctcgaagactgcagctcggaccccggccagcgacgtgccccaggacgggctgctt ctgcacggccccttcgcacgcaagcccaagcggatccgcacggccttctcgccctcgcag ctgctgcggctggagcgcgccttcgagaagaaccactacgtggtgggcgccgagcggaag cagctggccggcagtctcagcctctccgagacgcagtctcgaccctactacaccaccgtc ccctcccaagtcccgggcagtgagaagatgcccggcatggggggcagccggagcctccct ttagcagccagagtgaaggtgtggttccagaaccggaggacaaagtacaaacggcagaag ctggaggaggaagggcctgagtccgagcagaagaagaagggctcccatcacatcaaccgg tggcgcattgccacgaagcaggccaatggggaggacatcgatgtcacctccaatgactag >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_5|213_aa MVEHQASGSGGMRLQVREPSQMDTHFACLVGLPWRRGKQPLASANICNVVLMRYGELEEG IDVLDSDGNLVGSSKIAARHALLETALTRVVLPMPILVLPPIVMSMLEKTALLQARPRLL LPVQSLVCLAAFGLALPLAISLFPQMSEIETSQLEPEIAQATSSRTVVYNKGLDALEQGY SRGLWALAGSLWAEPGSLSQSALMPPGKRSQHV >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_5|642_bp atggtggaacatcaggcctcggggtcaggggggatgcgacttcaggtcagagagccctcc cagatggacacccactttgcctgcctagtgggcttgccctggaggagaggaaagcagccc ctcgccagtgccaatatctgcaatgtggtcctgatgcggtacggggagctggaggaaggg attgatgtcctggacagcgatggcaacctcgtgggctcctccaagatcgcagcccgacac gccctgctggagacggcgctgacgcgagtggtcctgcccatgcccatcctggtgctaccc ccgatcgtcatgtccatgctggagaagacggctctcctgcaggcacgcccccggctgctc ctccctgtgcaaagcctcgtgtgcctggcagccttcggcctggccctgccgctggccatc agcctcttcccgcaaatgtcagagattgaaacatcccaattagagccggagatagcccag gccacgagcagccggacagtggtgtacaacaaggggttggatgccctggagcagggctac agcagaggcctgtgggccctggccggctccctctgggcagagcccggttcacttagccag agcgccttgatgcctcctgggaagcggtcccagcacgtgtga >gi568815596f:72787433_72991534|GENSCAN_predicted_peptide_6|73_aa VGLNVLVQKANKFTPATRLLIQRFVPFPAVGVAMAVLPKSGGPTPRSEHSSPGLIPCEQE AAVCTCGSLSWAL >gi568815596f:72787433_72991534|GENSCAN_predicted_CDS_6|222_bp gtgggccttaatgtcctggttcagaaagccaacaagttcaccccagccacccgccttctc atccagaggtttgtgccgttccctgctgtaggagtggccatggcagtgctgcccaagtct ggagggcctacccccaggtcagagcacagctccccaggcctcatcccctgtgagcaggag gcagcagtgtgcacctgtggcagcttgtcctgggccctgtga