GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:27:44 Sequence gi568815589r:20977309_21177869 : 200561 bp : 37.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1031 1146 116 0 2 107 31 81 0.773 3.55 1.02 Intr + 5049 5138 90 2 0 56 85 124 0.924 8.17 1.03 Intr + 8980 9157 178 0 1 121 76 56 0.674 6.27 1.04 Intr + 11024 11121 98 0 2 57 93 57 0.633 1.91 1.05 Intr + 12815 13066 252 1 0 109 100 54 0.769 5.41 1.06 Intr + 15945 16020 76 2 1 77 62 86 0.392 3.07 1.07 Term + 19034 19122 89 0 2 69 55 78 0.224 -0.56 1.08 PlyA + 22001 22006 6 1.05 2.00 Prom + 33097 33136 40 -4.15 2.01 Sngl + 43673 44851 1179 1 0 54 49 371 0.633 25.52 2.02 PlyA + 46074 46079 6 1.05 3.00 Prom + 51469 51508 40 -4.85 3.01 Init + 54653 54692 40 1 1 77 105 13 0.325 2.33 3.02 Intr + 55234 55343 110 2 2 116 108 7 0.967 4.58 3.03 Term + 59476 59598 123 0 0 123 48 81 0.980 5.00 3.04 PlyA + 61930 61935 6 1.05 4.03 PlyA - 61996 61991 6 1.05 4.02 Term - 68368 68180 189 1 0 59 43 137 0.990 2.77 4.01 Init - 70387 70280 108 1 0 62 25 177 0.747 9.07 4.00 Prom - 71867 71828 40 -8.15 5.04 PlyA - 72566 72561 6 1.05 5.03 Term - 73922 73545 378 2 0 75 42 349 0.496 22.80 5.02 Intr - 77409 77336 74 1 2 99 78 42 0.558 2.51 5.01 Init - 81104 80987 118 2 1 82 28 60 0.351 -0.19 5.00 Prom - 82110 82071 40 -7.25 6.00 Prom + 85003 85042 40 -3.35 6.01 Init + 90675 90797 123 2 0 67 68 129 0.817 9.02 6.02 Term + 99031 99129 99 0 0 82 29 117 0.426 2.35 6.03 PlyA + 99276 99281 6 1.05 7.02 PlyA - 99595 99590 6 1.05 7.01 Sngl - 100561 99998 564 1 0 102 49 441 0.824 35.49 7.00 Prom - 105579 105540 40 -0.45 8.00 Prom + 116222 116261 40 -3.65 8.01 Init + 119203 119415 213 0 0 67 88 133 0.278 9.99 8.02 Intr + 129325 129452 128 0 2 18 65 160 0.197 5.26 8.03 Term + 129973 130081 109 1 1 26 35 118 0.129 -2.70 8.04 PlyA + 130610 130615 6 1.05 9.04 PlyA - 131582 131577 6 1.05 9.03 Term - 147212 146909 304 1 1 25 44 905 0.833 73.46 9.02 Intr - 159060 158954 107 0 2 92 105 36 0.525 3.79 9.01 Init - 162576 162520 57 0 0 59 100 36 0.501 3.26 9.00 Prom - 162716 162677 40 -10.25 10.02 PlyA - 163041 163036 6 1.05 10.01 Sngl - 164262 163675 588 0 0 54 44 431 0.794 29.23 10.00 Prom - 175543 175504 40 -3.15 11.02 PlyA - 176481 176476 6 1.05 11.01 Sngl - 189304 188735 570 1 0 59 47 330 0.800 20.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_1|299_aa XEEIQQLCLEIMVTQAQSSQNAAALLGLWVTPPLIHSLSRKDLELYISIAKCLLEMTDDD ANRIAQVTKSNIEKAAFVKLYLVSQGRFPLVNLTDMLSVAVQHREKEVLAWMILHSLYQA RIVSHANTGVLKRMEWLLELMGYIRNVAYQSTSFHNTALDKALDFFLLIFATAVVAWADH TAPLLLGLSASWLPWHQENGPAGPVPSFLGRSPMHRVTLQEVLTLLPNSMALLLQKEPWK EQTQKFIDWLFSIMESPKEALSAQSRDLLKESFLIAGKMSASDLKGKERCLPTDTAEAT >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_1|900_bp ngtgaagagatccagcaactgtgccttgaaattatggtgacccaggcacagtcatcccag aatgcagctgcactattgggcttgtgggtgacaccaccactgatccacagtctgagtaga aaggatctagagctgtatatcagcatagcaaaatgcctcttagaaatgacagatgatgat gccaatcggatcgcccaggttactaagagcaacatagaaaaagctgcctttgtcaaactg tacttagtctctcaaggacgattccccttggtgaacctgaccgatatgctgagcgttgct gtgcagcaccgtgagaaagaggtgttggcctggatgattctgcacagcttataccaggca cggattgtgagccatgccaatacgggcgttttgaagagaatggagtggctcttggaactg atgggttatattagaaatgttgcttaccagtcaacatcctttcacaatacggctcttgac aaggctttggacttcttcttgctgatatttgcaaccgcagtggttgcatgggctgaccac actgcccctctcctcctcggcctcagtgccagttggttgccatggcatcaggagaatggc ccggctgggccagtaccaagcttccttggcaggagtccaatgcacagggtcactctgcag gaggttctcactctccttcccaatagcatggctctgctgctgcagaaagagccatggaag gaacagacccagaagttcattgactggctattcagcatcatggaaagccctaaagaagcc ctctcagcacagtccagggatcttttgaaagaatcattcctcatagctggcaagatgtct gccagtgacctcaaagggaaagaaaggtgtctgcccacagatacagcagaagcaacatga >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_2|392_aa MHRANRVQQHIKKLIHHDQVGFIPGMQGWFNILRSINVIHHINRTKDKNHMIISIDAEKA FDKIQQPFMLKNLHKLGIDGTYLKIIRAIYDKPTANIILNWQKLEAFPMKTGTRQGCPLS PLLFNIMLEVLARAIRQEKEIKGIQLGKEEVKLSLSADDMIVYLENPIVSAQNLLKLISN FSKVSGYRINAQNSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLKRDVKDLFKEN YKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTT LKFIWNQKRARITKTILSQKNKARGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRREP SEVIPHIYNHLIFDKPDKNNKWGKDSLFNKWC >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_2|1179_bp atgcacagggcaaaccgagtccagcagcacatcaaaaagcttatccaccacgatcaagtt ggcttcatccctgggatgcaaggctggttcaacatactcagatcaataaacgtaatccat catataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcc tttgacaaaattcagcagcccttcatgctaaaaaatctccataagctaggtattgatggg acgtatctcaaaataataagagctatttatgacaaaccgacagccaatatcatactgaat tggcaaaaactggaagcattccctatgaaaactggcacaagacagggatgccctctctca ccactcctattcaacataatgttggaagttctggccagggcaatcaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattgtctctgtctgcagatgacatg attgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacagaatcaatgcgcaaaactcacaagcattcctatatacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttaaaagggatgtgaaggacctcttcaaggagaac tacaaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttataga ttcaatgccatccccatcaagctaccaatgactttcttcacggaactggaaaaaactact ttaaagttcatatggaaccaaaaaagagcccgcatcaccaagacaatcctaagccaaaag aacaaagctagaggcatcacgctacctgacttcaaactatactacaaggctacagtaacc aaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaagagagccc tcagaagtaataccacacatctacaaccatctgatctttgacaaacctgacaaaaacaac aaatggggaaaggattccctatttaataaatggtgctag >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_3|90_aa MSGFQGGLCSMPTDDPPSCFANKIQPIFKLLCPERETLKQKQKHLRRHLQASIRSSFSLP SPHPPGSSFGNFCAGTALSLPHCALNLVLP >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_3|273_bp atgagtggattccagggtgggctctgctcaatgcccaccgatgacccaccctcctgcttt gctaataagatccagcccatcttcaagctcctctgcccagaaagggaaacgttaaaacaa aaacagaaacatctaagaaggcacttgcaagcctccattcgttcttcattctctctgcca tctcctcatccccctggcagttcattcggaaacttctgtgcaggcactgccctgtctctc ccacattgtgccctgaatttggtgctaccctaa >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_4|98_aa MRTQEGEGDMVVDSSGALRAQQQEVVNNWQKSQEDEALDKKQSTAMVPERRITPAFCLGE VSKPWHREVNSYWKAALLLERENKYWSSRRLKRLEVCY >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_4|297_bp atgaggacccaagaaggagaaggagatatggtagtggacagcagcggagcactgagggct caacagcaggaagtggttaacaactggcagaaaagccaggaagatgaggcactggacaag aaacagtccacggctatggtccctgagagaaggataacaccagctttctgcctaggggaa gtttccaagccatggcacagggaagtgaattcctattggaaagcagcactcctgctggag agagaaaacaaatattggagttcaaggcggctgaagaggctagaagtgtgctattag >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_5|189_aa MCHGGLLHRSFHHLGIKPNIPLAVLPDALPPSPYPPTGPSKGVPLWLRIRAHYGVSEMSP EFPETMKVMLDKKQVQAIFLFKYKMGGKAAVFATLTMHLAQELLMNVVQWWFKKFCKREE NLEGEERSGQSSEVDNKLRVLIEADPLTTTQEVAEELNVSHSTVVCHLKQTGKVTKFNKW VPHELTKNF >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_5|570_bp atgtgccatggcggtttgctgcacagatcattccatcacctgggtattaagcccaacatt ccattagctgttcttcctgatgctctccctccttctccttaccctccaacaggccccagt aaaggggttccactatggctacgcattagagcccactatggagtttccgaaatgtctcca gaatttccagagactatgaaagtgatgttggacaaaaaacaagttcaagcaattttctta ttcaagtacaaaatgggtggtaaagcagcggtgtttgcaacattgacaatgcatttggcc caggaactgctaatgaatgtagtgcagtggtggttcaagaagttttgcaaaagagaggag aaccttgaaggtgaggagcgtagtggccagtcatcggaagttgacaacaaattgagagta ctcatcgaagctgatcctcttacaactacacaagaagttgccgaagaactcaatgtcagc cattctacagttgtttgccatctgaagcaaactggaaaggtgacaaagttcaataagtgg gtgcctcatgagctgaccaaaaatttttaa >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_6|73_aa MLLLNKMMSPWKLPQLDSHNPFSEPLEGFVALSECSQVIPLELCGVRNAEQQKTGQDIPG ILYEQGSGDWKQA >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_6|222_bp atgctgctgctgaataagatgatgtctccttggaagttaccacagcttgatagccataat cccttctcagaacctttagaaggctttgtggccctgtctgagtgttcccaagttattccg ctggaactatgtggcgtccgcaatgcagagcaacagaaaactggacaggacattcctgga attctgtatgaacaaggcagtggggactggaagcaggcttag >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_7|187_aa MTNKCLLQIALLLCFSTTALSMSYNLLGFLQRSSNFQCQKLLWQLNGRLEYCLKDRMNFD IPEEIKQLQQFQKEDAALTIYEMLQNIFAIFRQDSSSTGWNETIVENLLANVYHQINHLK TVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYLKAKEYSHCAWTIVRVEILRNFYFINR LTGYLRN >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_7|564_bp atgaccaacaagtgtctcctccaaattgctctcctgttgtgcttctccactacagctctt tccatgagctacaacttgcttggattcctacaaagaagcagcaattttcagtgtcagaag ctcctgtggcaattgaatgggaggcttgaatactgcctcaaggacaggatgaactttgac atccctgaggagattaagcagctgcagcagttccagaaggaggacgccgcattgaccatc tatgagatgctccagaacatctttgctattttcagacaagattcatctagcactggctgg aatgagactattgttgagaacctcctggctaatgtctatcatcagataaaccatctgaag acagtcctggaagaaaaactggagaaagaagatttcaccaggggaaaactcatgagcagt ctgcacctgaaaagatattatgggaggattctgcattacctgaaggccaaggagtacagt cactgtgcctggaccatagtcagagtggaaatcctaaggaacttttacttcattaacaga cttacaggttacctccgaaactga >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_8|149_aa MRVMQQVKRKLNVNCRIGIEEWSGYSPLPWCLPEASAAYLHKAEDSSGCLEDASVILVKT FSSSKGRENQEGLPGQTPDWISSAAGRPGDLLGIEDGKQESALEIEGPTLAIKSHALALV TDLEYEMHNPCPYGIHITAKDTAIEEVII >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_8|450_bp atgagggtgatgcaacaagtgaaaaggaaactaaatgtgaattgtcgaattggaattgaa gaatggagcgggtattcccctttgccctggtgcttgcctgaggcaagtgctgcttatctt cacaaagcagaagattcttctggctgcctggaagatgcctctgttattttagtgaaaaca tttagctcctcgaaaggtagagagaaccaagagggccttcctggacaaactccagactgg atttcatcagcagctggaagacctggagacctgcttggtatagaggatgggaagcaagag tctgccctggaaattgagggccctacactggccataaagagtcatgctttggccttggtg acagatcttgagtatgagatgcataatccttgcccttatggaattcacattacagcaaag gacacagccattgaagaagttataatctag >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_9|155_aa MGKNEGESNEGKTVVLSGKSYSIQHFNSIDLNDMDVNLHGNFVHIDFHGLSRGPSTRSPT REIRSKKKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKEEEEEEEEDEDEDEEEEEE EEEEEEEEEEEEEEEEEEEIKGIQRGKEEVISNCR >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_9|468_bp atgggaaagaatgagggagagtctaatgaggggaagacagtggtactttcagggaagagc tactccattcagcatttcaatagcattgacctgaatgacatggatgtgaacttacatggc aactttgttcacattgactttcacggtttgagtaggggtccaagtactagaagtcctacc agagaaatcagatcaaaaaaaaaggaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagag aaagaggaagaggaagaggaagaagaagacgaagacgaagacgaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaggaagaagaagaagaagaagaagaaata aaaggcatccaaaggggaaaggaagaagttatctctaattgtagatga >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_10|195_aa MALLFPLLAALVMTSYSPVGSLGCDLPQNHGLLSRNTLVLLHQMRRISPFLCLKDRRDFR FPQEMVKGSQLQKAHVMSVLHEMLQQIFSLFHTERSSAAWNMTLLDQLHTGLHQQLQHLE TCLLQVVGEGESAGAISSPALTLRRYFQGIRVYLKEKKYSDCAWEVVRMEIMKSLFLSTN MQERLRSKDRDLGSS >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_10|588_bp atggccctcctgttccctctactggcagccctagtgatgaccagctatagccctgttgga tctctgggctgtgatctgcctcagaaccatggcctacttagcaggaacaccttggtgctt ctgcaccaaatgaggagaatctcccctttcttgtgtctcaaggacagaagagacttcagg ttcccccaggagatggtaaaagggagccagttgcagaaggcccatgtcatgtctgtcctc catgagatgctgcagcagatcttcagcctcttccacacagagcgctcctctgctgcctgg aacatgaccctcctagaccaactccacactggacttcatcagcaactgcaacacctggag acctgcttgctgcaggtagtgggagaaggagaatctgctggggcaattagcagccctgca ctgaccttgaggaggtacttccagggaatccgtgtctacctgaaagagaagaaatacagc gactgtgcctgggaagttgtcagaatggaaatcatgaaatccttgttcttatcaacaaac atgcaagaaagactgagaagtaaagatagagacctgggctcatcttga >gi568815589r:20977309_21177869|GENSCAN_predicted_peptide_11|189_aa MALSFSLLMAVLVLSYKSICSLGCDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFG FPQEEFDGNQFQKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDLE ACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLTEKKYSPCAWEVVRAEIMRSFSLSKI FQERLRRKE >gi568815589r:20977309_21177869|GENSCAN_predicted_CDS_11|570_bp atggccctgtccttttctttactgatggccgtgctggtgctcagctacaaatccatctgt tctctgggctgtgatctgcctcagacccacagcctgggtaataggagggccttgatactc ctggcacaaatgggaagaatctctcctttctcctgcctgaaggacagacatgactttgga ttcccccaggaggagtttgatggcaaccagttccagaaggctcaagccatctctgtcctc catgagatgatccagcagaccttcaatctcttcagcacaaaggactcatctgctacttgg gaacagagcctcctagaaaaattttccactgaacttaaccagcagctgaatgacctggaa gcctgcgtgatacaggaggttggggtggaagagactcccctgatgaatgtggactccatc ctggctgtgaagaaatacttccaaagaatcactctttatctgacagagaagaaatacagc ccttgtgcctgggaggttgtcagagcagaaatcatgagatccttctctttatcaaaaatt tttcaagaaagattaaggaggaaggaatga