GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:58:06 Sequence gi568815581r:29164542_29393892 : 229351 bp : 44.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 714 709 6 -0.45 1.01 Sngl - 2399 1338 1062 2 0 76 42 1408 0.776 132.16 1.00 Prom - 10824 10785 40 -7.36 2.00 Prom + 11216 11255 40 -4.56 2.01 Init + 12419 12479 61 1 1 96 55 36 0.798 2.45 2.02 Term + 15517 16058 542 2 2 113 54 165 0.756 10.02 2.03 PlyA + 19017 19022 6 1.05 3.05 PlyA - 19078 19073 6 1.05 3.04 Term - 27342 27133 210 0 0 32 48 97 0.137 -2.61 3.03 Intr - 32619 32581 39 0 0 88 75 46 0.312 1.72 3.02 Intr - 33258 33180 79 2 1 118 86 56 0.641 7.95 3.01 Init - 35102 35062 41 2 2 52 63 77 0.711 1.29 3.00 Prom - 36921 36882 40 -5.76 4.02 PlyA - 37038 37033 6 1.05 4.01 Sngl - 39933 39136 798 0 0 99 48 336 0.979 26.56 4.00 Prom - 53982 53943 40 -4.66 5.00 Prom + 55671 55710 40 -0.26 5.01 Init + 60827 60910 84 1 0 77 80 69 0.503 5.82 5.02 Intr + 76092 76118 27 2 0 100 94 -1 0.138 0.01 5.03 Intr + 77056 77144 89 0 2 108 58 62 0.218 3.97 5.04 Intr + 84601 84665 65 1 2 88 121 85 0.098 10.16 5.05 Intr + 85612 85759 148 2 1 87 77 82 0.338 6.29 5.06 Intr + 87523 87664 142 1 1 50 68 88 0.878 3.36 5.07 Intr + 89099 89234 136 1 1 23 9 207 0.697 6.44 5.08 Term + 89661 89812 152 1 2 66 38 171 0.873 7.97 5.09 PlyA + 89928 89933 6 1.05 6.06 PlyA - 90987 90982 6 1.05 6.05 Term - 100050 99998 53 1 2 81 49 47 0.304 -2.31 6.04 Intr - 102989 102957 33 0 0 112 116 3 0.262 3.69 6.03 Intr - 123175 121451 1725 2 0 123 88 876 0.852 78.42 6.02 Intr - 129446 129242 205 2 1 47 41 201 0.002 10.07 6.01 Init - 139433 139341 93 2 0 46 80 28 0.013 -1.82 6.00 Prom - 170064 170025 40 -2.36 7.00 Prom + 170733 170772 40 -2.96 7.01 Init + 172560 172577 18 2 0 81 116 4 0.644 2.83 7.02 Term + 179322 179465 144 2 0 81 28 103 0.909 1.51 7.03 PlyA + 180677 180682 6 1.05 8.00 Prom + 180993 181032 40 -2.66 8.01 Init + 181974 182043 70 2 1 65 58 112 0.590 5.11 8.02 Intr + 204464 204594 131 0 2 10 72 54 0.001 -3.59 8.03 Intr + 226415 226483 69 1 0 112 86 64 0.006 7.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_1|353_aa MFNLMKKDKDKDGGRKEKKEKKEKKERMSAAELRSLEEMSLRRGFFNLNRSSKRESKTRL EISNPIPIKVASGSDLHLTDIDSDSNRGSVILDSGHLSTASSSDDLKGEEGSFRGSVLQR AAKFGSLAKQNSQMIVKRFSFSQRSRDESASETSTPSEHSAAPSPQVEVRTLEGQLVQHP GPGIPRPGHRSRAPELVTKKFPVDLRLPPVVPLPPPTLRELELQRRPTGDFGFSLRRTTM LDRGPEGQACRRVVHFAEPGAGTKDLALGLVPGDRLVEINGHNVESKSRDEIVEMIRQSG DSVRLKVQPIPELSELSRSWLRSGEGPRREPSDVSASLGTWAKQGWGMWAPPA >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_1|1062_bp atgtttaacctaatgaagaaagacaaggacaaagatggcgggcggaaggagaagaaggag aaaaaggagaaaaaggagcggatgtcagcggcagagcttcggagcctggaggagatgagc ctgcgacgtggcttcttcaacctgaaccgctcctccaagcgtgaatccaagacgcgcctg gaaatctccaaccccatccccatcaaggtggccagcggctctgacctgcacctgactgac attgactccgatagtaaccggggcagcgtcatcctggactcgggccacctaagtacagcc agctccagcgatgacctcaagggtgaggagggtagcttccgtggctcggtgctgcagcgg gcagccaagttcggctcactggccaagcagaactcacagatgattgtcaagcgcttttcc ttctcccagcgtagccgggatgagagcgcctcagaaacctcgacgccctcagagcactct gccgccccctcgccacaggtggaggtgaggactctagagggacagctggtgcagcatcct ggcccaggcatccctcgaccagggcaccgatcccgagcccctgagctagtgactaaaaag ttcccagtcgacctgcgcctgccccccgtggtgcccctgcccccacctaccctccgggag ctggagctgcaacgacggcccactggagactttggcttctccctgcggcgcacaaccatg ctggatcggggccccgagggccaggcctgtcggcgtgtggtccactttgctgagcctggt gcaggcaccaaggacctggccctggggctggtgccaggagatcgactggtggagattaat gggcacaatgtggagagcaagtccagggatgagattgtggagatgatccggcagtcaggg gacagcgtgcggctcaaggtgcagcccattccagagctcagcgagctcagcaggagctgg ctgcggagcggcgagggacctcgcagggagccatccgatgtgagtgcttccctgggcacc tgggctaagcagggatggggaatgtgggcacctcctgcttga >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_2|200_aa MEERERESSCGLARFPAALPGWLEGPVVPPSPPLPIGPHPRASSSTAPPGTGAGSPGRCF QGTGEEEEEPAGPAARPPSLPAGARPAAPPSRARHRAAAAADRHSPATGARPAGSRSGDA ASSSLGPVRDGAERSAAQLSAAPSDRAGRAERELPAGGCGDKRPACLAAPVRAARRGPGG PGAALAASVRCGLRSPSRCV >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_2|603_bp atggaggaacgcgagcgggagagctcctgcggcctcgccaggttccctgctgctcttcca ggttggctggaagggccggtggttcccccttccccgccgctcccgatcggcccccacccc cgggcgtcgagctccacggcgccgccagggacgggagccgggagtccgggccgctgcttc caggggacgggggaggaggaggaggagccggcgggccccgccgcccgcccgccctccctc ccggccggagcccgccccgccgccccaccaagccgggcccgccaccgagcggccgccgcc gccgaccggcactcaccggccaccggagcccgcccggcaggcagcagaagcggagacgcg gcatccagcagcctcggcccggtcagggatggagcagagcgcagcgcggcgcagctcagc gccgcccccagcgaccgcgcaggccgagccgaaagggagctccctgctggcggctgtggg gataaacgcccggcctgcctggcagcgccagtgcgcgccgccagacgtgggccaggcggg ccgggcgctgccctggcagcctccgtccgctgtggactccgaagcccctctcgctgtgtc tga >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_3|122_aa MPVAILVDCVLQDQNHKMSDWSFLGWLLTRVQNDSTVVGKSHEEENLPELHVQRKLFTIK SFTESVLTPELGHWEPLRNPKPRSDMIIFVLWKDNSGSHMEDQLEGMKRKQATTRRPLLL SS >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_3|369_bp atgcctgtggccatcctagtggactgcgtcctccaggaccaaaatcacaagatgagcgac tggtcattcctgggctggctcctgacccgagtgcagaacgattccaccgtggttggcaag agccatgaggaagaaaatcttcctgaacttcatgtgcagcgtaaactatttaccatcaag tcctttacagaaagtgtgctgacccctgagctgggccattgggaacctttgaggaatccc aagccaagaagtgacatgatcatatttgtattatggaaagacaactctggcagccacatg gaggaccagttagaggggatgaagcggaagcaggcgaccacgagaaggccgttgctattg tccagctaa >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_4|265_aa MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPL LEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDE IFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAF PISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTTNTELKDLPKRIPKDSARYHFFL YKHSHEGDYLESIVLFIQCLDTHAV >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_4|798_bp atgtcccaccagaccggcatccaagcaagtgaagatgttaaagagatctttgccagagcc agaaatggaaagtacagacttctgaaaatatctattgaaaatgagcaacttgtgattgga tcatatagtcagccttcagattcctgggataaggattatgattcctttgttttacccctg ttggaggacaaacaaccatgctatatattattcaggttagattctcagaatgcccaggga tatgaatggatattcattgcatggtctccagatcattctcatgttcgtcaaaaaatgttg tatgcagcaacaagagcaactctgaagaaggaatttggaggtggccacattaaagatgaa atatttggaacagtaaaggaagatgtatcattacatggatataaaaaatacttgctgtca caatcttcccctgccccactgactgcagctgaggaagaattacgacagattaaaatcaat gaggtacagactgacgtgggtgtggacactaagcatcaaacactacaaggagtagcattt cccatttctcgagaagcctttcaggctttggaaaaattgaataacagacagctcaactat gtgcagttggaaatagatataaaaaatgaaattataattttggccaacacaacaaataca gaactgaaagatttgccaaagaggattcccaaggattcagctcgttaccatttctttctg tataaacattcccatgaaggagactatttagagtccatagttttatttattcaatgcctg gatacacatgcagtataa >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_5|280_aa MVYSMRSSKTGIQKEKNHNSGGKAADKEDAVVGWSAWRQGQGVKAIHKIPLSIYAAWRVA LVSRSPRNPSNHQDGSDQPYAGVPGAMEALLRHSISFQITIYDQENFQGKRMEFTSSCPN VSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFR PICSANHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQTGFATNILDI VGISISWNVTIMEETINIGESGALMPRLRRSNRFAESNSS >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_5|843_bp atggtgtatagcatgagaagctccaaaacaggaattcagaaggagaaaaatcacaacagt ggagggaaggcagcagacaaagaggatgctgtggttggctggtcagcctggaggcaaggc cagggagtcaaagccatccacaaaattccactgagcatctatgctgcttggcgtgtagcc ctggtgtccagaagcccaagaaacccttccaaccaccaagatggctcagaccaaccctac gccggggtccctggggccatggaagctctcttgcgccattcaatctcatttcagataacc atctatgatcaggagaactttcagggcaagaggatggagttcaccagctcctgtccaaat gtctctgagcgcagttttgataatgtccggtccctgaaggtggaaagtggcgcctggatt ggttatgagcataccagcttctgtgggcaacagtttatcctggagagaggagaataccct cgctgggatgcctggagtgggagtaatgcctaccacattgagcgtctcatgtccttccgc cccatctgttcagctaatcataaggagtctaagatgaccatctttgagaaggaaaacttt attggacgccagtgggagatctctgacgactacccctccttgcaagccatgggctggttc aacaacgaagtcggctccatgaagatacaaactgggtttgctaccaatatcctggatatc gtgggtatcagtatatcttggaatgtgaccatcatggaggagactataaacattggagag agtggggctctcatgcccagacttcgcagatccaatcgattcgccgaatccaacagtagc tga >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_6|702_aa MEKLPVKSHEELSSSVQHLMGLKTKSYQQPWQQQQQPHHHHHYYFYNHSHNHHHHHHHQQ PHQYLQHGAEGSPKAQPKPLKHEQKHTLQQHQETPKKKTGYGELNGNAGEREISLKNLSS DEATNPISRVLNGNQQVVDTSLKQTVKANTFGKAGIKTKNFIQKNSMDKKNGKSYENKSG ENQSVDKSDTIPIPNGVVTNNSGYITNGYMGKGADNDGSGSESGYTTPKKRKARRNSAKG CENLNIVQDKIMQQETSVPTLKQGLETFKPDYSEQKGNRVDGSKPIWKYETGPGGTSRGK PAVGDMLRKSSDSKPGVSSKKFDDRPKGKHASAVASKEDSWTLFKPPPVFPVDNSSAKIV PKISYASKVKENLNKTIQNSSVSPTSSSSSSSSTGETQTQSSSRLSQVPMSALKSVTSAN FSNGPVLAGTDGNVYPPGGQPLLTTAANTLTPISSGTDSVLQDMSLTSAAVEQIKTSLFI YPSNMQTMLLSTAQVDLPSQTDQQNLGDIFQNQWGLSFINEPSAGPETVTGKSSEHKVME VTFQGEYPATLVSQGAEIIPSGTEHPVFPKAYELEKRTSPQVLGSILKSGTTSESGALSL EPSHIGDLQKADTSSQGALVFLSKDYEIESQNPLASPTNTLLGSAKEQRYQRGLERNDSW GSFDLRAAIVYHTKEMESIWNLQKQDPKRIITYNEAMDSPDQ >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_6|2109_bp atggagaaacttccagtgaaaagtcatgaggaactgagttcttcagtccaacacctcatg gggttgaaaactaagtcctaccaacaaccatggcagcagcagcagcagccgcaccaccac caccattattatttctacaaccacagccacaaccaccaccaccaccatcatcaccagcag cctcaccaatacctgcagcatggagccgagggcagccccaaggcccagccaaagccgctg aaacatgagcagaaacacaccctccagcagcaccaggaaacgccgaagaagaaaacaggc tatggtgaactaaacggtaatgctggagaaagagaaatatctttaaagaacctgagttct gatgaagccaccaaccctatttccagggtcctcaatggcaaccagcaagttgtagacact agcctgaagcagactgtaaaggccaacacctttgggaaagcaggaattaaaaccaagaat ttcattcagaaaaacagtatggacaaaaagaatgggaagtcttatgaaaataaatctgga gagaatcagtctgtagataagtctgatactataccaattccaaatggtgtggtaacaaat aattctggttatattactaatggttatatgggtaaaggagcagataatgatggtagtgga tctgagagcggatatacaactcctaaaaaaaggaaagctaggcgcaatagtgccaagggt tgtgaaaaccttaatatagtgcaggacaaaataatgcaacaagagaccagtgtcccaacc ttaaaacagggacttgaaactttcaagcctgactatagtgaacaaaagggaaatcgagta gatggttcgaagcccatttggaagtatgaaactgggcctggaggaacaagtcgaggaaaa cctgctgtgggtgatatgcttcggaaaagctcagatagtaaacctggtgtgagcagcaaa aagtttgatgatcggcccaaaggaaagcatgcttcagctgttgcctccaaagaggactcg tggaccctatttaaaccacccccagtttttccagtggacaatagcagtgctaaaatagtt cctaaaataagttatgcaagcaaagttaaggaaaacctcaacaaaactatacagaactct tctgtgtcaccaacttcatcttcatcatcttcatcatctaccggggaaactcagacccaa tcatcaagtcgcttatcccaggtccctatgtcagcgctgaaatctgttacttctgccaac ttttctaatgggcctgttttagcagggactgatggaaatgtttatcctccagggggtcag ccactgctaactactgctgctaatactctaacacccatctcttctgggacagattcagtt ctccaggacatgagtctaacttcagcagctgttgaacaaattaagactagcctttttatc tatccttcaaatatgcaaactatgctgttgagcacagcacaagtggatctgccctctcag acagatcagcaaaacctgggggatatcttccagaatcagtggggtttatcatttataaat gagcccagtgctggccctgagactgttactgggaagtcatcagagcataaagtgatggag gtgacatttcaaggagaatatcctgctactttggtttcacagggtgctgaaataattccc tcaggaactgagcatcctgtgtttcccaaggcttacgagctggagaaacggactagtcct caagttctgggtagcattctaaaatctgggactactagtgagagtggagccttatccttg gaacccagtcatataggtgacctgcagaaagcagacaccagtagtcaaggtgctttagtg tttctctcaaaggactacgagatagaaagtcaaaatcctctggcctctcctacgaacact ttgttaggctctgccaaagaacagagataccagagaggcctagaaaggaatgatagctgg ggttcttttgacctgagggctgctattgtatatcacactaaagaaatggaatctatttgg aatttgcagaagcaagatcccaaaaggataatcacttacaatgaagccatggatagtcca gatcaatga >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_7|53_aa MNVRVKISDLISHNQGNTMMMLEKGPNPDPKRGVLGSHAERNSRQVTECSKKR >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_7|162_bp atgaatgtgagagtcaagatttctgatctaatcagccacaaccagggtaacacaatgatg atgctggaaaaaggtcccaacccagaccctaaaagaggggttcttggatctcatgcagaa aggaattcaaggcaagtcacagagtgcagcaagaagagataa >gi568815581r:29164542_29393892|GENSCAN_predicted_peptide_8|90_aa MLARLVSPGQAGLTMLARLILTSGGWLRSTLISSESKAALNPATLYRQFYTNTLAPMATA TQLSEALGSGYRSAAGGCACLLRPRPVGER >gi568815581r:29164542_29393892|GENSCAN_predicted_CDS_8|270_bp atgttggccaggctggtctcacctggccaggctggtctgaccatgttggccaggctgatc ctgacctcagggggctggctcaggtctactctaatcagctctgaatcaaaagcagccctc aatcctgccaccttgtataggcagttctatacaaacacactcgcccccatggcaacggcc actcagctctcagaagcactgggtagcggctaccggagcgctgcagggggctgcgcctgc ctgctccgccccagacctgtcggcgaaagg