GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:11:30 Sequence gi568815587r:123534146_123753801 : 219656 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15774 15835 62 2 2 60 54 78 0.263 2.32 1.02 Intr + 37059 37208 150 0 0 79 60 64 0.001 1.98 1.03 Intr + 43222 43569 348 1 0 82 53 352 0.227 25.57 1.04 Intr + 47315 47474 160 2 1 87 110 5 0.103 2.59 1.05 Intr + 50167 50187 21 0 0 122 101 -8 0.031 1.64 1.06 Intr + 56807 56923 117 1 0 99 92 54 0.045 7.56 1.07 Intr + 59937 60021 85 2 1 83 97 120 0.693 11.79 1.08 Intr + 60590 60693 104 0 2 112 63 48 0.888 4.59 1.09 Intr + 61797 61892 96 2 0 43 115 97 0.838 8.11 1.10 Term + 62755 62826 72 0 0 -21 42 99 0.239 -7.69 1.11 PlyA + 63482 63487 6 -1.95 2.02 PlyA - 63593 63588 6 1.05 2.01 Sngl - 65307 64372 936 0 0 95 47 545 0.998 47.59 2.00 Prom - 66694 66655 40 -9.95 3.00 Prom + 67130 67169 40 -6.66 3.01 Init + 68940 68986 47 2 2 43 100 -3 0.069 -3.45 3.02 Intr + 69229 69396 168 1 0 37 84 168 0.172 10.36 3.03 Intr + 71177 71333 157 2 1 78 64 195 0.990 16.11 3.04 Intr + 72464 72653 190 1 1 66 86 296 0.999 26.36 3.05 Intr + 74514 74657 144 1 0 101 38 363 0.999 32.85 3.06 Intr + 75650 75768 119 0 2 85 99 86 0.877 9.58 3.07 Intr + 76051 76193 143 0 2 106 64 275 0.826 26.05 3.08 Intr + 78616 78719 104 1 2 108 71 171 0.948 17.22 3.09 Intr + 79310 79513 204 0 0 81 89 170 0.996 15.57 3.10 Intr + 80600 80690 91 0 1 112 47 156 0.980 12.95 3.11 Intr + 84548 84655 108 2 0 98 84 80 0.986 8.00 3.12 Intr + 84962 85079 118 2 1 122 94 20 0.974 6.47 3.13 Term + 88361 88450 90 1 0 96 48 179 0.994 12.42 3.14 PlyA + 88885 88890 6 1.05 4.10 PlyA - 89283 89278 6 1.05 4.09 Term - 100061 99998 64 1 1 122 42 59 0.906 2.06 4.08 Intr - 104179 104041 139 2 1 98 96 128 0.962 14.12 4.07 Intr - 108526 108301 226 1 1 126 72 417 0.998 41.46 4.06 Intr - 111605 111442 164 0 2 91 36 184 0.950 13.19 4.05 Intr - 119727 119569 159 1 0 2 92 86 0.014 0.36 4.04 Intr - 135522 135474 49 0 1 108 83 35 0.048 3.25 4.03 Intr - 137867 137789 79 1 1 70 87 44 0.088 2.05 4.02 Intr - 144153 144062 92 0 2 53 73 66 0.007 0.39 4.01 Init - 166442 166383 60 2 0 71 110 6 0.143 2.35 4.00 Prom - 178813 178774 40 -2.26 5.08 PlyA - 180377 180372 6 1.05 5.07 Term - 187605 187429 177 0 0 100 55 106 0.822 6.19 5.06 Intr - 192846 191975 872 1 2 107 -3 457 0.320 29.56 5.05 Intr - 193450 193331 120 2 0 41 94 50 0.665 1.37 5.04 Intr - 194117 193988 130 2 1 77 47 125 0.937 7.57 5.03 Intr - 195069 194909 161 1 2 108 72 61 0.668 6.21 5.02 Intr - 195764 195470 295 2 1 54 44 242 0.816 12.98 5.01 Init - 196743 196342 402 0 0 66 68 310 0.914 23.43 5.00 Prom - 199597 199558 40 -4.56 6.05 PlyA - 200741 200736 6 1.05 6.04 Term - 201324 201271 54 0 0 115 49 17 0.161 -1.94 6.03 Intr - 208801 208676 126 1 0 95 66 21 0.117 1.48 6.02 Intr - 214523 214359 165 2 0 52 61 73 0.188 1.16 6.01 Intr - 216075 216024 52 2 1 96 75 74 0.650 5.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 37190 37051 140 0 2 92 43 96 0.917 3.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_1|404_aa MYGKNGHQEKMSIDVNNPHRRILKNARVADQGAGLLTRSALCLQDHSGDKDEEESQAKWC SGHRVCSGRGGTASNSNRSTPACSPILRKRSRSPTPQNQDGDTMVEKGSDHSSDKSPSTP EQGVQRSCSSQSGRSGGKNSKVSGTPLRRYLLVRGCGERYWGGEPENICEGPHVMNSPDG SETVIIQAFGTHFRECRTTGIECASGQSSGTPAHSGQARGLLWTWLEGGPIFPVAPQWQQ KSQSWYNNLLVHPGACLPDLQLALGTCHAICILWTLGVTVCTDQVEVLSPTYKQRNEDFR KLFKQLPDTERLIVDYSCALQRDILLQGRLYLSENWICFYSNIFRWETLLTVRLKDICSM TKEKTARLIPNAIQVCTDSEKIWILGQLLNSTQLYHGYLGLENM >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_1|1215_bp atgtatgggaaaaatgggcaccaagaaaagatgtccatagatgtgaataatccacatcgt aggattctgaaaaatgcacgggtagctgatcagggagctggcttgctgacaaggtctgcc ttgtgcctgcaggaccacagtggggataaggatgaggaagaaagccaagcaaagtggtgt agcggccaccgtgtctgcagtgggcgtgggggcactgccagtaactccaaccgcagcacg ccggcctgctcgcccatcctccggaagcggtctcgctcgccaaccccgcagaaccaggac ggagacaccatggtggagaagggctcagatcactcctcggacaagtccccgtccacaccg gagcagggcgtgcagcgcagctgctcctcccagtccggccggagcggcggcaagaattcc aaggtgagcgggaccccgttgaggcggtacctccttgtcaggggctgcggggagcgatat tggggtggtgagccggagaacatctgcgagggtcctcacgtgatgaacagccctgatggc tcagagacagtcattatccaagcctttgggactcatttccgggaatgcagaaccacgggg attgaatgtgcctctgggcagagttcagggacccctgcccacagtggacaggctcggggg ctgctttggacgtggctggaggggggacccatttttcctgtggcccctcagtggcagcag aaaagccagagttggtataataatctgcttgtccacccaggggcctgcctcccggacctc cagctggccctgggcacctgccatgccatctgcatcctctggacccttggggtgactgtc tgcactgaccaggtggaggtgttaagccccacctacaagcagagaaatgaagacttcaga aagctctttaagcagcttccagacacggagcgcctcattgttgattactcatgtgcactc caaagagacattctccttcagggccgactctacctctctgaaaattggatctgcttctac agcaacatcttccgctgggaaactctgctgacagtccgtttgaaagacatctgttccatg actaaagaaaaaacagctcgcctcattcccaatgccatccaagtttgcactgattcagaa aagatttggatcctggggcagctgctcaactctacccaactctaccatggctacctgggc ttggagaacatgtga >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_2|311_aa METILEQQQCYHEEKEWLMDVMAKEMLTKKSMLWDQINSDHCTRATQDRYMEVSGNPRDL YDDKDGLRKEELGAISGPKEFSDFCNRLKQIKEFHRKHPNEIYVAMSVEFEELLKARENP SEEAQNSVEFTDEEGYGRYLDLHDCCLKYINLKASEKLDYITYLSILDQLFDIPKDRRNA EHKRYLEMLLEYLQDYTDRVKPLQDQNELSGKIQAEFEKKWENGIFPGWPKETSSALTQA GAHLDLSAFSSWEELASLGLDRLKSALLALGLKCGRIPEERAQRLFSTKGKSLESLDTSL FAKNPKSKGTK >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_2|936_bp atggagacaatactggagcagcagcagtgctatcatgaggagaaggaatggctcatggat gtcatggccaaagagatgctcactaaaaagtccatgctctgggaccagatcaattctgat cactgcactcgggccacgcaagataggtatatggaggtcagtgggaacccaagggatttg tatgatgataaggatggattacgaaaggaggagctcggtgccatttcaggacccaaggaa ttttctgatttctgtaacagactcaagcaaataaaggaattccaccggaagcacccaaat gagatctatgtggcaatgtcagtggaatttgaggagctcctgaaggctcgagagaatcca agtgaagaggcacaaaactcggtggagttcacagatgaagagggatatggtcgttacctc gatctccatgactgttgcctcaagtacattaacctgaaggcatctgagaagctggattat atcacatacctgtccatcttagaccaattatttgacattcctaaagacaggaggaatgca gagcataagagatacctagagatgctgcttgagtaccttcaggattacacagatagagtg aagcctctccaagatcagaatgaactttctgggaagattcaggctgaatttgagaagaaa tgggagaatgggatctttcctggatggccgaaagagacaagcagtgctctgacgcaggct ggagcccatcttgacctctctgcattctcctcctgggaggagttggcctctctgggtttg gacagattgaaatccgctctcttagctttaggactgaaatgtggcaggatcccagaagag cgagcccagagactattcagcaccaaaggaaagtccctagagtcacttgatacctctttg tttgccaaaaatcccaagtcaaagggcaccaagtaa >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_3|560_aa MGLCHSVLDLFHKSKRVGGPEGARLSRSFSPLKPLCPKELWHFVHQCYGNELGLTSDDED YVPPDDDFNTMGYCEEIPVEENEVNDSSSKSSIETKPDASPQLPKKSITNSTLTSTGSSE APVSFDGLPLEEEALEGDGSLEKELAIDNIMGEKIEMIAPVNSPSLDFNDNEDIPTELSD SSDTHDEGEVQAFYEDLSGRQYVNEVFNFSVDKLYDLLFTNSPFQRDFMEQRRFSDIIFH PWKKEENGNQSRVILYTITLTNPLAPKTATVRETQTMYKASQESECYVIDAEVLTHDVPY HDYFYTINRYTLTRVARNKSRLRVSTELRYRKQPWGLVKTFIEKNFWSGLEDYFRHLESE LAKTESTYLAEMHRQSPKEKASKTTTVRRRKRPHAHLRVPHLEEVMSPVTTPTDEDVGHR IKHVAGSTQTRHIPEDTPNGFHLQSVSKLLLVISCVLVLLVILNMMLFYKLWMLEYTTQT LTAWQGLRLQERLPQSQTEWAQLLESQQKYHDTELQKWREIIKSSVMLLDQMKDSLINLQ NGIRSRDYTSESEEKRNRYH >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_3|1683_bp atgggtttatgccactcagtgctggaccttttccataagagcaaaagagtcgggggacct gagggagcaaggctcagtcgttccttttctcccctgaagcctctgtgtcccaaggagctc tggcactttgttcaccagtgctatgggaacgaattgggcctgaccagtgatgacgaggac tacgtgccccctgacgacgacttcaacacaatgggatactgtgaagagatccctgtggaa gagaatgaagtgaatgacagctcatccaagagcagcatagagaccaagccagatgccagt ccacagctgcccaagaaatccatcaccaacagcacactaacatccacagggagcagtgag gcccccgtctcgtttgatgggctgcccctggaggaagaggcgctggagggagacgggtcc ctggaaaaggagctcgccattgacaacatcatgggggagaagattgagatgatcgctcct gtgaactccccttcactggacttcaatgacaatgaggacatccccactgagctcagtgac tcttccgacacacacgatgaaggagaggtccaggccttctatgaggacctgagtggccgg cagtacgtgaatgaagtcttcaacttcagcgtggacaagctctatgacctcctcttcacc aactcgcccttccagcgggatttcatggagcagcggcgcttctctgatatcatcttccat ccatggaaaaaggaggagaatggaaaccagagccgagtgattctttacaccatcaccctt accaaccctctggctcccaaaactgccactgtcagggagacacagaccatgtacaaggcg agccaggagagtgaatgttacgtgatagatgccgaagtcctcacccacgacgtgccctac catgactacttctacacaatcaatcgctacacgctcacccgtgtggctcggaacaagagc cgactcagggtctccacagagctgcgctatcgaaaacagccctgggggttagtgaaaacg ttcatcgagaagaacttctggagtgggctggaggactacttccgccatttagagagcgag ctggccaaaacggagagcacttatttggctgagatgcacagacaatctcccaaagagaag gccagcaagactacaacggtgcggaggaggaagcgtccccatgcccacctgcgagtccct cacctggaagaggtgatgagcccggtcaccacgcccacagatgaggatgtgggccacagg atcaaacatgtggcaggttccacacagacgcggcatatcccggaggacacccccaacggt ttccacctgcagagcgtgtccaagctgctgctggttatcagctgtgttctggtgctgctg gtcatccttaacatgatgctcttctacaaactctggatgttggaatacaccacgcagacc ctcactgcctggcagggtctaaggctccaagaaaggttaccccagtctcagacagaatgg gcccagctcttagagtcccaacaaaagtaccacgatactgagctccaaaaatggagggaa atcatcaaatcctcagtgatgctccttgaccagatgaaggactcgctcatcaaccttcag aacggcatcaggtcccgcgactacacgtcggaaagtgaagaaaagaggaatcgctatcat tga >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_4|343_aa MPSLITLIQHTVGSSGQGNQPGAAVGHRGPQSGGQAECRAEEGQNHLRPTRGSQQADKNS NSQIKVKMTPSLWAEGQRTAKHPITFNPVQQNIQSERAQSLTEGISLCSLGSRQPQKMPA FNRLFPLASLVLIYWGKYQHLAMPDAVSVCFPVCVEVPSETEAVQGNPMKLRCISCMKRE EVEATTVVEWFYRPEGGKDFLIYEYRNGHQEVESPFQGRLQWNGSKDLQDVSITVLNVTL NDSGLYTCNVSREFEFEAHRPFVKTTRLIPLRVTEEAGEDFTSVVSEIMMYILLVFLTLW LLIEMIYCYRKVSKAEEAAQENASDYLAIPSENKENSAVPVEE >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_4|1032_bp atgccctctctcatcactcttattcaacatactgttggaagttctggccagggcaatcag cctggagctgctgttggtcatcggggcccacagtctggaggacaagctgaatgcagagca gaggagggccagaaccacctaagacccacaagagggagccaacaggctgacaagaacagc aattcccagatcaaggtcaaaatgacaccatcgctgtgggcagaagggcagagaactgcc aaacatccaataactttcaatccagtccagcaaaacatacaatctgagagggcgcagtcc ttgaccgagggaatctctctgtgtagccttggaagccgccagccccagaagatgcctgcc ttcaatagattgtttcccctggcttctctcgtgcttatctactggggtaagtaccagcac ctcgccatgccggatgcagtcagtgtctgcttccctgtgtgtgtggaagtgccctcggag acggaggccgtgcagggcaaccccatgaagctgcgctgcatctcctgcatgaagagagag gaggtggaggccaccacggtggtggaatggttctacaggcccgagggcggtaaagatttc cttatttacgagtatcggaatggccaccaggaggtggagagcccctttcaggggcgcctg cagtggaatggcagcaaggacctgcaggacgtgtccatcactgtgctcaacgtcactctg aacgactctggcctctacacctgcaatgtgtcccgggagtttgagtttgaggcgcatcgg ccctttgtgaagacgacgcggctgatccccctaagagtcaccgaggaggctggagaggac ttcacctctgtggtctcagaaatcatgatgtacatccttctggtcttcctcaccttgtgg ctgctcatcgagatgatatattgctacagaaaggtctcaaaagccgaagaggcagcccaa gaaaacgcgtctgactaccttgccatcccatctgagaacaaggagaactctgcggtacca gtggaggaatag >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_5|718_aa MATAVEPEDQDLWEEEGILMVKLEDDFTCRPESVLQRDDPVLETSHQNFRRFRYQEAASP REALIRLRELCHQWLRPERRTKEQILELLVLEQFLTVLPGELQSWVRGQRPESGEEAVTL VEGLQKQPRRPRRWASSPKISSRDNQELPPDSMVTGSWNYSQVTVHVHGQEVLSEETVHL GVEPESPNELQDPVQSSTPEQSPEETTQSPDLGAPAEQRPHQEEELQTLQESEVPVPEDP DLPAERSSGDSEMVALLTALSQVCPSYLCTTENLFEEPLGISHTKQGLVTFKDVAVCFSQ DQWSDLDPTQKEFYGEYVLEEDCGIVVSLSFPIPRPDEISQVREEEPWVPDIQEPQETQE PEILSFTYTGDRSKDEEECLEQEDLSLEDIHRPVLGEPEIHQTPDWEIVFEDNPGRLNER RFGTNISQVNSFVNLRETTPVHPLLGRHHDCSVCGKSFTCNSHLVRHLRTHTGEKPYKCM ECGKSYTRSSHLARHQKVHKMNAPYKYPLNRKNLEETSPVTQAERTPSVEKPYRCDDCGK HFRWTSDLVRHQRTHTGEKPFFCTICGKSFSQKSVLTTHQRIHLGGKPYLCGECGEDFSE HRRYLAHRKTHAAEELYLCSECGRCFTHSAAFAKHLRGHASVRPCRCNECGKSFSRRDHL LYISPRQVQGTTNEKISGRGLQQLLGAWVHLRNLGNRSTEKQFHGKRAVGQLNNPSKQ >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_5|2157_bp atggctacagccgtggaaccagaggaccaggatctttgggaagaagagggaattctgatg gtgaaactggaagatgatttcacctgtcggccagagtctgtcttacagagggatgacccg gtgctggaaacctcccaccagaacttccgacgcttccgctaccaggaggcagcaagccct agagaagctctcatcagactccgagaactttgtcaccagtggctgagaccagagaggcgg acaaaggagcagatcctagagctgcttgtgctggaacaatttcttaccgtcctacctgga gaactacagagctgggtgcggggccaacggccagaaagtggcgaggaggcagtgacgctg gtggagggtttgcagaaacaacccaggagaccaaggcggtgggcatcttctcctaaaata agctcccgtgacaaccaagaacttcctcctgactccatggtgactggaagttggaattat tcccaggtgactgtccatgttcacggccaggaagtcctgtcagaggagacggtgcattta ggagtggagcctgagtcacctaatgagctgcaggatcctgtgcaaagctcgacccccgag cagtctcctgaggaaaccacacagagcccagatctgggggcaccggcagagcagcgtcca caccaggaagaggagctccagaccctgcaggagagcgaggtcccagtgcccgaggaccca gaccttcctgcagagaggagctctggagactcagagatggttgctcttcttactgctctg tcacaggtgtgccctagttacctctgtaccacagagaatttgtttgaagaaccactgggc ataagccatactaaacagggactggtaacgttcaaggatgtggccgtatgcttttcccag gaccagtggagtgatctggacccaacacagaaagagttctatggagaatatgtcttggaa gaagactgtggaattgttgtctctctgtcatttccaatccccagacctgatgagatctcc caggttagagaggaagagccttgggtcccagatatccaagagcctcaggagactcaagag ccagaaatcctgagttttacctacacaggagataggagtaaagatgaggaagagtgtctg gagcaggaagatctgagtttggaggatatacacaggcctgttttgggagaaccagaaatt caccagactccagattgggaaatagtctttgaggacaatccaggtagacttaatgaaaga agatttggtactaatatttctcaagtgaatagttttgtgaaccttcgggaaactacaccc gtccaccccctgttagggaggcatcatgactgttctgtgtgtggaaagagcttcacttgt aactcccaccttgttagacacctgaggactcacacaggagagaaaccctataaatgtatg gaatgtggaaaaagttacacacgaagctcacatcttgccaggcaccaaaaggttcacaag atgaacgcgccttacaaatatcccctaaaccggaagaatttggaagagacctcccctgtg acacaggctgagagaactccatcagtggagaaaccctatagatgtgatgattgcggaaag cacttccgctggacttcagaccttgtcagacatcagaggacacatactggagaaaaaccc ttcttttgtactatttgtggcaaaagcttcagccagaaatctgtgttaacaacacaccaa agaatccacctgggaggcaaaccctacttgtgtggagagtgtggtgaggacttcagtgaa cacaggcggtacctggcgcaccggaagacgcacgctgctgaggaactctacctctgcagc gagtgcgggcgctgcttcacccacagcgcagcgttcgccaagcacttgagaggacacgcc tcagtgaggccctgccgatgcaacgaatgtgggaagagcttcagtcgcagggaccacctc ttgtacatctctcctagacaagtccaaggaactactaacgagaagatttcaggaagaggc ctacagcaattgcttggtgcttgggttcatttgcggaatcttggcaacaggtctacagag aagcagttccacggcaaaagagctgtggggcagttgaataatccatccaaacaatga >gi568815587r:123534146_123753801|GENSCAN_predicted_peptide_6|132_aa XGGKDMKEIVGSLQCPEKHYSTIAKTWNQPKYPSMVDWIKKMWYIYTMEYYAATKKNDIM SSAAKWMELEAIITPAPQSLSSSQAFSFRLNYTTNLLISPACSKQMVGLLNVHNYAAKVN AKQAPEISACDF >gi568815587r:123534146_123753801|GENSCAN_predicted_CDS_6|399_bp nngggaggcaaagacatgaaggagattgtgggcagcctccagtgtccagagaagcactat tccacaatagcaaagacatggaatcaacctaaatacccctcaatggtggactggataaag aaaatgtggtacatatataccatggaatactatgcagccacaaaaaagaatgacatcatg tcttctgcagcaaaatggatggagctggaagccattattacaccagcacctcaatccctt tccagttctcaggccttcagctttagactgaattataccaccaacctccttatttctcca gcttgcagtaagcagatggtgggacttctcaacgtccataattatgctgccaaagtaaat gcaaaacaagcaccagaaatctcagcttgtgatttctga