GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:59:23 Sequence gi568815593r:135234979_135489093 : 254115 bp : 45.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 954 1080 127 2 1 49 44 154 0.411 7.43 1.02 Term + 3995 4167 173 0 2 103 53 81 0.775 4.09 1.03 PlyA + 4603 4608 6 -1.75 2.07 PlyA - 5940 5935 6 1.05 2.06 Term - 9787 9712 76 0 1 78 54 78 0.192 0.71 2.05 Intr - 10841 10736 106 0 1 53 76 95 0.173 4.07 2.04 Intr - 20057 20032 26 1 2 117 98 20 0.176 3.57 2.03 Intr - 28658 28541 118 0 1 60 63 105 0.195 4.72 2.02 Intr - 31847 31821 27 0 0 108 63 37 0.235 1.29 2.01 Init - 47309 47195 115 2 1 93 50 62 0.120 3.27 2.00 Prom - 49171 49132 40 -5.16 3.08 PlyA - 49265 49260 6 1.05 3.07 Term - 50207 50035 173 0 2 70 42 101 0.194 1.69 3.06 Intr - 72441 72126 316 0 1 41 17 259 0.092 9.74 3.05 Intr - 73051 72934 118 0 1 58 82 60 0.239 2.87 3.04 Intr - 78819 78510 310 1 1 56 105 97 0.087 3.57 3.03 Intr - 86401 86255 147 2 0 71 78 43 0.361 1.81 3.02 Intr - 89481 89286 196 0 1 118 26 110 0.292 6.79 3.01 Init - 90044 90015 30 2 0 104 42 65 0.676 3.10 3.00 Prom - 91034 90995 40 -5.56 4.13 PlyA - 94380 94375 6 1.05 4.12 Term - 100163 99998 166 1 1 117 44 270 0.998 22.89 4.11 Intr - 100586 100228 359 2 2 -47 81 201 0.178 -0.05 4.10 Intr - 107657 107540 118 1 1 74 91 108 0.991 10.17 4.09 Intr - 108456 108282 175 1 1 65 91 196 0.985 16.60 4.08 Intr - 111079 110990 90 2 0 107 78 165 0.999 17.37 4.07 Intr - 118067 117968 100 2 1 109 59 103 0.641 9.18 4.06 Intr - 125629 125519 111 1 0 29 102 156 0.909 11.68 4.05 Intr - 134625 134428 198 0 0 48 95 202 0.995 16.45 4.04 Intr - 135164 135058 107 0 2 113 80 189 0.887 20.53 4.03 Intr - 154125 153944 182 2 2 75 109 262 0.592 26.51 4.02 Intr - 170385 170325 61 1 1 71 82 44 0.021 -0.11 4.01 Init - 180395 180263 133 2 1 78 47 61 0.257 1.30 4.00 Prom - 180519 180480 40 -2.46 5.00 Prom + 181459 181498 40 -1.66 5.01 Init + 192914 193011 98 1 2 102 77 24 0.259 2.74 5.02 Intr + 200733 200783 51 0 0 71 85 87 0.181 4.72 5.03 Term + 202631 202766 136 2 1 104 49 41 0.368 -0.71 5.04 PlyA + 205630 205635 6 1.05 6.09 PlyA - 207005 207000 6 1.05 6.08 Term - 210917 210811 107 0 2 52 41 89 0.285 -0.83 6.07 Intr - 211552 211468 85 1 1 85 100 32 0.808 3.49 6.06 Intr - 213251 213134 118 1 1 84 64 84 0.848 6.07 6.05 Intr - 214971 214480 492 2 0 71 31 490 0.837 33.71 6.04 Intr - 226656 226513 144 2 0 57 86 84 0.740 4.50 6.03 Intr - 231506 231141 366 1 0 87 97 293 0.938 24.36 6.02 Intr - 231850 231697 154 2 1 71 -3 142 0.858 2.53 6.01 Init - 237373 237235 139 1 1 71 80 142 0.854 12.00 6.00 Prom - 244064 244025 40 -6.56 7.03 PlyA - 245962 245957 6 1.05 7.02 Term - 247534 247382 153 1 0 101 41 117 0.800 6.22 7.01 Intr - 252046 251946 101 2 2 98 98 -15 0.078 0.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 147917 148015 99 1 0 89 86 83 0.849 8.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_1|99_aa MMLMLLVQGPHLESTDREQTQTVPGFAKNDCGEKVRHKAFTKGPHFAIQEALWVSVVKLA HDGCPIDIHWMKECNLPGALEPLAIINLLLKGSSPQLPA >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_1|300_bp atgatgctgatgttgctggtccagggaccacaccttgagagcactgatcgagagcaaacg cagaccgtgccaggatttgccaagaatgactgtggagagaaagtgaggcacaaagccttc actaaaggtccacactttgctatccaggaggccctatgggttagtgtggtgaagctggca catgatggatgtccaatagatattcactggatgaaggaatgcaatttaccaggtgcccta gagcctttggcaataattaatcttctcttaaaaggatcttccccacaacttcctgcctga >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_2|155_aa MEYYAATTKNEIMSFAATLMQLETIFLSKLMQEQKTKYHGYYHQGLGSKQESFAPNPSRA KPNESHVHISEASGRVSLGQAHAEESGSLHVELSHSRNYQLPHTPEKTEINTQRMLELSE VPEPRPGRDEGAATFTTENPLGLTKHLPLRAITGD >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_2|468_bp atggagtattatgcagccacaacaaagaatgaaatcatgtcctttgcagcaacactgatg cagctggagacaatattcctaagcaaattaatgcaggaacagaaaaccaaataccacgga tactaccaccagggtctgggaagcaagcaggagagctttgctccaaatcctagcagagcc aagccaaatgaatcccacgtccacatcagcgaggcctctggccgggtctccctgggccaa gcccacgcggaggagagcggctccttgcatgtggagcttagtcacagcaggaactatcaa ctcccccacacccctgagaaaactgagatcaacacgcagcgaatgctggagctgagtgag gtgcctgagccaagacctggaagagatgaaggagcagccaccttcacaacagagaaccct ttggggctgaccaaacacttgcccttgcgggccatcacgggagactga >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_3|429_aa MAHMLLLNAMATPGPKSPGRMIAELIWPEYCTQDISPTDTCLFRNPKRQAAQQARPNIPG HGNSLTLIPQRATWASVIPKETDAEALQVVGQPAGSKRGTRPPQKEHLGTVAWVRQRGLE GGSSEGVNSHINLLKLMQKKHLQEKKLKVPLNQMWLQVDNMECNSQAILPLQDVLGAKKQ ATRSLNSPKNCHNFHALKAHSSSSQDTSVRASDASSSKRVQTGAAVRRHRVKRSLDISSM YYDCSSVSSLGTPSGLSTFTWIDQGPQVLGWAYVLHIVSACRLRHRHHRHHYCYFTGKGT APKQLAQGHTRRKRYNLESALVCLTLRPQPLPALTSCLLGVQTVVVSVTQALAIVPDDPP ISVSKAVLNTTIEIQKIPRDYSKQLYAHELKTPEEMDKILETQNLLRLNQEEIEALNRQI SSSEIKSVI >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_3|1290_bp atggcccacatgctcctgctcaacgccatggccaccccaggacccaagtcaccagggagg atgattgctgagctcatctggcctgagtactgcacacaggacatttctcccacagacacc tgcttgtttagaaaccccaaacgccaggcggcccagcaggcccggcccaatattcccggg cacgggaactctctcaccctcataccccagagggccacatgggccagtgtaattccaaag gagacagatgctgaggcactgcaggtggttgggcagccagctggcagcaagagggggacc aggcctccacagaaagagcacttaggaacagtggcctgggtacggcagcggggcctggag ggaggcagctctgagggtgtgaattctcacattaacctgctgaagttaatgcagaagaag cacctccaagaaaagaaactgaaagtccccctaaaccaaatgtggttacaagttgataac atggaatgcaactctcaggcaatcttacctcttcaggatgttttaggggccaagaaacag gccacaaggtctttgaatagcccaaaaaactgtcacaacttccatgcactgaaagctcat tcctcttcctcacaagacacctccgtgagagccagtgatgcttcctctagcaagcgggtg cagacaggagcagcggtgagaaggcacagggtgaagcgctccctagacatcagctcaatg tattatgattgctcctcagtgtccagcctgggcacaccatcagggctcagcaccttcacc tggattgaccagggcccccaggtccttggttgggcttacgtgctccacattgtgagtgcc tgtcgtcttcgtcatcgtcatcatcgtcatcactactgctatttcacaggtaaaggcact gcgcctaagcaacttgctcaaggtcacacaagaagaaaacggtacaatctagagtcggcg ctggtctgtttgacactgaggcctcaacccttgcccgccctcacatcctgcctccttggt gttcagactgttgtggtcagtgtgacacaagccctggccattgttccagatgaccctcca atatccgtcagcaaagcagtcctcaataccaccatagaaatacaaaagatccccagagac tactctaaacaactctatgcacatgaattaaaaactccagaggaaatggataaaatcctg gaaacacagaatctcctgagattgaatcaagaggagattgaagccctgaacagacaaata tcaagttctgaaattaaatcagtaatttaa >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_4|599_aa MEYYAAIKKDEFMSFVGTWMKLETIILSKLLQGQKTKDHMFSLIDSYVEALTSNVMVIGD GAFRRATAMSSRGGKKKSTKTSRSAKAGVIFPVGRMLRYIKKGHPKYRIGVGAPVYMAAV LEYLTAEILELAGNAARDNKKGRVTPRHILLAVANDEELNQLLKGVTIASGGVLPNIHPE LLAKKRGSKGKLEAIITPPPAKKAKSPSQKKPVSKKAGGKKGARKSKKKQGEVSKAASAD STTEGTPADGFTVLSTKSLFLGQKLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLGNT LEKKGGKEFVEAVLELRKKNGPLEVAGAAVSAGHGLPAKFVIHCNSPVWGADKCEELLEK TVKNCLALADDKKLKSIAFPSIGSGRKSPGRLTKVFTTNTVAPKTYTYPEDHHRRSLVFR QLMMQVSAFPENLQPSPGADIKSLGYGSGTVLVKGIMEKENQELTLYFGGLMHTTKHYSV CWLLPSLKAHSTSFCRCQLSAQLSVPHPIFWALGFPTLAYKQTRGSTDTDPTQGKLQHRG RWPVRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIGIYVQEMAKLDAN >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_4|1800_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctgagcaaactattgcaaggacagaaaaccaaagaccacatg ttctcactcatagattcatatgttgaagctctaacctccaatgtgatggtaattggagat ggagcctttaggagggccaccgccatgtcgagccgcggtgggaagaagaagtccaccaag acgtccaggtctgccaaagcaggagtcatctttcccgtggggcggatgctgcggtacatc aagaaaggccaccccaagtacaggattggagtgggggcacccgtgtacatggccgccgtc ctggaatacctgacagcggagattctggagctggctggcaatgcagcgagagacaacaag aagggacgggtcacaccccggcacatcctgctggctgtggccaatgatgaagagctgaat cagctgctaaaaggagtcaccatagccagtgggggtgtgttacccaacatccaccccgag ttgctagcgaagaagcggggatccaaaggaaagttggaagccatcatcacaccaccccca gccaaaaaggccaagtctccatcccagaagaagcctgtatctaaaaaagcaggaggcaag aaaggggcccggaaatccaagaagaagcagggtgaagtcagtaaggcagccagcgccgac agcacaaccgagggcacacctgccgacggcttcacagtcctctccaccaagagcctcttc cttggccagaagctgaaccttattcacagtgaaatcagtaatttagccggctttgaggtg gaggccataatcaatcctaccaatgctgacattgaccttaaagatgacctaggaaacacg ctggagaagaaaggtggcaaggagtttgtggaagctgtcctggaactccggaaaaagaac gggcccttggaagtagctggagctgctgtcagcgcaggccatggcctgcctgccaagttt gtgatccactgtaatagtccagtttggggtgcagacaagtgtgaagaacttctggaaaag acagtgaaaaactgcttggccctggctgatgataagaagctgaaatccattgcatttcca tccatcggcagcggcagaaaatccccaggtagactcaccaaggtcttcacgaccaacaca gttgcccctaagacatacacatacccggaggaccaccacaggagaagtctagtttttcga cagctgatgatgcaggtcagtgccttcccagagaacctgcagccatcccctggggctgac atcaagagcctgggctatggctcaggcactgtgctagtcaaaggtatcatggaaaaagaa aatcaggagcttacactgtacttcgggggcttaatgcacaccacaaaacactattctgtt tgctggctactgccttctctaaaagcacatagcacatccttctgcaggtgccagctctct gcccagttgtctgtcccccatcccatcttctgggcactgggctttcccacactggcgtac aagcagacacggggctcaacagacaccgacccaacacaaggcaagctgcagcacagaggg aggtggcctgtaaggaacggttttccaaagcagacagcagctcagctgattctgaaggcc atctccagttacttcgtgtctacaatgtcctcttccatcaaaacggtgtacttcgtgctt tttgacagcgagagtataggcatctatgtgcaggaaatggccaagctggacgccaactag >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_5|94_aa MALPQTVSTQLVVSLFGLVQVFVCSERLPELVSGLLVAPGPDLEISIALRNYETVTHTPF AFAAPLVAIGMVPSGTEPAVTHFPVPQAPASTIR >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_5|285_bp atggctctgccacaaacagtttccacccaactggtggtcagcctttttggactagtccag gtttttgtgtgctctgagaggctgccagaacttgtaagtgggctgctggtggcaccaggg cctgacctggagattagcatcgccctgaggaattatgagacagtaacgcacactcctttt gcatttgcggctcctcttgtggccatcggtatggttccttcagggactgagcctgctgtg actcacttcccagtcccccaggccccagcaagcaccatcagatag >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_6|534_aa MTACKIETTDLVELGPGVLEALIGKKARAGKETVKPMTGQFLQLSPEPRGATGKNKSPGN DPAAAIATAGAAATAGPGSPCSLQNALIYSHSFLEHHLKAEHPAHRTDPGTQQVLHKRLL NAGLEQCRNSSYSEIAGASRGVPHRRRRRIQGVEGAGERVQSAIYRQFVGLAGKALALRR DGAPQTALFATLLRIHSLSNRSAITSQSPVNFAAYLRRERSPLGQREEENAFYIMVVAMW KRHISLNIRFRMKTHVCKAYVKHVMHERTSSMEKPLTVLRVSLYHPTLGPSAFANVPPRL QHDTSPLLLGRGQDAHLQLQLPRLSRRHLSLEPYLEKGSALLAFCLKALSRKGCVWVNGL TLRYLEQVPLSTVNRVSFSGIQMLVRVEEGTSLEAFVCYFHVSPSPLIYRPEAEETDEWE GISQGQPPPGSGCQQFLFSAQKDDRFAPRASYKEVQLHSQALSIFSIRKTELEPSGWLQG LVFALDITIPHLLRQPEQAGILTLAFLCFSKNVNLAKLSVRDTGSPRGRLANAP >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_6|1605_bp atgacggcctgcaagattgaaacgactgaccttgttgaattgggacctggcgttttggag gctctgataggcaagaaggccagagcaggcaaggaaactgtgaagccaatgactggccag tttctacagctctcgccagagcctagaggtgcaaccggaaagaacaagtcccctggaaac gaccccgctgctgccatcgccactgctggtgctgctgctacggcaggccctggctctcca tgcagtctccagaatgccctcatctactcccattcatttcttgagcaccacctgaaggct gaacacccagcacacaggacagaccctggtactcagcaggtgctgcataagcgtctgtta aatgcaggactggagcagtgcagaaacagctcttactctgaaatcgcaggcgcttcccgg ggagtcccgcaccggcgcagacggcggatccagggcgtggagggggccggggaacgggtt cagagtgccatctaccggcagttcgtcggactggcaggaaaggccttggccctgcggcgg gatggagccccccagactgcgctgtttgctacgctgctccggatccattcactctccaac cgctctgcaatcacttcgcaatcaccagtaaactttgccgcctacttgagaagagaaaga tcccccctggggcagagggaggaggaaaatgctttttatattatggtagtggccatgtgg aaaagacatatttccctcaacattcgattccgaatgaaaacgcacgtttgtaaagcatat gtgaaacatgtcatgcacgaaaggacttcttccatggagaagcccctcaccgtcctgcga gtgagcctgtaccatcccacgctgggcccatctgcctttgccaatgtcccaccacggctg cagcatgataccagccctctgcttctcggacgggggcaggacgcccacctccagctgcag ctccctcgcctctcccgccgtcacctgtccctggagccctacctggagaaaggcagtgcc ctgctggccttctgcctcaaggccctgagccgcaagggctgtgtgtgggtcaatgggctg acgctgaggtacctggagcaggtccccctgagcaccgtcaacagggtctccttctcaggc atccagatgctggttcgcgtagaagaaggcacatccctggaggcttttgtctgctatttc catgtcagcccttcacccctgatttacagacctgaggctgaggaaactgacgaatgggaa ggcatctcccaggggcagcctccccctggttcaggctgtcaacaatttctgttttctgcc caaaaggatgatcgttttgcacccagggcaagctacaaggaggtccaacttcactcacag gccctgagcatcttctccatcaggaaaacagagctggaaccgtcaggctggcttcagggc ctggtcttcgcacttgatatcactatccctcacctgctcagacagccagagcaggcgggt attttaaccctcgcattcctctgcttttccaagaatgtgaatcttgccaagctgtccgtg agagacacgggctcccctcgagggagacttgcaaatgctccataa >gi568815593r:135234979_135489093|GENSCAN_predicted_peptide_7|84_aa XTWVFLQPLHPSSRMKVGTAFCPYGAPCPRPKVRLSRDDDSDDSNSHIDLGSLPLHPALP LKISVSDQTTSSHPEELITIAGPK >gi568815593r:135234979_135489093|GENSCAN_predicted_CDS_7|255_bp nggacatgggtgttccttcagcccctccacccttcttcacggatgaaggtaggcacagcc ttctgcccttatggagcaccgtgcccaaggcctaaggtcaggctgagcagagatgatgac agtgatgatagtaacagccacattgacctaggcagtcttcccctccacccggccttgcct ctgaagatctcagtatctgatcagacaaccagttctcaccctgaagagctgatcacaatt gcaggccccaagtga