GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:41:42 Sequence gi568815594r:46828228_47093424 : 265197 bp : 36.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 3112 3321 210 0 0 104 37 148 0.862 5.57 1.02 PlyA + 3850 3855 6 1.05 2.00 Prom + 5171 5210 40 -6.45 2.01 Init + 7258 7295 38 0 2 81 72 85 0.744 5.73 2.02 Term + 59687 59963 277 2 1 66 54 217 0.385 10.05 2.03 PlyA + 63056 63061 6 1.05 3.00 Prom + 76094 76133 40 -3.15 3.01 Init + 80878 81025 148 0 1 64 80 119 0.705 9.00 3.02 Term + 81055 81101 47 2 2 59 47 87 0.883 -1.91 3.03 PlyA + 81548 81553 6 1.05 4.06 PlyA - 82968 82963 6 1.05 4.05 Term - 100528 99998 531 1 0 76 28 385 0.324 24.56 4.04 Intr - 137002 136743 260 2 2 64 68 326 0.431 24.46 4.03 Intr - 146148 146005 144 1 0 126 80 87 0.661 11.03 4.02 Intr - 164719 164577 143 0 2 102 36 123 0.501 7.58 4.01 Init - 165197 165112 86 2 2 87 97 187 0.746 17.84 4.00 Prom - 169529 169490 40 -5.75 5.03 PlyA - 169585 169580 6 1.05 5.02 Term - 189964 189771 194 2 2 -56 42 402 0.949 17.80 5.01 Init - 190678 190675 4 1 1 76 57 0 0.545 -4.09 5.00 Prom - 192845 192806 40 -3.25 6.00 Prom + 195368 195407 40 -4.65 6.01 Init + 203706 203778 73 2 1 71 82 84 0.819 7.38 6.02 Term + 204190 204572 383 2 2 87 55 172 0.650 7.82 6.03 PlyA + 204913 204918 6 1.05 7.03 PlyA - 205270 205265 6 1.05 7.02 Term - 217334 217189 146 0 2 86 47 214 0.985 14.19 7.01 Init - 217917 217875 43 0 1 83 98 35 0.579 4.63 7.00 Prom - 222214 222175 40 -5.85 8.04 PlyA - 223214 223209 6 1.05 8.03 Term - 224601 224494 108 0 0 91 49 126 0.534 6.53 8.02 Intr - 226284 226221 64 2 1 8 98 52 0.020 -4.00 8.01 Init - 241720 241614 107 1 2 101 68 48 0.207 3.84 8.00 Prom - 246522 246483 40 -4.65 9.04 PlyA - 246976 246971 6 1.05 9.03 Term - 251211 250712 500 1 2 37 46 260 0.878 10.40 9.02 Intr - 256573 256429 145 1 1 57 82 131 0.874 8.33 9.01 Init - 258884 258489 396 2 0 49 84 172 0.876 9.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_1|69_aa MGSCAARASSTSATPGSTAPCPIDHPGAEECGRMARDWQAAPPVAPVRDPLGEASWAPES SGDLENLYV >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_1|210_bp atgggctcctgtgcggcccgagcctcctcaacgagtgccacccccggctccacggcaccc tgtcccatcgaccacccaggggctgaggagtgcgggcgcatggcacgggactggcaggca gctccacctgtggccccagtgcgggatccactgggtgaagccagctgggctcctgagtct agtggggacttggagaacctttatgtctag >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_2|104_aa MVEEESDGYILGQHGSLLFGAINQARTSRLPTPTEGIDTQMKDLLAVLTPLLMTQTSGKQ HFTVMITTSLISNNSHGVYCALGAQQEKQSGVAFKVPQELSALL >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_2|315_bp atggtggaagaagagtctgacggctacattttgggacaacatggttcactcctatttgga gccattaatcaagccagaaccagtaggctgccaacccctactgaaggtatagatacccaa atgaaagacctgcttgctgtgctgacaccactcttaatgactcaaacatctggaaaacaa catttcacagtgatgatcacaacttcactgataagcaataactcgcatggtgtgtactgt gcattaggagctcaacaggagaagcaaagcggtgtcgctttcaaggtgccacaagaactc agtgctctcctttga >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_3|64_aa MSYNCQLTDNEPQNETNLPRTSQVTGKKRRRRGKFRTFPFYSLEGSGKTGERKRRSHEGL RVPN >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_3|195_bp atgtcttataactgtcagcttactgacaacgaaccccagaatgagacaaacttaccaagg acgtcacaggtaacaggcaaaaaaagacggcgcagaggcaaattccgaacctttccgttt tacagcctggaggggtcgggtaaaacaggtgaaaggaagaggcggtcacatgaaggactg cgcgttccaaactga >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_4|387_aa MVSAKKVPAIALSAGVSFALLRFLCLAVCLNESPGQNQKEEKLCTENFTRILDSLLDGYD NRLRPGFGGMNDFKRANAYPKSEMIYTWTKGPEKSVEVPKESSSLVQYDLIGQTVSSETI KSITGITTVLTMTTLSISARHSLPKVSYATAMDWFIAVCFAFVFSALIEFAAVNYFTNIQ MEKAKRKTSKPPQEVPAAPVQREKHPEAPLQNTNANLNMRKRTNALVHSESDVGNRTEVG NHSSKSSTVVQESSKGTPRSYLASSPNPFSRANAAETISAARALPSASPTSIRTGYMPRK ASVGSASTRHVFGSRLQRIKTTVNTIGATGKLSATPPPSAPPPSGSGTSKIDKYARILFP VTFGAFNMVYWVVYLSKDTMEKSESLM >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_4|1164_bp atggtttctgccaagaaggtacccgcgatcgctctgtccgccggggtcagtttcgccctc ctgcgcttcctgtgcctggcggtttgtttaaacgaatccccaggacagaaccaaaaggag gagaaattgtgcacagaaaatttcacccgcatcctggacagtttgctcgatggttatgac aacaggctgcgtcctggatttgggggtatgaacgatttcaaacgcgcaaatgcctatcca aagagtgagatgatctatacctggacaaaaggtcctgagaaatcagttgaagttccgaag gagtcttccagcttagttcaatatgatttgattgggcaaaccgtatcaagtgaaaccatc aaatcaattacgggaataacaactgtcctcaccatgaccacactaagcatcagtgcacga cattctttgcccaaagtgtcctatgctaccgccatggactggttcatagctgtctgcttt gcttttgtattttcggcccttatcgagtttgctgctgtcaactatttcaccaatattcaa atggaaaaagccaaaaggaagacatcaaagccccctcaggaagttcccgctgctccagtg cagagagagaagcatcctgaagcccctctgcagaatacaaatgccaatttgaacatgaga aaaagaacaaatgctttggttcactctgaatctgatgttggcaacagaactgaggtggga aaccattcaagcaaatcttccacagttgttcaagaatcttctaaaggcacacctcggtct tacttagcttccagtccaaacccattcagccgtgcaaatgcagctgaaaccatatctgca gcaagagcacttccatctgcttctcctacttctatccgaactggatatatgcctcgaaag gcttcagttggatctgcttctactcgtcacgtgtttggatcaagactgcagaggataaag accacagttaataccataggggctactgggaagttgtcagctactcctcctccatcggct ccaccaccttctggatctggcacaagtaaaatagacaaatatgcccgtattctctttcca gtcacatttggggcatttaacatggtttattgggttgtttatttatctaaggacactatg gagaaatcagaaagtctaatgtaa >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_5|65_aa MERRRREEEKGGGGGKRKKKEEEDEEEKKRRQRRKREKKKEEEEEEEEEEDDDKEEERRR KENLF >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_5|198_bp atggaaagaagaagaagagaagaagaaaaaggaggaggaggaggaaaaaggaagaagaaa gaggaggaggatgaggaggagaagaagaggaggcagaggaggaagagggagaagaagaag gaggaggaggaagaagaggaggaggaggaagatgacgacaaagaagaagaaagaagaaga aaagaaaacttattttaa >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_6|151_aa MSYVKETVDRLLKGYDIRLRPDFGGPPVDVGMRIDVASIDMVSEVNMVSGLPRGPAVRLT QMGNGQVPLPSAFHWRSPRPWPLRSHSAPAPRSGTHTRSPLVCNFDTHGLLRCSRGNVLG TILGRTSDCCAVDLLGRSLSVTLAGFPCVFG >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_6|456_bp atgtcatacgtgaaagagacagtggacagattgctcaaaggatatgacattcgcttgcgg ccggacttcggagggccccccgtcgacgttgggatgcggatcgatgtcgccagcatagac atggtctccgaagtgaatatggtgagtggcctcccgaggggcccggcggttcggcttacg cagatgggaaatggacaggtccctttgccctctgcgtttcattggcggtcacctcgcccc tggcccctgaggtcccactccgcacccgctccccgctccggcacacacacccggtcgccc ctggtttgtaatttcgacacacacgggctactgcggtgttccaggggaaacgtgctcggc actattttgggaaggacgagtgactgttgcgccgtggatctgctggggcggagtctgagc gttactttagctgggtttccctgcgtgtttggatga >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_7|62_aa MAEGTSSQSGRREKELRDKCLLFISFPVCDTLLYQLKWTDDDDDDDDDDDDDDDKGEYYA LK >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_7|189_bp atggcagagggcacctcttcacagagtggccggagagagaaggaactgagagataaatgt ctcttgtttataagctttccagtctgtgatactctgttatatcagctgaaatggactgat gatgatgatgatgatgatgatgatgatgatgatgatgatgataaaggagaatactatgct ttaaaatga >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_8|92_aa MGVFIDPDMRCSGGVPETEANFSGLERKCMERNWRLQALPIDLHRSPSVQHLLLFMKFLD PLDASEADPQSDGYIDRKMHNATVGEENERTE >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_8|279_bp atgggggtctttattgatcctgacatgaggtgttctggtggagtaccggaaacggaggct aatttcagtgggcttgaaaggaaatgcatggaaagaaattggagattgcaggcattaccc attgatcttcatcgcagtccttcagtccaacacctgctgctattcatgaagtttttagat ccattggatgcttcagaagctgatccgcaaagcgatggatacattgacaggaagatgcat aatgctacagtcggggaggaaaatgagagaactgaatag >gi568815594r:46828228_47093424|GENSCAN_predicted_peptide_9|346_aa MQAEIEGTLEIHEAFCETLVIPRWSPSGKMWHEMGKSSVSPLHHGTQHGSPHCEDKREYS NIVTLCSRIRRMSVTTPMEVSEAVLRYHAPAALSHCTASSKRAVLEEMRALCEKSSETCF WAPSHSLTQVERGPVPGSDEEHYGYLGYPVYDLQITCKSYKDFVYDLQIQPERWEAQNQV EIQTTIREYYKHLYANKLENLVEMEKFLDTYTLPRLNQEEIESLNRPITGSEIEAIINSL PTKKSPGPDGFTSEFYQRYKKELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRNTT KKENFRPISLMNIDTRILKKILANRIQQHIKKLIHHDQVGFIPGMQ >gi568815594r:46828228_47093424|GENSCAN_predicted_CDS_9|1041_bp atgcaggcagagattgaagggactttggaaatccacgaagcattttgtgagacgttggtt atacccaggtggagcccaagtggaaaaatgtggcatgaaatgggaaaaagcagtgtttcc ccacttcaccatggaacacagcatggttctccacactgtgaggataaaagggaatacagt aacattgtgacactttgcagcagaattaggagaatgagtgtgaccactcccatggaggta tcagaagcagtactgaggtaccacgctccagcagcccttagccactgcacagcaagcagt aagagggctgtcttggaagaaatgagagctctatgtgagaaatcaagtgagacatgcttt tgggctccaagccattctctgacacaggtggagaggggtcctgttcctggctctgatgag gaacactatgggtacctgggctaccctgtttatgacttacaaataacttgtaagtcttac aaagactttgtttatgacttacaaatacaacctgagagatgggaagctcagaatcaggta gaaatacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaat ctagtagaaatggagaaattccttgacacatacaccctcccaagattaaaccaggaagaa attgaatctctgaatagaccaataacaggctctgaaattgaggcaataattaatagctta ccaaccaaaaaaagtccaggaccagatggattcacatccgaattctaccagaggtacaag aaggagctggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcatcctgataccaaagcctggcagaaacacaaca aaaaaagagaattttagaccaatatccctgatgaacatcgacacaagaatcctcaagaaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaatga