GENSCAN 1.0	Date run: 7-Nov-116	Time: 01:57:31

Sequence gi568815591r:134342731_134559062 : 216332 bp : 43.62% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   6922   6961   40                              -2.46
 1.01 Init +   9682   9698   17  0  2   97   80    21 0.497   2.03
 1.02 Intr +  18388  18541  154  1  1   98   80    63 0.434   6.57
 1.03 Intr +  28158  28276  119  2  2   47   80    60 0.061   0.36
 1.04 Intr +  40767  40864   98  0  2   42   53    61 0.016  -2.35
 1.05 Intr +  41366  41461   96  0  0  109   32    64 0.533   2.88
 1.06 Term +  42473  42489   17  0  2  145   53    -1 0.380   0.50
 1.07 PlyA +  44192  44197    6                               1.05

 2.14 PlyA -  44853  44848    6                               1.05
 2.13 Term -  46850  46779   72  2  0  112   43    33 0.046  -0.89
 2.12 Intr -  50775  50704   72  0  0  105   63    44 0.496   3.20
 2.11 Intr -  81304  81172  133  0  1  139  105   -31 0.041   4.35
 2.10 Intr - 102590 102508   83  2  2   89  105    65 0.914   6.74
 2.09 Intr - 104651 104568   84  2  0   86  116   126 0.994  15.22
 2.08 Intr - 105331 105250   82  0  1  112   92    82 0.977  10.64
 2.07 Intr - 105763 105657  107  1  2  105   93   101 0.993  11.31
 2.06 Intr - 106389 106267  123  0  0   79   97   150 0.996  15.78
 2.05 Intr - 107067 106990   78  0  0   38   95    89 0.952   4.35
 2.04 Intr - 108172 108056  117  1  0   74   86   170 0.997  16.06
 2.03 Intr - 109013 108856  158  0  2   95   69   340 0.561  32.53
 2.02 Intr - 116054 115850  205  2  1   23   70   105 0.341   0.97
 2.01 Init - 116332 116267   66  1  0  103  100    62 0.931  10.29
 2.00 Prom - 119548 119509   40                              -3.86

 3.00 Prom + 124400 124439   40                              -2.36
 3.01 Init + 133030 133091   62  0  2  114   92     6 0.222   4.32
 3.02 Intr + 149295 149491  197  0  2  114   14   125 0.289   6.76
 3.03 Term + 159455 159612  158  0  2   57   48    83 0.516  -0.70
 3.04 PlyA + 159614 159619    6                               1.05

 4.00 Prom + 179825 179864   40                              -2.26
 4.01 Init + 185182 185247   66  0  0  107   94    83 0.915  11.88
 4.02 Intr + 187913 188080  168  1  0   80   51   359 0.987  31.54
 4.03 Intr + 189178 189294  117  0  0   79    9   111 0.864   2.96
 4.04 Intr + 190274 190351   78  1  0   78   92    89 0.633   8.05
 4.05 Intr + 193920 194042  123  2  0   53   99   163 0.934  14.68
 4.06 Intr + 194321 194427  107  1  2  100   73   142 0.929  12.91
 4.07 Intr + 194850 194931   82  0  1   96   50    60 0.995   2.64
 4.08 Intr + 195464 195547   84  1  0   62  111   100 0.963   9.72
 4.09 Intr + 196205 196287   83  1  2  116   92    44 0.895   6.04
 4.10 Intr + 206382 206519  138  0  0   65  115    16 0.133   1.58
 4.11 Intr + 208771 209030  260  1  2   41   55   148 0.566   3.91
 4.12 Intr + 209664 209897  234  1  0   81   87    88 0.433   5.66
 4.13 Intr + 210354 210442   89  1  2   56   88    51 0.299   1.59
 4.14 Intr + 211592 211660   69  1  0   30   81    74 0.406   0.28
 4.15 Term + 212542 212760  219  0  0   86   48   106 0.514   3.44
 4.16 PlyA + 215102 215107    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:134342731_134559062|GENSCAN_predicted_peptide_1|166_aa
MSKDAGVSDFRRTVTHAGGLSPGESGDPVILRIIPRGKPSFSKAEFLNGGTTDILDQEWI
PVWGQRKPGMGQRETGAGGDYHHPLWEMSAKLQVTSRKSTRDADTASMFVPPSYMLEFDP
QCWKWDLMGGQQMSSLFLVTVLGVEDTEREKTSLPSWSSQAGLPIP

>gi568815591r:134342731_134559062|GENSCAN_predicted_CDS_1|501_bp
atgagcaaggatgctggtgtctcagatttcaggagaactgtgacccatgcagggggtttg
agtcccggtgaaagtggagaccctgtcatcctgagaatcatccccagggggaaaccatct
ttttctaaggcggaatttctcaacggtggaactactgacattttggaccaggaatggatc
ccagtatggggccagaggaaacctggtatggggcagagggaaacaggtgctggaggtgac
taccatcaccctctatgggaaatgtcggccaagctgcaggtgacctccagaaagagtaca
agggatgctgatacagcttctatgtttgtccctccaagctacatgttggaatttgatccc
cagtgttggaagtgggacctaatgggaggtcaacagatgtcctctctgttcctggtcacc
gtgctaggtgtggaggacacagagagggagaagacctctctgccttcttggagctcacaa
gccggccttcctattccctga

>gi568815591r:134342731_134559062|GENSCAN_predicted_peptide_2|459_aa
MASRLLLNNGAKMPILGLGTWKGSILRVGFGSSSREPPPYRGQPLMGGPPVRILGPSGRP
KRHNARGRRGKWASQTGGPRAQTGTWSRRQGQVTEAVKVAIDVGYRHIDCAHVYQNENEV
GVAIQEKLREQVVKREELFIVSKLWCTYHEKGLVKGACQKTLSDLKLDYLDLYLIHWPTG
FKPGKEFFPLDESGNVVPSDTNILDTWAAMEELVDEGLVKAIGISNFNHLQVEMILNKPG
LKYKPAVNQIECHPYLTQEKLIQYCQSKGIVVTAYSPLGSPDRPWAKPEDPSLLEDPRIK
AIAAKHNKTTAQVLIRFPMQRNLVVIPKSVTPERIAENFKVFDFELSSQDMTTLLSYNRN
WRVCALLRVKLWTCHSSTMPGPLHPLCLGSGCVPHLGNPSLALHKYFLNGALNSFDYVLE
TLAPENSHASKELTTQICFILTIFLFQNKKAVEVCEFAY

>gi568815591r:134342731_134559062|GENSCAN_predicted_CDS_2|1380_bp
atggcaagccgtctcctgctcaacaacggcgccaagatgcccatcctggggttgggtacc
tggaagggcagcatcctgcgagtggggtttgggagcagctcacgggagcccccgccctac
cgcgggcaacccttgatgggcggcccaccagtccgcattttgggtcctagcgggcgcccc
aagcggcacaacgcgagagggaggcggggaaagtgggcttcacagaccggtggacctcgg
gcgcagacagggacgtggagccgtcggcaagggcaggtgactgaggccgtgaaggtggcc
attgacgtcgggtaccgccacatcgactgtgcccatgtgtaccagaatgagaatgaggtg
ggggtggccattcaggagaagctcagggagcaggtggtgaagcgtgaggagctcttcatc
gtcagcaagctgtggtgcacgtaccatgagaagggcctggtgaaaggagcctgccagaag
acactcagcgacctgaagctggactacctggacctctaccttattcactggccgactggc
tttaagcctgggaaggaatttttcccattggatgagtcgggcaatgtggttcccagtgac
accaacattctggacacgtgggcggccatggaagagctggtggatgaagggctggtgaaa
gctattggcatctccaacttcaaccatctccaggtggagatgatcttaaacaaacctggc
ttgaagtataagcctgcagttaaccagattgagtgccacccatatctcactcaggagaag
ttaatccagtactgccagtccaaaggcatcgtggtgaccgcctacagccccctcggctct
cctgacaggccctgggccaagcccgaggacccttctctcctggaggatcccaggatcaag
gcgatcgcagccaagcacaataaaactacagcccaggtcctgatccggttccccatgcag
aggaacttggtggtgatccccaagtctgtgacaccagaacgcattgctgagaactttaag
gtctttgactttgaactgagcagccaggatatgaccaccttactcagctacaacaggaac
tggagggtctgtgccttgttgagagtcaaactgtggacttgccattcctccaccatgcca
gggcctctacatcccctgtgccttggctcaggctgtgtcccccacttgggcaacccaagc
ctggcactccacaagtacttcctcaatggggctctgaatagttttgactatgtcttagaa
accttggcaccagagaacagccatgcttcaaaggaattgactacacaaatttgttttatt
cttactattttcctcttccaaaataagaaagctgttgaagtctgtgagtttgcctattag

>gi568815591r:134342731_134559062|GENSCAN_predicted_peptide_3|138_aa
MPSQHMVFYGYLASVSPEPTYFSLIFTTYSSQNHFTDFFQELAKADWPMQAGGGWNFIPD
FVEPKCHCHLPTPYIYELNAASAAIQGKLTHVYPSAYVYIDVHSTMASNSESLETTQMPL
SKRLDKYIAVHIHNGLSR

>gi568815591r:134342731_134559062|GENSCAN_predicted_CDS_3|417_bp
atgcccagccagcatatggtattttatggctatttggcatcagttagtccagagcccaca
tatttcagcctcatcttcaccacctacagctctcagaaccacttcaccgatttctttcag
gaattagctaaagcagactggcccatgcaggccggtggaggctggaatttcataccagac
tttgtagaacctaagtgccactgccaccttccgactccttacatttatgaattaaatgcg
gcatcagcagccattcaagggaaacttacacatgtgtatcctagtgcctatgtgtacata
gatgttcacagtaccatggctagtaacagcgaaagcctagaaaccactcagatgcctctc
agcaagagattggataagtacattgcagtacatatacacaacggactatcaagatga

>gi568815591r:134342731_134559062|GENSCAN_predicted_peptide_4|638_aa
MATFVELSTKAKMPIVGLGTWKSPLGKVKEAVKVAIDAGYRHIDCAYVYQNEHEVGEAIQ
EKIQEKAVKREDLFIVSKLWPTFFERPLVRKAFEKTLKDLKLSYLDVYLIHWPQGFKSGD
DLFPKDDKGNAIGGKATFLDAWEAMEELVDEGLVKALGVSNFSHFQIEKLLNKPGLKYKP
VTNQVECHPYLTQEKLIQYCHSKGITVTAYSPLGSPDRPWAKPEDPSLLEDPKIKEIAAK
HKKTAAQVLIRFHIQRNVIVIPKSVTPARIVENIQVFDFKLSDEEMATILSFNRNWRACN
VLQQTAGLQGHLKTFQGSTGHCLLEGNGSTAGAPSQIQLLSPQSLVPGREGPYLKRSQKV
FPYISQLLEIAQRVFDNREFEKQKQVAQAAERAADKASKRQAKSLVAAIQEAKKEGPPSQ
STDQGTPSPHQEGQKRLANHHLPAVVELLATTLPVQVKQYPMILWAIEGINPHIQQLLQA
GILTPCEIAWNIPFLPVQKPGTNDYQPVQDLQEGYETNEGKRALTSARKAAILQIPTPTT
KRQTTQNTPQRKKNKLRIFKPVKIRKLGDLVYVKKFQKKGLTPAWKGPHTVILTTPTALK
VDGTPAWIHHSRIKKANKAQQKKWVPKPRPGPLKLRLS

>gi568815591r:134342731_134559062|GENSCAN_predicted_CDS_4|1917_bp
atggccacgtttgtggagctcagtaccaaagccaagatgcccattgtgggcctgggcact
tggaagtctcctcttggcaaagtgaaagaagcagtgaaggtggccattgatgcaggatat
cggcacattgactgtgcctatgtctatcagaatgaacatgaagtgggggaagccatccaa
gagaagatccaagagaaggctgtgaagcgggaggacctgttcatcgtcagcaagttgtgg
cccactttctttgagagaccccttgtgaggaaagcctttgagaagaccctcaaggacctg
aagctgagctatctggacgtctatcttattcactggccacagggattcaagtctggggat
gaccttttccccaaagatgataaaggtaatgccatcggtggaaaagcaacgttcttggat
gcctgggaggccatggaggagctggtggatgaggggctggtgaaagcccttggggtctcc
aatttcagccacttccagatcgagaagctcttgaacaaacctggactgaaatataaacca
gtgactaaccaggttgagtgtcacccatacctcacacaggagaaactgatccagtactgc
cactccaagggcatcaccgttacggcctacagccccctgggctctccggatagaccttgg
gccaagccagaagacccttccctgctggaggatcccaagattaaggagattgctgcaaag
cacaaaaaaaccgcagcccaggttctgatccgtttccatatccagaggaatgtgattgtc
atccccaagtctgtgacaccagcacgcattgttgagaacattcaggtctttgactttaaa
ttgagtgatgaggagatggcaaccatactcagcttcaacagaaactggagggcctgtaac
gtgttgcaacagacagctggcttacagggccacctgaagacgttccagggctccacaggc
cactgtcttctggaggggaacggatcgactgccggtgcgcccagccaaattcaactcctg
agtcctcagtctctagtcccgggaagagagggaccatatctgaagagaagccagaaagta
tttccttacatcagccagcttttagagatagcccagagagttttcgacaatcgagaattt
gaaaagcaaaaacaggtagctcaggcagctgaaagggctgcagacaaagcatcaaaaaga
caggcaaaaagcttagtggctgccatccaagaagccaaaaaggaagggcccccatcacag
agcactgaccaggggaccccgagtccccaccaggaaggccagaaacgactggcaaaccac
catctgccagctgttgtagaactcctggccaccaccctgccggtccaggtcaaacaatat
cctatgattctgtgggctatagagggaattaatcctcatattcagcaactattacaagct
ggtatactcacaccatgtgagatcgcctggaatattccatttttgccggtccagaaaccc
ggaacaaatgattaccagcctgtacaggacttgcaggaagggtatgagacaaatgaagga
aaaagggcactcaccagtgctcgaaaggcagccatccttcaaatccccactcccaccact
aagagacagactacccagaatactccccagaggaagaaaaacaagcttcggatcttcaag
ccagtaaaaatcaggaagctgggtgacctagtgtatgttaaaaagttccagaaaaaagga
ctgactcctgcctggaaaggacctcatactgtcatcctcaccacaccaacagctctgaag
gtggacggcactcctgcttggattcatcattctcgcatcaaaaaggccaacaaagcccag
caaaaaaaatgggtccccaagcctaggccaggccccttaaaactgcgcctaagttga