GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:14:32 Sequence gi568815596r:39150621_39536987 : 386367 bp : 38.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 171 166 6 1.05 1.12 Term - 528 381 148 2 1 76 44 112 0.046 1.99 1.11 Intr - 27949 27912 38 1 2 101 58 29 0.121 -2.66 1.10 Intr - 28139 27975 165 2 0 105 62 24 0.214 0.74 1.09 Intr - 28701 28567 135 0 0 97 80 118 0.959 11.74 1.08 Intr - 34027 33971 57 1 0 103 75 70 0.823 5.36 1.07 Intr - 37089 37007 83 1 2 93 103 -10 0.776 -0.56 1.06 Intr - 39882 39685 198 1 0 81 52 184 0.766 12.50 1.05 Intr - 41328 41160 169 0 1 55 51 49 0.240 -3.50 1.04 Intr - 45836 45662 175 1 1 116 54 60 0.228 4.42 1.03 Intr - 46440 46332 109 1 1 102 105 3 0.324 1.82 1.02 Intr - 54024 53907 118 0 1 44 80 23 0.147 -3.78 1.01 Init - 55380 55318 63 0 0 64 93 87 0.304 8.00 1.00 Prom - 56287 56248 40 -5.75 2.04 PlyA - 57114 57109 6 1.05 2.03 Term - 68418 67953 466 2 1 5 37 237 0.493 3.80 2.02 Intr - 75340 75219 122 1 2 89 106 24 0.890 2.67 2.01 Init - 78912 78745 168 0 0 59 115 75 0.964 6.88 2.00 Prom - 86672 86633 40 -6.55 3.00 Prom + 87197 87236 40 -5.75 3.01 Init + 89050 89120 71 0 2 40 81 46 0.439 -0.33 3.02 Term + 93099 93561 463 0 1 124 55 533 0.705 47.14 3.03 PlyA + 94458 94463 6 1.05 4.28 PlyA - 95411 95406 6 1.05 4.27 Term - 106448 106198 251 0 2 73 45 163 0.989 5.38 4.26 Intr - 107820 107728 93 1 0 105 97 36 0.972 5.22 4.25 Intr - 110157 109986 172 0 1 105 72 90 0.886 7.69 4.24 Intr - 114686 114583 104 0 2 51 121 42 0.618 2.77 4.23 Intr - 121780 121663 118 1 1 102 116 -34 0.187 -0.08 4.22 Intr - 127866 127787 80 1 2 51 91 95 0.365 4.45 4.21 Intr - 137636 137501 136 2 1 75 63 56 0.396 1.02 4.20 Intr - 139714 139672 43 0 1 93 69 36 0.667 -0.48 4.19 Intr - 142206 142153 54 2 0 66 98 102 0.280 6.18 4.18 Intr - 149181 149123 59 0 2 50 108 41 0.023 -0.94 4.17 Intr - 158899 158841 59 2 2 80 89 39 0.069 0.88 4.16 Intr - 164768 164690 79 2 1 95 95 46 0.197 4.21 4.15 Intr - 175008 174898 111 0 0 28 75 128 0.486 5.16 4.14 Intr - 186961 186906 56 2 2 93 99 17 0.007 1.08 4.13 Intr - 192832 192768 65 0 2 106 107 -1 0.004 1.04 4.12 Intr - 204165 204087 79 1 1 75 75 74 0.010 2.49 4.11 Intr - 209231 208868 364 2 1 55 31 168 0.005 1.63 4.10 Intr - 209963 209791 173 0 2 24 11 160 0.018 0.64 4.09 Intr - 227503 227446 58 1 1 123 99 46 0.085 6.74 4.08 Intr - 253707 253608 100 2 1 66 62 76 0.265 1.99 4.07 Intr - 253874 253767 108 1 0 104 89 92 0.341 9.38 4.06 Intr - 262790 262702 89 2 2 54 67 32 0.012 -4.45 4.05 Intr - 280889 280794 96 2 0 74 86 105 0.044 8.19 4.04 Intr - 286328 286117 212 0 2 58 -6 283 0.111 13.61 4.03 Intr - 286533 286485 49 0 1 97 27 133 0.096 5.43 4.02 Intr - 318111 318007 105 0 0 56 14 122 0.052 1.09 4.01 Init - 325954 325871 84 1 0 76 100 90 0.985 9.87 4.00 Prom - 330910 330871 40 -5.35 5.11 PlyA - 331549 331544 6 1.05 5.10 Term - 336009 335918 92 1 2 124 44 55 0.297 1.60 5.09 Intr - 356645 356522 124 2 1 75 92 44 0.526 2.74 5.08 Intr - 361084 360743 342 1 0 -37 70 238 0.358 4.20 5.07 Intr - 361938 361830 109 2 1 67 48 123 0.466 5.57 5.06 Intr - 364559 364441 119 2 2 53 68 59 0.521 -1.26 5.05 Intr - 364879 364761 119 2 2 46 100 23 0.351 -1.44 5.04 Intr - 365789 365685 105 0 0 83 97 32 0.595 2.87 5.03 Intr - 366351 366230 122 2 2 -24 41 215 0.659 4.82 5.02 Intr - 367032 366877 156 2 0 50 107 109 0.927 7.20 5.01 Init - 375181 375024 158 1 2 80 87 104 0.859 8.93 5.00 Prom - 378160 378121 40 -3.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 234883 234843 41 1 2 78 115 36 0.943 5.01 S.002 Term - 318111 317995 117 0 0 56 42 135 0.823 3.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:39150621_39536987|GENSCAN_predicted_peptide_1|485_aa MDSDSIVGRLRSVADRSRLAPKNNQIMILQCIHRDIKPENILITKQGIIKICDFGFAQIL SWTSSFSGASLIGLIVDLLNSFSANSEIFLLAWIHCWDGASSEPNCIDCYFSSGSSHPEK LPGSGLVLGSVCKESRDVIHCQIFQLWIPAPALVECLWGYLTTLLRRIAHYLISASRAAF QLFLLCNAFVQLLIVQGPVMILGLASTHLEAVPGDAYTDYVATRWYRAPELLVGDTQYGS SVDIWAIGCVFAELLTGQPLWPGKSDVDQLYLIIRTLGKLIPRHQSIFKSNGFFHGISIP EPEDMETLEEKFSDVHPVALNFMKGCLKMNPDDRLTCSQLLESSYFDSFQEAQIKRKARN EGRNRRRQQQAPKSAFPRLFLKTKICQVQRNETQTSGNQILPNGPILQNSMVTVMTNINS AVYQKTLKLNRKLHGNRSEAVFGCCSSAPGCIENRFRVSSHSGTQAKGAAHMGNMPFSWQ RAQET >gi568815596r:39150621_39536987|GENSCAN_predicted_CDS_1|1458_bp atggattctgacagcatcgtgggccgtctcaggagtgttgctgatcgttccaggttagct ccgaaaaataatcaaataatgatcttacagtgtattcacagagatataaaacctgaaaat attctaataactaagcaaggaataatcaagatttgtgacttcgggtttgcacaaattctg agttggacttcatctttctctggtgcctccttgattggcttaatagttgaccttctgaat tctttttctgccaattcagagatttttctcctggcttggatccattgctgggatggggct tcctcagagccaaactgcattgattgttatttctcttctggatctagccacccagagaag ctaccaggctctgggctggtactaggaagtgtctgcaaagagtcccgtgatgtgatccat tgtcagatctttcagctatggataccagcacctgctctggtagagtgtctttggggctac ctgacgactttactaaggagaattgcccattacctcatctctgcaagtcgggctgctttc caattgttcctcctctgtaatgcatttgtacagctcctgatcgttcagggtcctgtgatg attctgggattggcttccacccacttggaggcagttccaggagatgcctacaccgattat gtagctacgagatggtaccgagctcctgaacttcttgtgggagatactcagtatggttct tcagtcgatatatgggctattggttgtgtttttgcagagctcctgacaggccagccactg tggcctggaaaatcagatgtggaccaactttatctgataatcagaacactaggaaaatta atcccaagacatcaatcaatctttaaaagtaacgggtttttccatggcatcagtatacct gagccagaagacatggaaactcttgaggaaaagttctcagatgttcatcctgtggctctg aacttcatgaaggggtgtctgaagatgaatccagatgacagattaacctgttcccaactc ctggagagctcctactttgattcttttcaagaggcccaaattaaaagaaaagcacgtaat gaaggaagaaacagaagacgccaacagcaggcacctaagtctgcctttcctaggcttttt ctcaaaaccaagatctgccaggtacaaaggaatgaaacccagacatctgggaaccaaatt ctacccaatgggccaatattgcagaacagtatggtgactgttatgacaaacattaattct gcagtttatcagaaaactttgaagttaaatcgtaaactccatggaaataggtcagaggct gtgtttggctgctgcagctctgctccaggctgcattgagaatcgatttcgagtgtcttct cattcagggacccaggccaaaggagcagctcatatgggaaatatgcccttctcatggcag agggcacaggaaacttga >gi568815596r:39150621_39536987|GENSCAN_predicted_peptide_2|251_aa MEKYEKLAKTGEGSYGVVFKCRNKTSGQVVAVKKFVESEDDPVVKKIALREIRMLKQLKH PNLVNLIEVFRRKRKMHLVFEYCDHTLLNELERNPNGPTIKKSYESLKKNKAQCGVRECP GGVLVKVLQRNKTSGIHRGTDNRKFIIGIDSCDYGGREVPQSAICKLENQESRWSNSGQV LRPQRTESFKVQRPESQELPHPRAGEDGRPSSRREKQPFPYRFVVSRPSTGLMMPACLPS PLWVGLLYAVY >gi568815596r:39150621_39536987|GENSCAN_predicted_CDS_2|756_bp atggaaaagtatgaaaaattagctaagactggagaagggtcttatggggttgtattcaaa tgcagaaacaaaacctctggacaagtagtagctgttaaaaaatttgtggaatctgaagat gatcctgttgttaagaaaatagcactaagagaaatacgtatgttgaagcaattaaaacat ccaaatcttgtgaacctcatcgaggtgttcaggagaaaaaggaaaatgcatttagttttt gaatactgtgatcatacacttttaaatgagctggaaagaaacccaaatggtccaacaata aagaagagttatgagtctttgaagaaaaataaagcacagtgtggagttagagaatgccca ggaggtgtattagtcaaggttcttcagagaaacaaaaccagtgggatacatagaggtact gataacaggaaatttattattggtattgactcatgtgattatggagggagagaagttcca caatctgccatctgcaagctagagaaccaggaaagccgatggtctaattcaggccaagtc ctaaggccccagaggacagagagtttcaaagtccaaaggcctgagagccaggagcttcca catccaagggcaggagaagatggacgtcccagctcaaggagagagaagcagcccttcccc taccgttttgttgtatccaggccttcaacgggtttgatgatgcctgcctgcctgccctca cctttgtgggtgggtcttctttacgcggtctactga >gi568815596r:39150621_39536987|GENSCAN_predicted_peptide_3|177_aa MTKIQRLTISSVDKMETGTHTNHWGHRSGRRSRFSFGELFLVPATGIRSLDLAAHPINPA RCPQFPRHRQNPHLLHPQGDGDEASSSAEQPQPQLPRVSLGGGAVYPATSEPAEPPQRLP GNRNARARGALRLPERVPLALAAPALILLRGDSVLAVLTALARSRRLLCLGSHFGGT >gi568815596r:39150621_39536987|GENSCAN_predicted_CDS_3|534_bp atgactaaaatccaaagactgacaatatcaagtgttgataagatggaaactggaactcac acaaatcactggggccaccgaagcggaaggcggtcccgcttttctttcggggaacttttc ttggttcctgcaactgggatccgcagtctagacctcgcggctcaccccatcaaccccgca cgctgcccgcagttcccccgccaccggcagaatccgcacttactgcacccacagggcgac ggtgacgaagcttcgagcagcgccgaacagccccagccccagctcccgcgcgtgtcgctc ggaggaggcgccgtgtacccagcgacttcggagccagcggaaccgccgcagcgtctcccg ggcaaccggaacgctagggcgcgcggggcactgcggcttccggagcgggttcccctggcg ctggcagccccagccctgatccttctgagaggtgacagcgtgctggcagtcctcacagcc ctcgctcggtctcgacgcctcctatgcctgggctcccactttggcggcacttga >gi568815596r:39150621_39536987|GENSCAN_predicted_peptide_4|998_aa MGAEGKTVSFSDSETTAVTSERAIAKRKESGPDPDPKRGFLDLAQERIQGKSTVQSKSKF IKKRREGSGGGGGGRSVEPGGLRADSAHRQRHLRRRLQGKGRGRAPPRPWDPDPQPSLGV RALAACPRETLAPGSSRGGGAADAKGSAALGYSSGMDPGFTGPEVDAVLGTLIKKTKLEL KKTSALIAPFPHNGGETLRGMVVLFLAPSLIRQRSLTHWPPPLQAHGEYCQATADVHLRP EGSSVSLCAGPEMLSKSQGLESGTPRVHLVLYPTVAELVPKARNVNTGELAAIKVIKLEP AREWTSTVNWYQEWGAAVKIPENLEATLELGNRQRLEQFGGLRRRKENVEKFGSSERLPR DLVLCILAAPAVAKRGQRIAQAIASEGASPKPGSFHVVLSSQVHRNQELRFENLWLDFQG YMEIPGCPDRSLLQGWSPRGSARAVQKGNMGLEPPHRVPTAALPSGTVRRWSLSSGSQNA FEKPTGPAVHGCNHSCTALIAPPWLRRDKLWICMEFCGGGSLQDIYHVTGPLSELQIAYV SRETLQHPFVTQHLTRSLAIELLDKVNNPDHSTYHDFDDDDPEPLVAVPHRIHSTSRNVR EEKTRSEITFGQVKFDPPLRKETEPHHELDLQLEYGQGHQGGYFLGANKGHVAHLEDDEG DDDESKHSTLKAKIPPPLPPKEMHSTEDENQGTIKRCPMSGSPAKPSQVPPRPPPPRLPP HKPVALDQYLIFGAEEGIYTLNLNELHETSMEQVKLLSFIPIIYQGFLIMQDKCKSYLLL FQHTNSLTEYCQVRNPYTGHKYLCGALQTSIVLLEWVEPMQKFMLIKHIDFPIPCPLRMF EMLVVPEQEYPLVCVGVSRGRDFNQVVRFETVNPNSTSSWFTESGCIKIVNLQGRLKSSR KLSSELTFDFQIESIECHSSNKQMIIISQFRGISDQDVKMESAFKDIARVKKQDQQQNLK AVYRSEANRPAGLQLQKYPTGDGEPQPSSLLVVPDNQA >gi568815596r:39150621_39536987|GENSCAN_predicted_CDS_4|2997_bp atgggggctgaaggaaaaacggtttcattcagtgactcggaaacaacagcagtgacttca gagagggcaattgccaagagaaaggaaagcggtcctgatccagaccccaagagagggttc ttggatcttgcgcaagagagaattcagggcaagtccacagtgcagagcaaaagcaaattt attaagaaacggcgtgaggggagcggcggcggcggcggcggccgcagcgtggagccggga ggacttcgagctgattcagcgcatcggcagcggcacctacggcgacgtctacaaggtaaa ggcaggggccgcgcgccgccgaggccgtgggatcccgacccgcagccaagcctgggcgtc cgagcccttgccgcctgtcctcgggagacgctggcgccagggagcagtcggggaggagga gctgcggacgccaagggctctgctgccttgggatacagttcagggatggatccaggtttt acgggacctgaagttgatgcagttttggggaccctcattaagaaaacaaaactagaatta aaaaagacttctgccttaattgccccctttcctcataatggtggagagacattaagggga atggtggttctgttcctagctccctcgttaattaggcagaggagcctcacgcattggcca ccaccactacaggcccatggggagtactgtcaggctaccgctgatgttcacttaagacct gagggttcttcagtcagcttgtgtgcaggtccagaaatgctgtccaagagccaaggccta gaatcagggactccaagagtccacttggtgctctaccccactgtggctgagctggtacct aaggcacggaatgttaacactggtgaattagcagcaattaaagtaataaaattggaacca gcacgagaatggactagtacagtaaattggtaccaggagtggggtgctgctgtaaagata cctgaaaatctggaagcaactttggaattgggtaacaggcagaggttggaacagtttgga gggctcagaagaagaaaagaaaatgtggaaaagtttggaagttccgagagacttcctagg gacttggtgctctgcatcctagccgctccagccgtggctaaaaggggccaacgtatagct caggccattgcctcagagggtgcaagccccaagcctggcagcttccacgtggtattgagc tcacaggtgcacagaaaccaagaactgaggtttgagaacctctggctagatttccaagga tatatggaaatacctggatgtccagacagaagtttgctgcaagggtggagccctcgtggt tctgctagggcagtgcagaagggaaacatggggttggagcccccacacagagttcccact gcagcactgcctagtggaactgtgagaaggtggtcactgtcctctggatctcagaatgct tttgaaaaaccaactggacctgctgttcatggttgtaaccactcttgtacagctttgatt gccccaccttggttaaggcgagataagctttggatttgcatggagttttgtggaggtggt tctttacaggatatttatcacgtaactggacctctgtcagaactgcaaattgcatatgtt agcagagaaacactgcagcatccttttgtaacacaacatttgacacggtctttggcaatc gagctgttggataaagtaaataatccagatcattccacttaccatgatttcgatgatgat gatcctgagcctcttgttgctgtaccacatagaattcactcaacaagtagaaacgtgaga gaagaaaaaacacgctcagagataacctttggccaagtgaaatttgatccacccttaaga aaggagacagaaccacatcatgaacttgatctgcaactggaatatggacaaggacaccaa ggtggttactttttaggtgcaaacaaaggacacgtcgcacatttagaagatgatgaagga gatgatgatgaatctaaacactcaactctgaaagcaaaaattccacctcctttgccacca aaggaaatgcattctactgaggatgaaaatcaaggaacaatcaagagatgtcccatgtca gggagcccagcaaagccatcccaagttccacctagaccaccacctcccagattaccccca cacaaacctgttgccttagatcagtacttgatatttggtgccgaagaagggatttatacc ctcaatcttaatgaacttcatgaaacatcaatggaacaggtaaagcttctcagctttatt cccataatttaccagggctttttgattatgcaagacaaatgcaaaagttacctgttgcta ttccagcacacaaactccctgacagaatactgccaagtaagaaatccttacacgggccat aaatacctatgtggagcacttcagactagcattgttctattagaatgggttgaaccaatg cagaaatttatgttaattaagcacatagattttcctataccatgtccacttagaatgttt gaaatgctggtagttcctgaacaggagtaccctttagtttgtgttggtgtcagtagaggt agagacttcaaccaagtggttcgatttgagacggtcaatccaaattctacctcttcatgg tttacagaatcaggttgtataaaaatagtaaatctccaaggaagattaaaatctagcagg aaattgtcatcagaactcacctttgatttccagattgaatcaatagaatgtcattcatcc aacaaacaaatgattataataagtcagtttcgagggatatctgatcaagatgttaaaatg gaatccgctttcaaagacatagccagggtcaaaaagcaagatcaacaacagaatctcaaa gcagtatacaggtctgaagctaacaggcctgctgggctccagttacaaaagtatcccaca ggagatggagaaccacagccaagttcattgcttgtggtacctgataatcaggcatga >gi568815596r:39150621_39536987|GENSCAN_predicted_peptide_5|481_aa MRQVELPGQKVVTNGENCVDIMQILMESEAVGVKDSVVLPRFRVPSEVFVRSWYSLNSKF LSNYLVGFGGEQLAGYLFEAGVPEATLLCIDISWHTAFSKKADGRLESSSSSAADEANQL VGADSQRPKDFPGATPLWKFEDVGTENRKGKNRLGRAKMMFYQGELFLTSSPFRDVTSFE GVNQIDLFKLHQHISHSQKWSAVTFTTIVPNHFRKFWGKENLIARDCQGARNLHFYQAHR LLLVQVVCGSDFGTLCYKEFSAAFQITRGIIECNCLWRVGGRIGHKFIGRLKEKRKIPVL DAFQEKAKSEQPEQLWEIQVCALLGRMRTGGSAGPELQTKSMVCSLGGDQRTQATGSGAQ RASMSLQGSACGQEDRSELPASAYGKLDMYLPVSPREGVLCQLELSQERQWSHLLIENPG THWLLPNPRLYFHCNFRANLPPYGVCCIPEVVHIKRETRQPERSSKMLGGEGLSPTKYST I >gi568815596r:39150621_39536987|GENSCAN_predicted_CDS_5|1446_bp atgaggcaggtagaattgcctggtcaaaaggtggtcacaaatggtgagaactgtgttgac attatgcaaatactgatggagagtgaagcagttggggtcaaagactccgtggtgctcccc aggtttagggttccgagtgaagtatttgtgagaagttggtattcactcaactccaagttc ctttccaattacctggtggggtttggtggggagcagttagctggatacctgtttgaagca ggtgtcccagaagccaccttgctgtgcatagacatcagctggcatacagccttcagtaag aaggctgatggaaggctggagtccagcagcagttcagctgcagatgaagcaaatcagctg gttggtgctgacagccagagacccaaggactttcctggtgctacacctctctggaagttt gaagatgttgggacagaaaacagaaaaggaaagaatagattgggaagggcaaaaatgatg ttttatcaaggagagctctttctcaccagctccccattcagagatgtaaccagctttgaa ggcgtcaatcaaattgaccttttcaaattgcaccaacacatctcgcattcacagaaatgg tctgctgtgaccttcacaacaattgttcctaaccactttaggaagttctgggggaaggaa aatctgatagcacgggactgtcagggagccaggaatctgcatttttatcaagcccaccgg ctgctactagtacaggtggtctgtggctcagactttggaacactgtgctataaggaattc tcggccgccttccaaatcaccagaggcatcatagaatgcaactgtctgtggagggttgga ggacgaattggacacaaattcattggacggctgaaggaaaaaagaaagattcccgtcctg gatgctttccaagaaaaagcaaaaagtgagcaaccagagcagttatgggaaattcaagtg tgtgctttgttgggaaggatgagaacggggggttccgctggcccagagctgcaaacgaaa tcaatggtctgcagcttgggaggtgaccagaggacccaggccactggatcaggcgctcag agagccagcatgagccttcaaggcagtgcttgtgggcaggaggacaggtctgagcttcct gccagtgcctatggcaagttggatatgtatttgccagtatccccaagggaaggggttctg tgccagctggagctctctcaggaaagacagtggagccacctgctcatagagaaccctggc acccattggcttcttcctaatcctagattgtacttccattgcaacttcagagccaattta ccaccctatggggtttgctgtattccagaagtggtacacattaaaagggaaacacgtcag ccagagagaagcagcaaaatgcttgggggtgaggggttgtcccccaccaagtacagcacc atctag