GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:43:14 Sequence gi568815596r:39079169_39329532 : 250364 bp : 39.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1184 956 229 2 1 42 101 129 0.016 6.12 1.02 Intr - 3266 3209 58 1 1 41 95 49 0.002 -1.13 1.01 Init - 12704 12454 251 2 2 64 48 175 0.015 8.08 1.00 Prom - 21431 21392 40 -6.35 2.09 PlyA - 22391 22386 6 1.05 2.08 Term - 28487 28404 84 2 0 102 48 81 0.896 2.17 2.07 Intr - 33539 33384 156 2 0 64 78 51 0.580 0.99 2.06 Intr - 38323 38170 154 0 1 83 94 -18 0.053 -2.55 2.05 Intr - 40896 40749 148 1 1 63 69 125 0.244 6.47 2.04 Intr - 41043 40923 121 0 1 42 53 93 0.886 0.35 2.03 Intr - 41272 41168 105 1 0 93 100 139 0.572 15.09 2.02 Intr - 41951 41446 506 0 2 79 63 216 0.494 9.97 2.01 Init - 45561 45138 424 0 1 78 40 141 0.333 4.85 2.00 Prom - 48852 48813 40 -5.85 3.03 PlyA - 49647 49642 6 1.05 3.02 Term - 52572 52533 40 2 1 102 55 91 0.547 2.88 3.01 Init - 54472 54378 95 1 2 103 82 52 0.578 6.00 3.00 Prom - 60943 60904 40 -4.55 4.15 PlyA - 60990 60985 6 1.05 4.14 Term - 71348 71279 70 1 1 96 41 97 0.646 2.23 4.13 Intr - 71980 71803 178 2 1 76 54 62 0.531 -0.34 4.12 Intr - 82482 82459 24 1 0 110 83 24 0.024 1.18 4.11 Intr - 92696 92660 37 2 1 49 101 35 0.036 -2.18 4.10 Intr - 99591 99427 165 0 0 105 62 24 0.216 0.74 4.09 Intr - 100153 100019 135 1 0 97 80 118 0.959 11.74 4.08 Intr - 105479 105423 57 2 0 103 75 70 0.823 5.36 4.07 Intr - 108541 108459 83 2 2 93 103 -10 0.776 -0.56 4.06 Intr - 111334 111137 198 2 0 81 52 184 0.766 12.50 4.05 Intr - 112780 112612 169 1 1 55 51 49 0.240 -3.50 4.04 Intr - 117288 117114 175 2 1 116 54 60 0.228 4.42 4.03 Intr - 117892 117784 109 2 1 102 105 3 0.324 1.82 4.02 Intr - 125476 125359 118 1 1 44 80 23 0.147 -3.78 4.01 Init - 126832 126770 63 1 0 64 93 87 0.304 8.00 4.00 Prom - 127739 127700 40 -5.75 5.04 PlyA - 128566 128561 6 1.05 5.03 Term - 139870 139405 466 0 1 5 37 237 0.493 3.80 5.02 Intr - 146792 146671 122 2 2 89 106 24 0.890 2.67 5.01 Init - 150364 150197 168 1 0 59 115 75 0.964 6.88 5.00 Prom - 158124 158085 40 -6.55 6.00 Prom + 158649 158688 40 -5.75 6.01 Init + 160502 160572 71 1 2 40 81 46 0.439 -0.33 6.02 Term + 164551 165013 463 1 1 124 55 533 0.705 47.14 6.03 PlyA + 165910 165915 6 1.05 7.14 PlyA - 166863 166858 6 1.05 7.13 Term - 177900 177650 251 1 2 73 45 163 0.989 5.38 7.12 Intr - 179272 179180 93 2 0 105 97 36 0.972 5.22 7.11 Intr - 181609 181438 172 1 1 105 72 90 0.886 7.69 7.10 Intr - 186138 186035 104 1 2 51 121 42 0.618 2.77 7.09 Intr - 193232 193115 118 2 1 102 116 -34 0.187 -0.08 7.08 Intr - 199318 199239 80 2 2 51 91 95 0.365 4.45 7.07 Intr - 209088 208953 136 0 1 75 63 56 0.397 1.02 7.06 Intr - 211166 211124 43 1 1 93 69 36 0.668 -0.48 7.05 Intr - 213658 213605 54 0 0 66 98 102 0.282 6.18 7.04 Intr - 220633 220575 59 1 2 50 108 41 0.027 -0.94 7.03 Intr - 230351 230293 59 0 2 80 89 39 0.091 0.88 7.02 Intr - 236220 236142 79 0 1 95 95 46 0.266 4.21 7.01 Intr - 246460 246350 111 1 0 28 75 128 0.672 5.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 12704 12450 255 2 0 64 55 182 0.889 7.36 S.002 Init - 34591 34529 63 1 0 37 121 62 0.851 5.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_1|180_aa MVTWKLNEEGVARKWKWSTVILPRGQMRCGQGFSPGRIIEWHPLVTMTALGGEAWGAGIE KVTVEILLRNLPLKGKECGEHWGINVEEKEKMQYHKANNRASDPLTIIRITAKTPPLFRS VEASDSQEPVVNCACEASRLRTPYENLMPDDLSLSPIPPHMGPSSCRKTSTGLPLIVHYX >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_1|540_bp atggtgacttggaagttaaatgaagaaggagtggcaagaaagtggaagtggtcaactgtc atactgccaagaggtcagatgagatgcgggcagggatttagtcctggccgtattatagaa tggcatcccctggtgaccatgacagccctgggtggagaagcctggggggctggtatagaa aaggtgactgtggagatccttttgaggaatctgcctctgaaaggcaaagaatgtggtgaa cactgggggataaatgtagaggaaaaggaaaagatgcagtaccacaaagctaacaataga gcaagtgatccactcaccatcattcgcattaccgccaaaactccgcctcttttcagatca gtggaggcatcagattctcaggagcctgttgtgaactgtgcatgtgaggcatctaggttg cgtactccttatgagaatctaatgcctgatgatctgtcactgtctcccattcccccccat atgggaccatctagttgcagaaaaacaagcacagggctcccattgattgtacattatgnn >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_2|565_aa MDTRLRTLEFILGRQCEFLNRKVSSWVRSSRQKDHCGCSKAENQVAASWPAGQAKARSVA LAPAGGTLKGQTPTPAPVLAALLRCAALGTRLARLDAPLSVFAYSARTRLVRHVRRGVRL LRAEEGSEKKPLRLRQGAPVGAFLLLHREPENPERTSPARQRMLSSEQVQFASRKLQRYT GGGGGGGGRGGTSGQPDSPTTTSGPSPSPRDLRPSSAPPGALAFRSPNYAERAPRPERPS TRPPPAASSAAGPDRARECSPGGRAAAAAEGSPGRRRRAGEGRGAWPRGGGAGCRRLRAA SVLHGWYLCRRRPGGTMQAQQLPYEFFSEENAPKWRGLLVPALKKDGAHGEAPLIRLALG VDIHRVCGNAAGSRGEFIRIIQPLRGQQQLEEPQFSAEVKAGHFSVFLVLARSCCQPTVL GVQYAGFSGLRMPWPQSGLLLFLNMAGMSHCICFTASMSLSPILSIAQIYHAYYRPGIAL DTQAKKSEEKESSKDFRRNNEVWELLLWSVEEPTSALERNCKPGAVAHACNSSTWEPECD ASRGAFVIVLVQDLLEADAKMGLST >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_2|1698_bp atggataccaggttaaggactttggaatttatcctgggacgccagtgtgagttcttgaac agaaaagtaagtagctgggtccgaagttcgcgtcagaaggatcactgtggctgcagcaag gcagagaaccaggtggccgcgtcctggccagcagggcaggcaaaggcgcggagcgttgca cttgcacctgcagggggcacactgaagggccaaacgcccacccccgcccccgtcctcgcc gccctactccgatgcgccgcgctcggaactcggctagcccgcctggacgcgcccctctcg gtcttcgcctattccgcccggacgcgcctcgtgcgtcacgtccgccgtggcgtgaggctc cttagagctgaggagggaagcgaaaagaagcctctgcgcctgcgccagggcgcacccgtc ggcgctttcctcctccttcaccgggagccggaaaacccggagcgaaccagcccagcacgg cagcgcatgctcagtagcgagcaggttcagttcgctagtaggaagctccagcgctacacc ggcggcggcggtggcggcggcggccgcggcggcacctcagggcagccagattccccaaca acgacctccggtccttccccttccccgcgagacctccgccccagctccgccccgcccggg gctcttgcgtttcggagtcccaactacgcggagcgggcgccgcggccggagcgtccctcc acacgtcctcccccagccgccagctccgccgcggggccagaccgggcccgggagtgcagc cccggcgggcgggcggcggccgcggcagagggcagcccggggcggcgccggcgggccggg gaggggcggggggcctggccccggggcggtggtgcgggatgccgccgcctccgggccgcg tccgttctccacggctggtacctgtgtcggaggcgccccgggggcaccatgcaggcgcag cagctgccctacgagtttttcagcgaagagaacgcgcccaagtggcggggactactggtg cctgcgctgaaaaaggacggcgcacacggagaggcccccctcatacggctggctctcggt gttgacatccaccgggtgtgtgggaatgccgcagggagccggggggagtttatccggata atccaacccctgagaggacaacaacagttggaggagcctcagttttcagcggaggtcaaa gcaggccacttttccgtcttcttggtcttagcccggagctgctgtcagcctaccgtttta ggtgtccagtacgctggttttagtggcttgagaatgccctggccccagtccggactgcta cttttcctgaatatggctggaatgtcccactgtatctgtttcaccgcatctatgagtcta agtcctatcttatctattgcacaaatatatcatgcctactatagaccaggtattgccctt gacactcaggctaaaaagagtgaagaaaaggaaagtagtaaggactttagaaggaataat gaagtctgggaattgctgctgtggagtgtagaggagccaaccagtgccttagaaaggaat tgcaagccaggtgcggtggctcatgcctgcaattccagcacttgggagcctgagtgtgat gcttcaaggggagcctttgtcattgtattagtccaggacctcttagaagcagatgccaag atgggattaagcacataa >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_3|44_aa MRELQVDDFDLLNKVKVKVRNKRQVEIDGEKRPSVNMTVTAHIG >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_3|135_bp atgagggagctccaggttgatgattttgatttactcaataaagtaaaagtcaaggtcagg aacaaaaggcaggtggaaatagatggggaaaaaaggccctcggtgaacatgacggtgact gcccacattggatga >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_4|526_aa MDSDSIVGRLRSVADRSRLAPKNNQIMILQCIHRDIKPENILITKQGIIKICDFGFAQIL SWTSSFSGASLIGLIVDLLNSFSANSEIFLLAWIHCWDGASSEPNCIDCYFSSGSSHPEK LPGSGLVLGSVCKESRDVIHCQIFQLWIPAPALVECLWGYLTTLLRRIAHYLISASRAAF QLFLLCNAFVQLLIVQGPVMILGLASTHLEAVPGDAYTDYVATRWYRAPELLVGDTQYGS SVDIWAIGCVFAELLTGQPLWPGKSDVDQLYLIIRTLGKLIPRHQSIFKSNGFFHGISIP EPEDMETLEEKFSDVHPVALNFMKGCLKMNPDDRLTCSQLLESSYFDSFQEAQIKRKARN EGRNRRRQQQAPKSAFPRLFLKTKICQVQRNETQTSGNQILPNGPILQNSMVTVMTNINS AVYQFLPDAVQILRHHGLLQIELRGQRLCLAAAALLQAALRIDFECLLIQGPRPKEQLIW EICPSHGRGHRKLEIPLKCSVQECSWNLGIDILELEQEVGDLTEED >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_4|1581_bp atggattctgacagcatcgtgggccgtctcaggagtgttgctgatcgttccaggttagct ccgaaaaataatcaaataatgatcttacagtgtattcacagagatataaaacctgaaaat attctaataactaagcaaggaataatcaagatttgtgacttcgggtttgcacaaattctg agttggacttcatctttctctggtgcctccttgattggcttaatagttgaccttctgaat tctttttctgccaattcagagatttttctcctggcttggatccattgctgggatggggct tcctcagagccaaactgcattgattgttatttctcttctggatctagccacccagagaag ctaccaggctctgggctggtactaggaagtgtctgcaaagagtcccgtgatgtgatccat tgtcagatctttcagctatggataccagcacctgctctggtagagtgtctttggggctac ctgacgactttactaaggagaattgcccattacctcatctctgcaagtcgggctgctttc caattgttcctcctctgtaatgcatttgtacagctcctgatcgttcagggtcctgtgatg attctgggattggcttccacccacttggaggcagttccaggagatgcctacaccgattat gtagctacgagatggtaccgagctcctgaacttcttgtgggagatactcagtatggttct tcagtcgatatatgggctattggttgtgtttttgcagagctcctgacaggccagccactg tggcctggaaaatcagatgtggaccaactttatctgataatcagaacactaggaaaatta atcccaagacatcaatcaatctttaaaagtaacgggtttttccatggcatcagtatacct gagccagaagacatggaaactcttgaggaaaagttctcagatgttcatcctgtggctctg aacttcatgaaggggtgtctgaagatgaatccagatgacagattaacctgttcccaactc ctggagagctcctactttgattcttttcaagaggcccaaattaaaagaaaagcacgtaat gaaggaagaaacagaagacgccaacagcaggcacctaagtctgcctttcctaggcttttt ctcaaaaccaagatctgccaggtacaaaggaatgaaacccagacatctgggaaccaaatt ctacccaatgggccaatattgcagaacagtatggtgactgttatgacaaacattaattct gcagtttatcagttccttcctgatgctgtccaaatactgcgccatcatggtctgttgcag attgaattgagaggtcagaggctgtgtttggctgctgcagctctgctccaggctgcattg agaatcgatttcgagtgtcttctcattcagggacccaggccaaaggagcagctcatatgg gaaatatgcccttctcatggcagagggcacaggaaacttgaaatacccttaaaatgttct gttcaggagtgttcttggaatttggggattgatattctggaacttgagcaggaagtgggt gacctgacagaagaagattaa >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_5|251_aa MEKYEKLAKTGEGSYGVVFKCRNKTSGQVVAVKKFVESEDDPVVKKIALREIRMLKQLKH PNLVNLIEVFRRKRKMHLVFEYCDHTLLNELERNPNGPTIKKSYESLKKNKAQCGVRECP GGVLVKVLQRNKTSGIHRGTDNRKFIIGIDSCDYGGREVPQSAICKLENQESRWSNSGQV LRPQRTESFKVQRPESQELPHPRAGEDGRPSSRREKQPFPYRFVVSRPSTGLMMPACLPS PLWVGLLYAVY >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_5|756_bp atggaaaagtatgaaaaattagctaagactggagaagggtcttatggggttgtattcaaa tgcagaaacaaaacctctggacaagtagtagctgttaaaaaatttgtggaatctgaagat gatcctgttgttaagaaaatagcactaagagaaatacgtatgttgaagcaattaaaacat ccaaatcttgtgaacctcatcgaggtgttcaggagaaaaaggaaaatgcatttagttttt gaatactgtgatcatacacttttaaatgagctggaaagaaacccaaatggtccaacaata aagaagagttatgagtctttgaagaaaaataaagcacagtgtggagttagagaatgccca ggaggtgtattagtcaaggttcttcagagaaacaaaaccagtgggatacatagaggtact gataacaggaaatttattattggtattgactcatgtgattatggagggagagaagttcca caatctgccatctgcaagctagagaaccaggaaagccgatggtctaattcaggccaagtc ctaaggccccagaggacagagagtttcaaagtccaaaggcctgagagccaggagcttcca catccaagggcaggagaagatggacgtcccagctcaaggagagagaagcagcccttcccc taccgttttgttgtatccaggccttcaacgggtttgatgatgcctgcctgcctgccctca cctttgtgggtgggtcttctttacgcggtctactga >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_6|177_aa MTKIQRLTISSVDKMETGTHTNHWGHRSGRRSRFSFGELFLVPATGIRSLDLAAHPINPA RCPQFPRHRQNPHLLHPQGDGDEASSSAEQPQPQLPRVSLGGGAVYPATSEPAEPPQRLP GNRNARARGALRLPERVPLALAAPALILLRGDSVLAVLTALARSRRLLCLGSHFGGT >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_6|534_bp atgactaaaatccaaagactgacaatatcaagtgttgataagatggaaactggaactcac acaaatcactggggccaccgaagcggaaggcggtcccgcttttctttcggggaacttttc ttggttcctgcaactgggatccgcagtctagacctcgcggctcaccccatcaaccccgca cgctgcccgcagttcccccgccaccggcagaatccgcacttactgcacccacagggcgac ggtgacgaagcttcgagcagcgccgaacagccccagccccagctcccgcgcgtgtcgctc ggaggaggcgccgtgtacccagcgacttcggagccagcggaaccgccgcagcgtctcccg ggcaaccggaacgctagggcgcgcggggcactgcggcttccggagcgggttcccctggcg ctggcagccccagccctgatccttctgagaggtgacagcgtgctggcagtcctcacagcc ctcgctcggtctcgacgcctcctatgcctgggctcccactttggcggcacttga >gi568815596r:39079169_39329532|GENSCAN_predicted_peptide_7|452_aa HPFVTQHLTRSLAIELLDKVNNPDHSTYHDFDDDDPEPLVAVPHRIHSTSRNVREEKTRS EITFGQVKFDPPLRKETEPHHELDLQLEYGQGHQGGYFLGANKGHVAHLEDDEGDDDESK HSTLKAKIPPPLPPKEMHSTEDENQGTIKRCPMSGSPAKPSQVPPRPPPPRLPPHKPVAL DQYLIFGAEEGIYTLNLNELHETSMEQVKLLSFIPIIYQGFLIMQDKCKSYLLLFQHTNS LTEYCQVRNPYTGHKYLCGALQTSIVLLEWVEPMQKFMLIKHIDFPIPCPLRMFEMLVVP EQEYPLVCVGVSRGRDFNQVVRFETVNPNSTSSWFTESGCIKIVNLQGRLKSSRKLSSEL TFDFQIESIECHSSNKQMIIISQFRGISDQDVKMESAFKDIARVKKQDQQQNLKAVYRSE ANRPAGLQLQKYPTGDGEPQPSSLLVVPDNQA >gi568815596r:39079169_39329532|GENSCAN_predicted_CDS_7|1359_bp catccttttgtaacacaacatttgacacggtctttggcaatcgagctgttggataaagta aataatccagatcattccacttaccatgatttcgatgatgatgatcctgagcctcttgtt gctgtaccacatagaattcactcaacaagtagaaacgtgagagaagaaaaaacacgctca gagataacctttggccaagtgaaatttgatccacccttaagaaaggagacagaaccacat catgaacttgatctgcaactggaatatggacaaggacaccaaggtggttactttttaggt gcaaacaaaggacacgtcgcacatttagaagatgatgaaggagatgatgatgaatctaaa cactcaactctgaaagcaaaaattccacctcctttgccaccaaaggaaatgcattctact gaggatgaaaatcaaggaacaatcaagagatgtcccatgtcagggagcccagcaaagcca tcccaagttccacctagaccaccacctcccagattacccccacacaaacctgttgcctta gatcagtacttgatatttggtgccgaagaagggatttataccctcaatcttaatgaactt catgaaacatcaatggaacaggtaaagcttctcagctttattcccataatttaccagggc tttttgattatgcaagacaaatgcaaaagttacctgttgctattccagcacacaaactcc ctgacagaatactgccaagtaagaaatccttacacgggccataaatacctatgtggagca cttcagactagcattgttctattagaatgggttgaaccaatgcagaaatttatgttaatt aagcacatagattttcctataccatgtccacttagaatgtttgaaatgctggtagttcct gaacaggagtaccctttagtttgtgttggtgtcagtagaggtagagacttcaaccaagtg gttcgatttgagacggtcaatccaaattctacctcttcatggtttacagaatcaggttgt ataaaaatagtaaatctccaaggaagattaaaatctagcaggaaattgtcatcagaactc acctttgatttccagattgaatcaatagaatgtcattcatccaacaaacaaatgattata ataagtcagtttcgagggatatctgatcaagatgttaaaatggaatccgctttcaaagac atagccagggtcaaaaagcaagatcaacaacagaatctcaaagcagtatacaggtctgaa gctaacaggcctgctgggctccagttacaaaagtatcccacaggagatggagaaccacag ccaagttcattgcttgtggtacctgataatcaggcatga