GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:06:04 Sequence gi568815583f:50339074_50599085 : 260012 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7538 7639 102 1 0 71 95 52 0.109 3.65 1.02 Intr + 15365 15599 235 1 1 83 35 104 0.090 0.94 1.03 Intr + 15854 16127 274 0 1 14 72 240 0.061 10.67 1.04 Intr + 22379 22539 161 2 2 64 35 79 0.008 -1.09 1.05 Term + 67851 67921 71 1 2 89 49 101 0.284 3.42 1.06 PlyA + 69123 69128 6 1.05 2.00 Prom + 71460 71499 40 -6.15 2.01 Init + 78475 78921 447 0 0 50 32 442 0.628 30.81 2.02 Intr + 79166 79276 111 1 0 30 61 159 0.470 7.06 2.03 Term + 79429 79620 192 0 0 39 54 164 0.534 4.54 2.04 PlyA + 80518 80523 6 -0.45 3.00 Prom + 82759 82798 40 -8.45 3.01 Init + 85111 85329 219 0 0 49 72 145 0.846 5.96 3.02 Term + 85466 85579 114 1 0 33 43 174 0.904 4.99 3.03 PlyA + 87053 87058 6 1.05 4.00 Prom + 90604 90643 40 -5.35 4.01 Init + 100001 100104 104 1 2 72 64 96 0.958 5.36 4.02 Intr + 102276 102420 145 0 1 68 82 125 0.920 9.26 4.03 Intr + 110327 110412 86 1 2 44 87 77 0.403 0.90 4.04 Intr + 119927 120089 163 2 1 53 63 166 0.910 9.66 4.05 Intr + 123207 123249 43 2 1 42 115 42 0.788 -0.81 4.06 Intr + 125974 126118 145 2 1 53 59 152 0.574 7.22 4.07 Term + 127632 127722 91 0 1 107 38 196 0.998 12.71 4.08 PlyA + 128039 128044 6 1.05 5.00 Prom + 129945 129984 40 -6.95 5.01 Init + 131815 131840 26 0 2 114 47 -13 0.150 -3.75 5.02 Intr + 132560 132722 163 2 1 34 47 167 0.156 6.26 5.03 Intr + 137776 137920 145 0 1 74 50 117 0.970 5.43 5.04 Intr + 138203 138426 224 0 2 25 91 201 0.866 11.02 5.05 Intr + 142408 142992 585 0 0 81 121 752 0.999 69.72 5.06 Intr + 145202 145288 87 1 0 39 106 96 0.970 5.75 5.07 Intr + 150728 150808 81 1 0 49 93 87 0.947 4.22 5.08 Intr + 151190 151452 263 1 2 69 55 304 0.999 20.56 5.09 Intr + 153628 153840 213 1 0 112 99 110 0.999 11.41 5.10 Intr + 154997 155207 211 2 1 76 69 200 0.999 14.89 5.11 Intr + 156775 157011 237 0 0 54 111 61 0.745 1.89 5.12 Intr + 158016 158158 143 2 2 75 55 32 0.531 -3.17 5.13 Intr + 159523 159655 133 1 1 91 89 57 0.857 5.83 5.14 Term + 159830 160015 186 1 0 81 49 149 0.894 6.81 5.15 PlyA + 160623 160628 6 -0.45 6.14 PlyA - 160968 160963 6 1.05 6.13 Term - 161764 161696 69 1 0 110 40 53 0.002 -0.34 6.12 Intr - 180970 180898 73 0 1 34 68 96 0.021 0.69 6.11 Intr - 183909 183881 29 0 2 67 89 58 0.044 -0.20 6.10 Intr - 190856 190724 133 1 1 74 89 50 0.234 3.43 6.09 Intr - 199778 199636 143 2 2 114 86 14 0.519 2.03 6.08 Intr - 202191 201976 216 0 0 26 57 149 0.434 3.38 6.07 Intr - 204720 204525 196 2 1 47 115 87 0.216 5.80 6.06 Intr - 207424 207400 25 2 1 135 50 26 0.047 -0.33 6.05 Intr - 235406 235201 206 1 2 25 94 167 0.888 8.92 6.04 Intr - 235646 235564 83 2 2 82 94 34 0.898 0.92 6.03 Intr - 236062 235779 284 2 2 117 89 91 0.801 8.31 6.02 Intr - 253553 252838 716 1 2 79 97 414 0.976 31.55 6.01 Intr - 254676 254544 133 1 1 129 26 142 0.996 10.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_1|280_aa RWESHYGAKAGSEPPHPAQEAEKSKIRHWQIQYLHRAPRTRHPRRPFPSPESPAGPSGCR DKGYRGARGRGPAAPAFLFLPPTANLQSPAEPPLPRRDPIATPGLPLGTGKPAKRSTSTS GGNRALHRPALTWKSFGSGDGKAAGGGAGTRGASYPAGLTRPRYTQSASAAQGAIFRKCR VWSAPKIPTEKSPVLRAEGHMVCEMAKMENPENARCWSRVGVTETTQLLEMYIGITTLEN GKFNKQANQSRFRDICLVIFKRTKHSGSSGTTSSPQGAAE >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_1|843_bp agatgggagtctcactatggtgccaaggctggtagtgaaccacctcacccagcccaggag gctgagaagtccaaaataaggcactggcagattcagtatctgcacagggcgcccaggacc cgccaccctcgccgcccgttcccctctccggagagcccggcggggccgtcaggctgccga gacaaagggtaccgcggcgccagaggccgaggcccagccgcgcctgccttcctctttctc ccgcccactgccaacctccagtcgcccgcagaacctccgctgcctcggcgcgaccccatc gccaccccggggctgccgctcgggaccggcaagccggcgaagagatcaacttccacatcc ggtggtaacagggccctacacagacccgcactgacctggaaaagcttcgggagcggcgac gggaaggcagcaggaggtggtgcggggacccgaggcgcctcgtacccggccgggctgacg cggccccgctacacacaaagcgcttcagctgcacagggcgctattttccgaaaatgccgc gtctggtcggcgcccaaaatccccaccgaaaagtcccccgtgctgcgcgcggagggacac atggtgtgcgaaatggctaaaatggaaaacccagaaaatgccagatgttggtcaagagtt ggagtaactgaaactacacagctgctggaaatgtatattggtataaccactttggaaaat gggaagttcaacaaacaggcaaaccaatctagatttagggatatatgtttagttatattc aaacgcacaaaacacagtggcagcagtgggactacatctagtcctcaaggagctgcagaa tag >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_2|249_aa MTVETVVLTETLLVLGAEVQWSSCNIFSTQDHAAAAIAKAGIPVNAWKGETEEKYLWCTK QTLYFKDKLLNMILDITGGLTNLTHTKYPQLLSGIRDISEETMTEVHNLYKMMANGILKV STINVNDSVTKSKFDNLYGCHQSLPRWHQLPNRHFEQMKDDAIVCNTGHFDVKIDVRWLN KNAMKKIELWTHSDKYPIEVHFLPRKLDEAVAEAHLGKVNMKLTKQTEKQAQYLGMSCDS LFKLDHYHY >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_2|750_bp atgactgtggagacagtcgtcctcactgagaccctccttgtcctgggtgctgaagtgcag tggtccagctgcaacatcttctccacccaggaccatgcagcagctgccattgccaaggct gggattccagtgaacgcctggaagggagaaacggaggagaagtacctttggtgcaccaag cagacactgtacttcaaggacaagctcctcaacatgattctggacatcactgggggcctc accaacctcactcacactaagtacccacagctcttgtcgggcatcagagacatctccgag gagaccatgactgaggtccacaacctatacaagatgatggccaatgggatcctgaaggtg tccaccatcaacgttaatgattctgtcaccaagagcaagtttgacaacctctatggctgc caccagtccctccccagatggcatcaactgcccaacaggcactttgaacagatgaaggat gatgccatcgtgtgtaacactggacactttgacgtgaaaatcgatgtcaggtggctcaac aagaatgctatgaagaagattgagctgtggacccactcagataagtaccccattgaggtt cacttcctgcccaggaagctggatgaggcagtggctgaagcccatctgggcaaggtgaac atgaaactgaccaagcagactgagaagcaggcccagtacctgggcatgtcctgtgatagt ctcttcaagctggatcactaccactactga >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_3|110_aa MLTRLRETPPLPPPPTHNVLLDPPSTIPRPAALTEKGRAVEERGVKGAHAKPRPCRDVWR RDASPRNANREKGHLFSGKSSAVNDERLSFHQLLVTPETSSVRGEEWDNS >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_3|333_bp atgttgacccgactcagagaaaccccgcccttgccgccaccgcccacccataacgtcctc ctcgaccctccctcgaccatccctcgccccgcggccctcaccgaaaaggggagggcggtg gaagagagaggagtcaagggagcgcacgcgaagccccgcccctgccgtgacgtctggaga cgcgacgcgtcgcctcgcaatgcaaatcgggaaaaggggcacctgttctctgggaagtcg tccgctgtgaacgatgaacgcctttccttccaccagctgctggttaccccggagacaagc tctgtccgcggagaggagtgggacaactcctaa >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_4|258_aa MPAVASVPKELYLSSSLKDLNKKTEVKPEKISTKSYVHSALKIFKTAEECRLDRDEERAY VLYMKYVTVYNLIKKRPDFKQQQDYFHSILGPGNIKKAVEEAERLSESLKLRYEEAEVRK KLEEKDRQEEAQRLQQKRQETGREDGGTLAKGSLENVLDSKDKTQKSNGEKNEKCETKEK GAITAKELYTMMTDKNISLIIMDARRMQDYQDSCILHSLSVPEEAISPGGTGRSGSRTAA VAFKDAGKTPMEPEIAIH >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_4|777_bp atgcctgctgtggcttcagttcctaaagaactctacctcagttcttcactaaaagacctt aataagaagacagaagttaaaccagagaaaataagcactaagagttatgtgcacagtgcc ctgaagatctttaagacagcagaagaatgcagattagatcgtgatgaggaaagggcctat gtactatatatgaaatacgtgactgtttataatcttatcaaaaaaagacctgatttcaag caacagcaggattatttccattcaatacttggacctggaaacatcaaaaaagctgtcgaa gaagctgaaagactctctgaaagccttaaattaagatatgaagaagctgaagtccggaaa aaacttgaggaaaaagacaggcaggaggaagcacagcggctacaacaaaaaaggcaggaa acaggaagagaggatggtggcacattggctaaaggctctttggagaatgttttggattcc aaagacaaaacccaaaagagcaatggtgaaaagaatgaaaaatgtgagaccaaagagaaa ggagcaatcacagcaaaggaactatacacaatgatgacggataaaaacatcagcttgatt ataatggatgctcgaagaatgcaggattatcaggattcctgtattttacattctctcagt gttcctgaagaagccatcagtccaggaggaaccggtcggtcaggaagccgcacagcagcc gtggcttttaaagatgctggaaaaacacccatggagccggagatagcaattcactga >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_5|898_aa MPSRTLVFIVTASWIEAHLPDDSKDTWKKRGNVEYVVLLDWFSSAKDLQIGTTLRSLKDA LFKWESKTVLRNEPLVLEGGYENWLLCYPQYTTNAKVTPPPRRQNEEVSISLDFTYPSLE ESIPSKPAAQTPPASIEVDENIELISGQNERMGPLNISTPVEPVAASKSDVSPIIQPVPS IKNVPQIDRTKKPAVKLPEEHRIKSESTNHEQQSPQSGKVIPDRSTKPVVFSPTLMLTDE EKARIHAETALLMEKNKQEKELRERQQEEQKEKLRKEEQEQKAKKKQEAEENEITEKQQK AKEEMEKKESEQAKKEDKETSAKRGKEITGVKRQSKSEHETSDAKKSVEDRGKRCPTPEI QKKSTGDVPHTSVTGDSGSGKPFKIKGQPESGILRTGTFREDTDDTERNKAQREPLTRAR SEEMGRIVPGLPSGWAKFLDPITGTFRYYHSPTNTVHMYPPEMAPSSAPPSTPPTHKAKP QIPAERDREPSKLKRSYSSPDITQAIQEEEKRKPTVTPTVNRENKPTCYPKAEISRLSAS QIRNLNPVFGGSGPALTGLRNLGNTCYMNSILQCLCNAPHLADYFNRNCYQDDINRSNLL GHKGEVAEEFGIIMKALWTGQYRYISPKDFKITIGKINDQFAGYSQQDSQELLLFLMDGL HEDLNKADNRKRYKEENNDHLDDFKAAEHAWQKHKQLNESIIVALFQGQFKSTVQCLTCH KKSRTFEAFMYLSLPLASTSKCTLQDCLRLFSKEEKLTDNNRFYCSHCRARRDSLKKIEI WKLPPVLLVHLKRFSYDGRWKQKLQTSVDFPLENLDLSQYVIGPKNNLKKYNLFSVSNHY GGLDGGHYTAYCKNAARQRWFKFDDHEVSDISVSSVKSSAAYILFYTSLGPRVTDVAT >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_5|2697_bp atgcccagccggactttagtttttatagtcactgctagttggattgaagcacacctgcca gatgattctaaagacacatggaagaagagggggaatgtggagtatgtggtacttcttgac tggtttagttctgccaaagatttacagattggaacaactctccggagtctgaaagatgca cttttcaagtgggaaagtaaaactgtcctgcgcaatgagcctttggttttagagggaggc tatgaaaactggctcctttgttatccccagtatacaacaaatgctaaggtcactccaccc ccacgacgccagaatgaagaggtgtctatctcattggattttacttatccctcattggaa gaatcaattccttctaaacctgctgcccagacgccacctgcatctatagaagtagatgaa aatatagaattgataagtggtcaaaatgagagaatgggaccactgaatatatcaactcca gttgaaccagttgctgcttctaaatctgatgtttcacccataattcagccagtgcctagt ataaagaatgttccacagattgatcgtactaaaaaaccagcagtcaaattgcctgaagag catagaataaaatctgaaagtacaaaccatgagcaacaatctcctcagagtggaaaagtt attcctgatcgttccaccaagccagtagttttttctccaactctcatgttaacagatgaa gaaaaggctcgtattcatgcagaaactgctcttctaatggaaaaaaacaaacaagaaaaa gaacttcgggaaaggcagcaagaggaacagaaagagaaactgaggaaggaagaacaagaa caaaaagccaaaaagaaacaagaagctgaagaaaatgaaattacagagaagcaacaaaaa gcaaaagaagaaatggagaagaaagaaagtgaacaggccaagaaagaagataaagaaacc tcagcaaagaggggcaaagaaataacaggagtaaaaagacaaagtaaaagtgaacatgaa acttctgatgccaagaaatctgtagaagatagggggaaaaggtgtccaaccccagaaata cagaaaaagtcaacaggagatgtgccccatacatctgtgacaggggattcaggttcaggc aagccatttaagattaaaggacaaccagaaagtggaattctaaggacaggaacttttaga gaggatacagacgataccgaaagaaataaagctcaacgagaacctttgacaagagcacga agtgaagaaatggggaggatcgtaccaggactgccttcaggctgggccaagtttcttgac ccaatcactggaacctttcgttattatcattcacccaccaacactgttcatatgtaccca ccggaaatggctccttcatctgcacctccttccacccctccaactcataaagccaagcca cagattcctgctgagcgggatagggaaccttccaaactgaagcgctcctactcctcccca gatataacccaggctattcaagaggaagagaagaggaagccaacagtaactccaacagtt aatcgggaaaacaagccaacatgttatcctaaagctgagatctcaaggctttctgcttct cagattcggaacctcaatcctgtttttggaggttctggaccagctcttactggacttcgt aacttaggaaatacttgttatatgaactcaatattgcagtgcctatgtaacgctccacat ttggctgattatttcaaccgaaactgttatcaggatgatattaacaggtcaaatttgttg gggcataaaggtgaagtggcagaagaatttggtataatcatgaaagccctgtggacagga cagtatagatatatcagtccaaaggactttaaaatcaccattgggaagatcaatgaccag tttgcaggatacagtcagcaagattcacaagaattgcttctgttcctaatggatggtctc catgaagatctaaataaagctgataatcggaagagatataaagaagaaaataatgatcat ctcgatgactttaaagctgcagaacatgcctggcagaaacacaagcagctcaatgagtct attattgttgcactttttcagggtcaattcaaatctacagtacagtgcctcacatgtcac aaaaagtctaggacatttgaggccttcatgtatttgtctctaccactagcatccacaagt aaatgtacattacaggattgccttagattattttccaaagaagaaaaactcacagataac aacagattttactgcagtcattgcagagctcgacgggattctctaaaaaagatagaaatc tggaagttaccacctgtgcttttagtgcatctgaaacgtttttcctacgatggcaggtgg aaacaaaaattacagacatctgtggacttcccgttagaaaatcttgacttgtcacagtat gttattggtccaaagaacaatttgaagaaatataatttgttttctgtttcaaatcactac ggtgggctggatggaggccactacacagcctattgtaaaaatgcagcaagacaacggtgg tttaagtttgatgatcatgaagtttctgatatctccgtttcttctgtgaaatcttcagca gcttatatcctcttttatacttcattgggaccacgagtaactgatgtagccacataa >gi568815583f:50339074_50599085|GENSCAN_predicted_peptide_6|768_aa XLFLTEEDQKKLHDFEEQCVEMYFNEKDDKFHSGSEERIRVTFERVEQMCIQIKEVGDRV NYIKRSLQSLDSQIGHLQDLSALTVDTLKTLTAQKASEASKVHNEITRELSISKHLAQNL IDDGPVRPSVWKKHGVVNTLSSSLPQGDLESNNPFHCNILMKDDKDPQCNIFGQDLPAVP QRKEFNFPEAGSSSGALFPSAVSPPELRQRLHGVELLKIFNKNQKLGSSSTSIPHLSSPP TKFFVSTPSQPSCKSHLETGTKDQETVCSKATEGDNTEFGAFVGEPVTVYRLEESSPNIL NNSMSSWSQLGLCAKIEFLSKEEMGGGLRRAVKVQCTWSEHDILKSGHLYIIKSFLPEVV NTWSSIYKEDTVLHLCLREIQQQRAAQKLTFAFNQMKPKSIPYSPRFLEVFLLYCHSAGQ WFAVEECMTGEFRKYNNNNGDEIIPTNTLEEIMLAFSHWTYEYTRGELLVLDLQDDFDIY HVLDCSEVATAFAYLMTDMWLGDSDCVSPEIFWSALGNLYPAFTKKMQQDAQEFLICVLN ELHEALKKYHYSRRRSYEKGSTQRCCRKWITTETSIITQLFEEQLNYSIVCLKCEKCTYK NEVFTVFSLPIPSKYECSLRDCLQCFFQQDALTWNNEIHCSFCETKQETAVRASISKAPK IIIFHLKRFDIQGTTKRKLRTDIHYPLTNLDLTPYICSIFRKYPKYNLCAVVEFQYYAEK KWHKQLLWPARILDVVNSAAINMRVQNHFGDLDGGHYTAFCKNSVTQA >gi568815583f:50339074_50599085|GENSCAN_predicted_CDS_6|2307_bp naacttttcttaacagaagaagatcaaaagaaacttcatgattttgaagagcagtgtgtt gaaatgtatttcaatgaaaaagatgacaaatttcattctgggagtgaagagagaattcgt gtcacttttgaaagagtggaacagatgtgcattcagattaaagaagttggagatcgtgtc aactacataaaaagatcattacaatcattagattctcaaattggccatttgcaagatctt tcagccctgacggtagatacattaaaaacactcactgcccagaaagcgtcggaagctagc aaagttcataatgaaatcacacgagaactgagcatttccaaacacttggctcaaaacctt attgatgatggtcctgtaagaccttctgtatggaaaaagcatggtgttgtaaatacactt agctcctctcttcctcaaggtgatcttgaaagtaataatccttttcattgtaatatttta atgaaagatgacaaagatccccagtgtaatatatttggtcaagacttacctgcagtaccc cagagaaaagaatttaattttccagaggctggttcctcttctggtgccttattcccaagt gctgtttcccctccagaactgcgacagagactacatggggtagaactcttaaaaatattt aataaaaatcaaaaattaggcagttcatctactagcataccacatctgtcatccccacca accaaattttttgttagtacaccatctcagccaagttgcaaaagccacttggaaactgga accaaagatcaagaaactgtttgctctaaagctacagaaggagataatacagaatttgga gcatttgtaggggagcctgtcacagtgtatcgtttggaagagagttcacccaacatacta aataacagcatgtcttcttggtcacaactaggcctctgtgccaaaatagagtttttaagc aaagaggagatgggaggaggtttacgaagagctgtcaaagtacagtgtacctggtcagaa catgatatcctcaaatcagggcatctttatattatcaaatcttttcttccagaggtggtt aatacatggtcaagtatttacaaagaagatacagttctgcatctctgtctgagagaaatt caacaacagagagcagcacaaaagcttacgtttgcctttaatcaaatgaaacccaaatcc ataccatattctccaaggttccttgaagttttcctgctgtattgccattcagcaggacag tggtttgctgtggaagaatgtatgactggagaatttagaaaatacaacaataataatgga gatgagattattccaactaatactctggaagagatcatgctagcctttagccactggact tacgaatatacaagaggggagttactggtacttgatttgcaagatgacttcgatatctac cacgtcctcgattgcagtgaagttgccactgcttttgcctatctgatgacagacatgtgg ctgggagactcagactgtgtctcaccagaaatattctggtcagctcttggcaacctctac ccagcatttacgaaaaagatgcaacaagatgctcaggaattcttgatttgtgtcctaaat gaacttcatgaagctctaaaaaagtaccactactcccggagaagatcatatgagaaagga tctactcagagatgctgcaggaagtggattaccactgagacatccatcatcacccagctg tttgaagagcagctcaattatagcatcgtatgtttaaagtgtgagaaatgcacctacaag aacgaagtcttcactgtcttctcactccccattccatccaaatatgaatgctcccttcgg gactgtctccaatgtttttttcaacaagacgcactgacctggaacaacgaaattcactgc tccttttgtgaaaccaagcaagaaactgctgtgagggccagtatttccaaagcaccaaaa ataattattttccacctaaaaaggtttgacattcagggtacaacaaaaaggaagctgaga acggatattcattacccactcactaacttggacctcactccttatatttgctcaattttc cggaaatatcctaaatacaacctctgtgcagtggtggaattccagtactatgctgaaaag aagtggcacaagcagctactatggccagctcgtatcttggatgtcgtgaatagtgctgca ataaacatgagggtgcagaaccattttggtgatttggatggtggccactacactgctttc tgcaagaattcagtcacccaggcctga