GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:39:24 Sequence gi568815580f:58571644_58847839 : 276196 bp : 43.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 8905 7171 1735 0 1 77 110 580 0.158 46.42 1.02 Intr - 35796 35679 118 1 1 76 111 72 0.429 7.82 1.01 Init - 42527 42488 40 2 1 91 103 -6 0.310 1.55 1.00 Prom - 44010 43971 40 -7.16 2.07 PlyA - 44584 44579 6 1.05 2.06 Term - 47226 47176 51 0 0 117 55 72 0.168 4.23 2.05 Intr - 59956 59733 224 2 2 43 95 78 0.011 1.85 2.04 Intr - 70886 70752 135 0 0 69 105 39 0.200 4.24 2.03 Intr - 74620 74552 69 2 0 68 102 30 0.026 1.55 2.02 Intr - 88471 88417 55 1 1 58 115 39 0.029 2.15 2.01 Init - 90852 90772 81 0 0 81 94 11 0.048 1.07 2.00 Prom - 91089 91050 40 -4.26 3.00 Prom + 93997 94036 40 -4.96 3.01 Init + 100001 100209 209 1 2 78 99 351 0.998 31.49 3.02 Intr + 109527 109693 167 0 2 103 96 83 0.993 10.20 3.03 Intr + 118693 118889 197 2 2 22 28 117 0.226 -1.77 3.04 Intr + 119554 119803 250 0 1 82 -6 258 0.012 12.81 3.05 Intr + 124723 124844 122 2 2 134 76 -27 0.975 0.91 3.06 Intr + 128262 128345 84 2 0 91 62 33 0.506 1.02 3.07 Intr + 128798 128948 151 1 1 91 89 109 0.862 11.04 3.08 Intr + 142440 142466 27 1 0 96 105 4 0.094 0.99 3.09 Intr + 151405 151608 204 2 0 85 95 160 0.200 15.57 3.10 Intr + 161754 161931 178 1 1 124 115 25 0.980 7.78 3.11 Intr + 162664 162738 75 1 0 69 76 116 0.993 7.13 3.12 Intr + 163559 163686 128 2 2 111 93 101 0.999 13.22 3.13 Intr + 170222 170371 150 0 0 90 64 62 0.849 4.13 3.14 Intr + 172695 172852 158 1 2 94 70 48 0.904 3.33 3.15 Intr + 174023 174085 63 1 0 116 -3 76 0.252 0.21 3.16 Intr + 175762 175882 121 0 1 125 20 55 0.355 2.47 3.17 Term + 189487 189584 98 2 2 72 36 88 0.040 0.13 3.18 PlyA + 190241 190246 6 1.05 4.04 PlyA - 192702 192697 6 1.05 4.03 Term - 197359 197144 216 1 0 56 55 95 0.070 0.14 4.02 Intr - 217307 217164 144 2 0 81 61 67 0.131 3.78 4.01 Init - 236050 235862 189 1 0 48 49 123 0.238 3.41 4.00 Prom - 238351 238312 40 -6.66 5.00 Prom + 240129 240168 40 -5.06 5.01 Sngl + 255574 256062 489 0 0 60 49 228 0.475 10.63 5.02 PlyA + 257119 257124 6 1.05 6.02 PlyA - 257894 257889 6 1.05 6.01 Term - 271932 271739 194 1 2 32 42 146 0.390 1.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 66886 67069 184 1 1 94 49 93 0.803 3.02 S.002 Term + 119554 119832 279 0 0 82 42 230 0.926 13.25 S.003 Init + 124487 124520 34 1 1 91 108 29 0.951 5.23 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_1|631_aa MSLLQHYGSFHPSCQPKPEVTWYKNGQAIDGSGIISNYEFFENQYIHVLHLSCCTKNDAA VYQISAKNSFGMICCSASVEVECSSENPQLSPNLEDDRDRGWKHETGTHEEERANQIDEK EHPYKEEESISPGTPRSADSSPSKSNHSLSLQSLGNLDISVSSSENPLGVKGTRHTGEAY DPSNTEEIANGLLFLNSSHIYEKQDRCCHKTVHSMASKFTDGDLNNDGPHDEGLRSSQQN PKVQKYISFSLPLSEATAHIYPGDSAVANKQPSPQLSSEDSDSDYELCPEITLTYTEEFS DDDLEYLECSDVMTDYSNAVWQRNLLGTEHVFLLESDDEEMEFGEHCLGGCEHFLSGMGC GSRVSGDAGPMVATAGFCGHHSQPQEVGVRSSRVSKHGPSSPQTGMTLILGPHQDGTSSV TEQGRYKLPTAPEAAENDYPGIQGETRDSHQAREEFASDNLLNMDESVRETEMKLLSGES ENSGMSQCWETAADKRVGGKDLWSKRGSRKSARVRQPGMKGNPKKPNANLRESTTEGTLH LCSAKESAEPPLTQSDKRETSHTTAAATGRSSHADARECAISTQAEQEAKTLQTSTDSVS KEGNTNCKGEGMQVNTLFETSQVPDWSDPPQ >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_1|1893_bp atgtcacttcttcaacattacggctctttccacccatcatgtcagcccaagccagaggta acttggtataagaatggtcaggccatcgatgggagtggcattatttccaactatgaattc tttgagaatcagtatattcatgtgttacatctctcttgctgtaccaaaaatgatgctgct gtctatcaaatctcggctaaaaactcttttggaatgatctgttgttctgcttccgttgag gttgagtgctcatcagagaacccacaattgtctcctaacctggaagatgacagggacagg ggttggaaacatgaaacagggacacatgaagaagaaagggcaaatcagattgatgagaag gaacatccttataaggaagaagaaagcatctccccgggcactcccaggtcagctgactcc tccccctccaaatccaaccattcactctccctccagtcattgggcaatcttgacattagt gtgtccagttctgaaaatcctttgggtgttaaaggaacaaggcacactggagaggcttat gatccaagtaacacagaagaaattgcaaatgggttgctttttcttaattcaagtcatatt tatgaaaaacaagacagatgttgccacaagacagtgcattccatggcatcaaagttcacg gatggtgacctgaacaatgatggtcctcatgatgaaggcttacgctctagtcagcaaaat cccaaagtacagaaatacattagcttcagcctcccgctatctgaggcaactgcacacatt tacccaggtgacagtgccgtggccaacaaacaacccagcccacagctttccagtgaagac tctgacagtgactatgaactttgcccagagataaccctaacctacaccgaggagttttca gatgatgacctggagtatctggaatgttctgatgttatgacggattactctaatgcagtt tggcaaaggaacctgctggggactgagcatgtttttttattagaaagcgatgacgaagag atggaattcggtgagcattgcctgggtgggtgtgagcatttcctcagtggaatgggttgt gggtctcgggtgtcgggtgacgctgggcctatggttgccactgctggcttctgtggtcat cactcacaaccccaagaagttggggtgaggagcagcagagtctccaagcacggtccctca tccccacaaacagggatgactctcattttgggacctcaccaggatggaacgtcttcagtg acagaacaggggagatataaactccccactgctcccgaggctgctgaaaatgattatcca ggaattcaaggagaaaccagagacagccaccaagcaagagaggaatttgccagtgacaat ctgctcaacatggatgaatcagtaagagagacagagatgaagctcttgtctggtgagtca gaaaactcagggatgagccagtgttgggagacggcagctgacaagagagtggggggaaag gacttatggagcaagaggggttcaaggaaatctgccagggtgaggcagccgggaatgaag ggaaatcccaagaagccgaatgccaacctgagagaaagtacaacagaaggtacccttcat ctctgctctgccaaagaatctgctgagcccccactaacccagagtgataaaagagagact tctcacaccacagcagcagcgactggtcggagttcccatgctgatgcaagagaatgtgct atttcaacccaggcagagcaagaagcaaaaacccttcaaacttcaacagactcagtctcc aaagaaggcaacacaaattgcaagggagaaggcatgcaagttaatactctatttgaaaca agccaggttccagactggagtgatcctcctcag >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_2|204_aa MKKFWGWLYTLQTDLRPLICTLKNSSKFKCDNHDHSRLISTLPFAGDPPMTSGPQTKQPK EHLTNFKSDIIPQFGLPTSIQSDNGLAFISQITQAVSQALGIQWLLVLPQTATQISDFNK RLLANNVDGEVRMHSSRLTTEAPCSTLDHVLFVTTNSANGIQVLSISPHFVNLEPLLFLS KETEFSTDLTASFLVTAAMRLADY >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_2|615_bp atgaagaagttctggggatggctgtacacactgcaaacggacttaaggccactgatttgc acactcaaaaatagttcaaagtttaaatgtgacaaccatgatcattctcggttaatatcg acattaccctttgcaggagatccacctatgacctcaggtcctcagaccaaacagcccaaa gaacatctcaccaatttcaaatcggacataattcctcagtttggccttcctacctctata cagtctgataacggactggcctttattagtcaaatcacccaagcagtttctcaggctctt ggtattcagtggctcctggttttacctcaaaccgccacccaaatttcagatttcaacaaa agactcctggctaacaatgttgatggggaagtgcgcatgcacagctctcgtcttacaaca gaagccccgtgcagcaccctggatcatgttctgtttgtgactacaaatagtgccaatggt atccaggtcctttctatttccccacactttgtcaacctggagcctcttcttttcctttct aaggagactgaattctccactgatctgacagcaagctttttggtgacagcagccatgcga cttgctgactattga >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_3|793_aa MSLLGDPLQALPPSAAPTGPLLAPPAGATLNRLREPLLRRLSELLDQAPEGRGWRRLAEL AGSRGRLRLSCLDLEQCSLKVLEPEGSPSLCLLKLMGEKGCTVTELSDFLQAMEHTEVLQ LLSPPDQISMVLADQYGAARQALTSSRQLNLGSFGAPRGGACEWGMHSMWKPEPPPLDGV YEIPGPEPIPSECNVSDVKDDTGFWEGYPYPYLHILHFLEKADLQPHSLQPDQLWAKMIL FAFGIALAQAQLLCGNDTKISEQPVVVQSTCTDGRIKITVNPESKAVLAGQFVKLCCRAT GHPFVQYQWFKMNKETLKEIGYCLVKEFILLQNFADQKVRGLKIPNGNTSELIFNAVHVK DAGFYVCRVNNNFTFEFSQWSQLDVCDIPESFQNELNNLGHPAKDKVALLIGNMNYREHP KLKAPLVDVYELTNLLRQLDFKVVSLLDLTEYEMRNAVDEFLLLLDKGVYGLLYYAGHGY ENFGNSFMVPVDAPNPYRSENCLCVQNILKLMQEKETGLNVFLLDMCRKRNDYDDTIPIL DALKVTANIVFGYATCQGAEAFEIQHSGLANGIFMKFLKDRLLEDKKITVLLDEVAEDMG KCHLTKGKQALEIRSSLSEKRALTDPIQGTEYSAESLVRNLQWAKAHELPESMCLKFDCG VQIQLGFAAEFSNVMIIYTSIVYKPPEIIMCDAYVTDFPLDLDIDPKDANKGTPEETGSY LEHLVFTVCLSYQYSGLEDTVEDKQEVNVGKPLIAKLDMHREFKLAPLMILLPAAIAYSD VRAAIPMENGMES >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_3|2382_bp atgtcgctgttgggggacccgctacaggccctgccgccctcggccgcccccacggggccg ctgctcgcccctccggccggcgcgaccctcaaccgcctgcgggagccgctgctgcggagg ctcagcgagctcctggatcaggcgcccgagggccggggctggaggagactggcggagctg gcggggagtcgcgggcgcctccgcctcagttgcctagacctggagcagtgttctcttaag gtactggagcctgaaggaagccccagcctgtgtctgctgaagttaatgggtgaaaaaggt tgcacagtcacagaattgagtgatttcctgcaggctatggaacacactgaagttcttcag cttctcagccccccagaccagatcagtatggtgttggcagatcagtatggtgccgctagg caggcactgactagctccaggcagctcaaccttgggagcttcggggccccgagaggtgga gcctgtgagtggggcatgcactccatgtggaagccagagcctcctcccctggatggagtg tacgagatccctggaccggagcccatcccctctgaatgcaatgtttctgatgtgaaagat gacacaggattctgggagggctatccttacccctatctccatatcctgcacttcctggag aaagccgatttgcaaccacatagccttcaaccagatcagctgtgggccaagatgatcctg tttgcttttggcattgccctggctcaggcccagctcctctgtggcaatgacaccaagatc tcggagcagcccgtggttgtgcagagtacatgcacagacggacgaataaagattactgta aacccagagtcaaaggcagtcttggctggacagtttgtgaaactgtgttgccgggcaact ggacatccttttgttcaatatcagtggttcaaaatgaataaagagaccttgaaggagatt ggatattgtttggttaaggagtttatcctcttacagaattttgcagatcagaaggtccgg ggtctcaagattccaaatggaaatacatcagagcttatttttaatgcagtgcatgtaaaa gatgcaggcttttatgtctgtcgagttaataacaatttcacctttgaattcagccagtgg tcacagctggatgtttgcgacatcccagagagcttccagaatgaattaaataatcttggt catcctgcgaaggacaaggttgcccttttgataggaaatatgaattaccgggagcacccc aagctcaaagctcctttggtggatgtgtacgaattgactaacttactgagacagctggac ttcaaagtggtttcactgttggatcttactgaatatgagatgcgtaatgctgtggatgag tttttactccttttagacaagggagtatatgggttattatattatgcaggacatggttat gaaaattttgggaacagcttcatggtccccgttgatgctccaaatccatataggtctgaa aattgtctgtgtgtacaaaatatactgaaattgatgcaagaaaaagaaactggacttaat gtgttcttattggatatgtgtaggaaaagaaatgactacgatgataccattccaatcttg gatgcactaaaagtcaccgccaatattgtgtttggatatgccacgtgtcaaggagcagaa gcttttgaaatccagcattctggattggcaaatggaatctttatgaaatttttaaaagac agattattagaagataagaaaatcactgtgttactggatgaagttgcagaagatatgggt aagtgtcaccttaccaaaggcaaacaggctctagagattcgaagtagtttatctgagaag agagcacttactgatccaatacagggaacagaatattctgctgaatctcttgtgcggaat ctacagtgggccaaggctcatgaacttccagaaagtatgtgtcttaagtttgactgtggt gttcagattcaattaggatttgcagctgagttttccaatgtcatgatcatctatacaagt atagtttacaaaccaccggagataataatgtgtgatgcctacgttactgattttccactt gatctagatattgatccaaaagatgcaaataaaggcacacctgaagaaactggcagctac ttggaacatctagtcttcacagtatgtttatcatatcagtactcaggattggaagatact gtagaggacaagcaggaagtgaatgttgggaaacctctcattgctaaattagacatgcat cgagagttcaaactcgcacctttgatgatactgcttcctgctgccattgcctattctgat gtcagagcagccattcccatggaaaatggaatggaaagctaa >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_4|182_aa MLQTRRDSPPITVVAHRQNEPDKTSTFQALDEADLDSVHSQKMLVAALLRQSLLCLSVFW IELKDPADMLWEVPKPPGKSKWSTIEAPGPWYQLNSQPIAPSHQLRVIHMEGPQMGIWLT VLKAGASLPGDSVELFDISLPFGSITGMGMGEEGAGHLSWPVDACPSLAVIPAALAAVPR SS >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_4|549_bp atgctacagacccgtagggactcaccacccataacagtagttgcccatcggcaaaatgag cctgacaaaacgtccactttccaagctcttgatgaagctgacttggacagtgttcacagt cagaagatgttggtcgcagcccttctccgtcagagcctcctctgtctctcagtgttctgg attgaactgaaagacccagctgacatgctatgggaagtccctaagccacctggaaagtcc aagtggagtacaattgaggctccaggaccatggtaccagctgaactcccagccaatagca ccaagtcaccaactccgagtgatccacatggaggggccccagatgggtatctggttgact gttctcaaggcaggggcctccctcccaggagattcggtggagctttttgatatttctctg ccctttggcagcatcacaggcatgggcatgggagaggagggagccggccacttgtcctgg cctgttgatgcatgcccgagtctggctgtcatcccagccgcgctggctgctgtccctcgg tcttcctga >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_5|162_aa MKPWTLAVSVQFLKMVCPEFVPSDVQMCPEFPPSGGFVVSLTSGMKLQTFTVGLTALKSS TSGVVHSFQWVVLLASGVKLQTFTVLQFTKAARTQRVSSGKIYHKEQKNKTHTLQKRTQA SCHRWFRQLAFIPSSGPTHILLIGPLYRELIGPFYRELIGLF >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_5|489_bp atgaagccgtggaccctcgcggtgagtgtacagttcttaaagatggtgtgtccggagttt gttccttcagatgttcagatgtgtccagagtttcctccttctggtgggttcgtggtctca ctgacttcaggaatgaagctgcaaaccttcacagtgggccttacagctcttaaaagcagt acatctggagttgttcattccttccagtgggtggtcttgctggcttcaggagtgaagctg cagaccttcacagtgttacagttcacaaaggcagcaaggactcaaagagtgagcagcggc aagatttatcacaaagagcaaaagaacaaaactcacacactgcagaagcggacccaagcc agttgccaccgctggttccggcagcttgcttttattccctcatctggccccacccacata ctgctgattggtccactttacagagagctgattggtccattttacagagagctgattggc ctgttttga >gi568815580f:58571644_58847839|GENSCAN_predicted_peptide_6|64_aa XLRRPGGVGSSAKDSSEGLEYLCTTELLETSTVAQAALNQQLPGNADPSYFEAGDTKAYD CHPD >gi568815580f:58571644_58847839|GENSCAN_predicted_CDS_6|195_bp nagctcagaaggcctgggggagtaggcagttcagccaaggacagttccgagggtttggaa tacctctgcacaacagagctcctggagacctcgacagttgcgcaggcagccctgaatcag cagctcccaggcaacgctgacccgagctattttgaagctggtgacactaaagcctatgac tgtcacccggattaa