GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:45:27

Sequence gi568815587f:73134160_73335290 : 201131 bp : 50.26% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3452   3537   86  1  2   65   67    76 0.664   3.49
 1.02 Intr +   7493   7817  325  2  1  -11   24   241 0.521   3.58
 1.03 Intr +   8844   8898   55  2  1  109   50    85 0.256   5.35
 1.04 Intr +  17913  18001   89  1  2   44   68   110 0.514   4.29
 1.05 Term +  33560  33625   66  1  0   80   47   109 0.693   3.94
 1.06 PlyA +  34741  34746    6                               1.05

 2.00 Prom +  47290  47329   40                              -3.56
 2.01 Init +  51183  51304  122  2  2   79   59   124 0.834   6.30
 2.02 Intr +  54545  54621   77  2  2   45   94    65 0.745   1.96
 2.03 Intr +  56334  56454  121  1  1   -1  100   120 0.361   3.85
 2.04 Intr +  58442  58539   98  2  2   72   34    71 0.876  -0.25
 2.05 Intr +  59030  59065   36  0  0  107   97    13 0.786   2.33
 2.06 Intr +  63880  63957   78  2  0   28   71   106 0.334   2.42
 2.07 Intr +  64399  64524  126  2  0  110   72   -11 0.485   0.15
 2.08 Intr +  66737  66791   55  0  1  125  116     3 0.639   4.84
 2.09 Intr +  88055  88092   38  2  2  125  105     6 0.905   3.91
 2.10 Intr +  88883  89023  141  0  0   27   67   146 0.736   6.72
 2.11 Intr +  94896  95022  127  1  1   65   16    91 0.386  -0.66
 2.12 Term +  99997 101134 1138  1  1  133   49  1916 0.556 183.64
 2.13 PlyA + 107039 107044    6                               1.05

 3.00 Prom + 113634 113673   40                              -6.06
 3.01 Init + 114102 114216  115  2  1   49   62    92 0.339   3.07
 3.02 Intr + 130370 130490  121  0  1   99  110    49 0.385   7.75
 3.03 Intr + 132669 132802  134  0  2   78   59     7 0.159  -2.91
 3.04 Intr + 138181 138307  127  2  1   91   92    36 0.303   4.04
 3.05 Intr + 155252 155277   26  2  2   86   72    67 0.261   2.57
 3.06 Intr + 156435 156593  159  1  0   69   55    80 0.309   2.76
 3.07 Intr + 156653 156686   34  0  1   90   97    20 0.190   0.38
 3.08 Term + 162326 163346 1021  2  1   68   47  1506 0.030 136.09
 3.09 PlyA + 164443 164448    6                              -0.45

 4.00 Prom + 164598 164637   40                              -7.66
 4.01 Init + 174480 177671 3192  2  0  102   89  2582 0.649 250.52
 4.02 Term + 191433 191597  165  2  0   94   47    81 0.568   2.52
 4.03 PlyA + 192574 192579    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 162360 163346  987  2  0   97   47  1490 0.965 142.65

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:73134160_73335290|GENSCAN_predicted_peptide_1|206_aa
MPEPCAKSKSFEITENKELAVIGNQEFVRTDEPRRPLCRKGNSHARPPPGAGGKSVTSPR
RGRSRPLAGGGRGAAVAPAAFRVRISPNPKAPQLRPYLPRRRLHDAEGTSILPDGSVSKD
QEEEEGRRGGDGPASARLDVRERGEQGQVALRLAAGVNCFLLVPKYLTLHLEMIDLNGQI
QVEGQLWRQDFLELDLLLQGWLPDKH

>gi568815587f:73134160_73335290|GENSCAN_predicted_CDS_1|621_bp
atgccagaaccatgtgcaaaatcaaaatcctttgagatcacagagaacaaagaacttgca
gttattggtaatcaagaatttgtcaggacggacgagccccggcggcccctctgtcggaaa
ggcaactcacacgcgcgccccccacctggagccggcggcaagtcggtgacaagcccaaga
cgagggcggtcacggccccttgcgggagggggaaggggcgcagcggtcgccccagccgcg
ttccgcgtacgcatctctccgaaccccaaagccccccagctgcgcccttaccttcctcgg
cggcggctgcatgatgctgaagggacatcaatcctccccgacggcagcgttagcaaggac
caggaggaggaggagggccggagaggaggggacggcccagcgagcgcgcgcctagatgtg
cgcgagagaggcgagcaaggacaagtcgctttgcggctggccgcaggggtgaattgcttc
ctgctggtccccaaatacctgaccctgcacctggagatgattgatctgaatgggcagatc
caggtggagggtcagctatggagacaggatttcctggagctggacctgctgctgcagggc
tggctaccagacaagcactga

>gi568815587f:73134160_73335290|GENSCAN_predicted_peptide_2|718_aa
MPIHLCPALPFSPAATRRRQSNPSTAASGRSPISNVDGWLRGAQDLEQPMEIKTEYWCLH
AWRMQTGSLSGCCMKHGLKADSSVKGPSAFEVQEEKMKDEVGQFSGRMMNRRALAFPKKP
PAKGPPAVMKAQVPVKLLRGSLIHELLDLLEVCTFEDIEDIGVYKDVPKSGAYQKEAGFH
PRCHVTLKHHASILSSWSERVAQTLLVFDDLESFEYSSSGWHPALCYPRKKQLNPNERSE
ESLWVSTWLPDPCGSENLKTKGYQEQRVIELKKHSLHTRNQSLLELDCENQGTQNPGGLL
LDDRDREKHQMDPALKEHLKCVGVEDGRTNYQLAEKRPEGAMAADLGPWNDTINGTWDGD
ELGYRCRFNEDFKYVLLPVSYGVVCVPGLCLNAVALYIFLCRLKTWNASTTYMFHLAVSD
ALYAASLPLLVYYYARGDHWPFSTVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPL
RSLRWGRARYARRVAGAVWVLVLACQAPVLYFVTTSARGGRVTCHDTSAPELFSRFVAYS
SVMLGLLFAVPFAVILVCYVLMARRLLKPAYGTSGGLPRAKRKSVRTIAVVLAVFALCFL
PFHVTRTLYYSFRSLDLSCHTLNAINMAYKVTRPLASANSCLDPVLYFLAGQRLVRFARD
AKPPTGPSPATPARRRLGLRRSDRTDMQRIEDVLGSSEDSRRTESTPAGSENTKDIRL

>gi568815587f:73134160_73335290|GENSCAN_predicted_CDS_2|2157_bp
atgcccatccacctctgcccagccctgcctttctcccctgctgccacacggaggcgacag
agcaacccttccacagctgcctctggacgttctcccatcagcaacgtggatgggtggctg
agaggagcccaggatttagaacagcccatggagatcaagactgagtattggtgcctccat
gcctggaggatgcagacagggtccctctctggttgctgcatgaagcatggactgaaggca
gacagcagtgtcaaagggccctctgcttttgaggttcaggaggaaaagatgaaggatgaa
gttgggcagttcagcggaagaatgatgaataggagggccctggcctttcctaagaagccc
ccagccaagggtcctcctgctgtcatgaaggcccaagttccagtcaagttgttgagaggg
tcccttatccatgaacttcttgatttgttggaagtctgcacttttgaagatattgaagat
attggtgtctataaggatgtccccaagtcaggggcctatcaaaaagaggcaggattccat
ccaaggtgccacgttacacttaaacatcatgcctccatcctttcttcctggtctgagaga
gttgctcagactctcctggtttttgatgatcttgagagttttgagtattcatcatcaggc
tggcacccagcactgtgctatcctaggaagaaacagctaaatcccaatgaaaggtctgag
gagagcctctgggtgtcgacctggcttcctgacccctgtggctctgagaatctcaaaacc
aaaggctatcaggagcaacgagtaatagaattgaagaaacacagccttcacacccggaac
cagagcctcttggaattagactgcgagaaccagggaactcagaatcctggaggcctcctg
ctggatgatagggacagagaaaagcatcagatggacccagccctcaaggagcacctgaaa
tgtgttggggtagaggatggaaggacaaattaccaactggctgagaaacgtcctgagggg
gcgatggcagcagacctgggcccctggaatgacaccatcaatggcacctgggatggggat
gagctgggctacaggtgccgcttcaacgaggacttcaagtacgtgctgctgcctgtgtcc
tacggcgtggtgtgcgtgcctgggctgtgtctgaacgccgtggcgctctacatcttcttg
tgccgcctcaagacctggaatgcgtccaccacatatatgttccacctggctgtgtctgat
gcactgtatgcggcctccctgccgctgctggtctattactacgcccgcggcgaccactgg
cccttcagcacggtgctctgcaagctggtgcgcttcctcttctacaccaacctttactgc
agcatcctcttcctcacctgcatcagcgtgcaccggtgtctgggcgtcttacgacctctg
cgctccctgcgctggggccgggcccgctacgctcgccgggtggccggggccgtgtgggtg
ttggtgctggcctgccaggcccccgtgctctactttgtcaccaccagcgcgcgcgggggc
cgcgtaacctgccacgacacctcggcacccgagctcttcagccgcttcgtggcctacagc
tcagtcatgctgggcctgctcttcgcggtgccctttgccgtcatccttgtctgttacgtg
ctcatggctcggcgactgctaaagccagcctacgggacctcgggcggcctgcctagggcc
aagcgcaagtccgtgcgcaccatcgccgtggtgctggctgtcttcgccctctgcttcctg
ccattccacgtcacccgcaccctctactactccttccgctcgctggacctcagctgccac
accctcaacgccatcaacatggcctacaaggttacccggccgctggccagtgctaacagt
tgccttgaccccgtgctctacttcctggctgggcagaggctcgtacgctttgcccgagat
gccaagccacccactggccccagccctgccaccccggctcgccgcaggctgggcctgcgc
agatccgacagaactgacatgcagaggatagaagatgtgttgggcagcagtgaggactct
aggcggacagagtccacgccggctggtagcgagaacactaaggacattcggctgtag

>gi568815587f:73134160_73335290|GENSCAN_predicted_peptide_3|578_aa
MSCFMGYKEVHTTSYEEFIGEPEFDQTSRSNYQFSENTGTLGPELLPPSTRSPHLTGSPP
SSAWASDIWAGPAGAARRREYCCPRGPESFSGRSGQSPPGEKEAWPSPDFLSDKASCLLS
CQGDFCQNIARDSFRHRTDWQQGLLHEWEFAPALHGLQARHLLTLGGSCDHIDQEVKLIP
HFCAFAHTGPPVCDALFSLVMCPGPSGPSVPSNGHAFSMSFAGYSLPGLARSMGKAKARL
PEHRKPTWAAMEWDNGTGQALGLPPTTCVYRENFKQLLLPPVYSAVLAAGLPLNICVITQ
ICTSRRALTRTAVYTLNLALADLLYACSLPLLIYNYAQGDHWPFGDFACRLVRFLFYANL
HGSILFLTCISFQRYLGICHPLAPWHKRGGRRAAWLVCVAVWLAVTTQCLPTAIFAATGI
QRNRTVCYDLSPPALATHYMPYGMALTVIGFLLPFAALLACYCLLACRLCRQDGPAEPVA
QERRGKAARMAVVVAAAFAISFLPFHITKTAYLAVRSTPGVPCTVLEAFAAAYKGTRPFA
SANSVLDPILFYFTQKKFRRRPHELLQKLTAKWQRQGR

>gi568815587f:73134160_73335290|GENSCAN_predicted_CDS_3|1737_bp
atgtcctgctttatggggtacaaagaagtacacaccacctcctatgaagaattcataggt
gaacctgaatttgatcaaacctctagatctaactaccagttttcagaaaatacagggacg
ctggggccggagctgctgccgccgtctacacggtcccctcatttgacgggttcgcctcct
agcagcgcctgggcgagtgacatctgggccggaccagctggtgctgcgcggcgcagggag
tactgctgccccagaggccctgagagcttctctggaaggtctggacagagccctcctggg
gagaaggaggcctggcccagccctgactttctgagtgataaggccagctgtcttcttagc
tgtcagggagacttctgccagaacattgcacgcgacagtttcaggcacagaactgactgg
cagcaggggctgctccacgagtgggaatttgctccagcacttcacggactgcaagcgagg
cacttgctaactcttggtggctcctgtgaccacatcgaccaggaggttaagctcattccc
cacttctgtgcctttgcccacactgggcctcctgtctgcgatgccctcttcagccttgtt
atgtgtccaggtccctctggcccttcagtgcccagcaatggacatgctttctccatgagc
tttgcaggttactccctcccaggcctagccaggtccatgggcaaagctaaggcccgcctc
cctgaacataggaaacccacctgggcagccatggaatgggacaatggcacaggccaggct
ctgggcttgccacccaccacctgtgtctaccgcgagaacttcaagcaactgctgctgcca
cctgtgtattcggcggtgctggcggctggcctgccgctgaacatctgtgtcattacccag
atctgcacgtcccgccgggccctgacccgcacggccgtgtacaccctaaaccttgctctg
gctgacctgctatatgcctgctccctgcccctgctcatctacaactatgcccaaggtgat
cactggccctttggcgacttcgcctgccgcctggtccgcttcctcttctatgccaacctg
cacggcagcatcctcttcctcacctgcatcagcttccagcgctacctgggcatctgccac
ccgctggccccctggcacaaacgtgggggccgccgggctgcctggctagtgtgtgtagcc
gtgtggctggccgtgacaacccagtgcctgcccacagccatcttcgctgccacaggcatc
cagcgtaaccgcactgtctgctatgacctcagcccgcctgccctggccacccactatatg
ccctatggcatggctctcactgtcatcggcttcctgctgccctttgctgccctgctggcc
tgctactgtctcctggcctgccgcctgtgccgccaggatggcccggcagagcctgtggcc
caggagcggcgtggcaaggcggcccgcatggccgtggtggtggctgctgcctttgccatc
agcttcctgccttttcacatcaccaagacagcctacctggcagtgcgctcgacgccgggc
gtcccctgcactgtattggaggcctttgcagcggcctacaaaggcacgcggccgtttgcc
agtgccaacagcgtgctggaccccatcctcttctacttcacccagaagaagttccgccgg
cgaccacatgagctcctacagaaactcacagccaaatggcagaggcagggtcgctga

>gi568815587f:73134160_73335290|GENSCAN_predicted_peptide_4|1118_aa
MADGAPRPQLYRSVSFKLLERWSGGPGLREEDTDTPGLRRRASCRPTTAARGQPSRRVSK
LASGPLAAPAQPRPLRSLSPSVRQLSRRFDAPRLDDGSAGTRDGGVLPAAAEEAAEGPAR
GAWPSVTEMRKLFGGPGSRRPSADSESPGTPSPDGAAWEPPARESRQPPTPPPRTCFPLA
GLRSARPLTGPETEGRLRRPQQQQERAQRPADGLHSWHIFSQPQAGARASCSSSSIAASY
PVSRSRAASSSEEEEEGPPQLPGAQSPAYHGGHSSGSDDDRDGEGGHRWGGRPGLRPGSS
LLDQDCRPDSDGLNLSSMNSAGVSGSPEPPTSPRAPREEGLREWGSGSPPCVPGPQEGLR
PMSDSVGGAFRVAKVSFPSYLASPAGSRGSSRYSSTETLKDDDLWSSRGSGGWGVYRSPS
FGAGEGLLRSQARTRAKGPGGTSRALRDGGFEPEKSRQRKSLSNPDIASETLTLLSFLRS
DLSELRVRKPGGSSGDRGSNPLDGRDSPSAGGPVGQLEPIPIPAPASPGTRPTLKDLTAT
LRRAKSFTCSEKPMARRLPRTSALKSSSSELLLTGPGAEEDPLPLIVQDQYVQEARQVFE
KIQRMGAQQDDGSDAPPGSPDWAGDVTRGQRSQEELSGPESSLTDEGIGADPEPPVAAFC
GLGTTGMWRPLSSSSAQTNHHGPGTEDSLGGWALVSPETPPTPGALRRRRKVPPSGSGGS
ELSNGEAGEAYRSLSDPIPQRHRAATSEEPTGFSVDSNLLGSLSPKTGLPATSAMDEGLT
SGHSDWSVGSEESKGYQEVIQSIVQGPGTLGRVVDDRIAGKAPKKKSLSDPSRRGELAGP
GFEGPGGEPIREVEPMLPPSSSEPILVEQRAEPEEPGATRSRAQSERALPEALPPPATAH
RNFHLDPKLADILSPRLIRRGSKKRPARSSHQELRRDEGSQDQTGSLSRARPSSRHVRHA
SVPATFMPIVVPEPPTSVGPPVAVPEPIGFPTRAHPTLQAPSLEDVTKQYMLNLHSGEVP
APVPVDMPCLPLAAPPSAEAKPPEAARPADEPTPASKCCSKPQVQISIPTKTSVQISGVR
TEPETFPGASFDDSKVGVDWGSQMSTSNINRHYPSGEM

>gi568815587f:73134160_73335290|GENSCAN_predicted_CDS_4|3357_bp
atggcggacggggcaccccggccccagctttaccgcagcgtctcgttcaagctgctggag
cgctggagcggcggccccgggctgagggaggaggacacggacacccccggcttgaggcga
cgcgcctcgtgccggccgaccacggctgcccggggccagccctctcggcgcgtgtccaag
ctggcgtctgggcccctggccgcccccgcgcagccgcgcccgctccgcagcctctcgccg
tcggttcgccagctctcccggcgcttcgacgcgccgcgtctggacgacggctccgctggg
acccgagacggaggcgtcttacccgcggccgcggaagaagcggccgagggcccagcgcga
ggagcctggcccagcgtcaccgagatgcgcaagctcttcggcggtcctggctccaggagg
cccagcgccgactctgaatccccaggaacgcccagccccgacggtgccgcgtgggagcct
ccggctcgggagtcgcggcagccaccgacgccaccccctcggacatgcttccccctggcg
ggtctgcgttcggcgcggcccctgaccgggccggagaccgaagggaggctgcgccggccg
cagcagcaacaggagcgggcgcagcgtccagcggatggtttacattcttggcatatcttc
tcccaaccgcaggccggggcccgggcctcctgctcctcctcctccatcgccgcctcctat
cctgtcagccgcagtcgtgctgccagctccagcgaggaggaagaggagggcccgccgcag
ctgcctggagcccagagtccggcctaccacggcggccactcctcgggcagtgacgacgac
cgagacggtgagggcggccaccgctggggagggaggcccgggctcaggcctggaagctcc
ctattggatcaggactgcaggcctgacagtgatgggttaaatctaagcagcatgaactca
gcaggggtttctgggagccctgagcccccaacatctccaagagcccctagagaagaagga
ctccgggagtggggtagtggctctccgccctgcgtcccaggtccccaggagggacttcgg
cctatgtctgactctgtgggaggagctttccgtgtggccaaggtgagctttccctcgtac
ctggccagccccgcaggctcccgcggtagcagccgttattccagcacggagaccctcaag
gacgacgacctatggtctagtaggggttctgggggctggggcgtgtaccgctcccctagc
tttggagctggggaagggctcctgcggtcccaggctcgaacccgtgccaaaggacctgga
ggcacctctagggcattgagggatggaggatttgagcctgaaaagagtcgacagcggaag
tccctgtcaaatccagatatcgcctcagagaccctgacgcttctcagtttcctgcgctca
gacctttcagagctgagggtccgaaaacctggtgggagctccggggaccgtggaagcaac
cccctagatggcagagactcaccatccgcaggtggccctgtggggcaacttgaacccata
cccatcccagccccagcatcacctggcacgcgccccacactcaaggacttgacagccact
ctgcggagagcaaagtcattcacctgctctgagaagcccatggcccgccgcctgccccgc
accagtgctctgaagtccagctcctccgagctcctgctcacaggccctggtgccgaggag
gatccgctgcccctcatcgtccaggaccaatatgtgcaggaggcccgccaggtttttgag
aagatccagcgcatgggtgcccaacaagatgatggaagcgatgccccccctggaagccct
gactgggcaggggatgtgacccgagggcagcggtcccaggaggagctctcaggccctgag
tccagtctgacagatgaaggcattggggcagaccctgagcctcctgttgcagcattttgc
ggcctgggtaccacagggatgtggcgacctctttcctcatcctcggcccagacgaaccac
catggccctgggactgaggacagtctgggcgggtgggccctggtgtcgcctgagacccct
cccacaccaggtgccctccgccgacgacgcaaagtcccaccttcaggttctggtgggagc
gaattgagcaatggggaggcaggggaggcctacaggtccctgagtgacccaattcctcag
cgccaccgggctgccacctctgaagagcctactgggttctctgtggacagcaacctcctg
ggctcactgagccccaagacagggctccctgccacctcagccatggatgagggcttgacc
agtggtcacagtgactggtctgtgggcagtgaagagagcaagggatatcaggaggttatt
cagagcatagttcaggggcctggcaccctggggcgtgtggtggacgacaggattgctggc
aaagcccccaagaagaaatccctgagtgaccccagccgccgtggggagctggctgggcct
ggattcgagggccctggaggggagcccatccgagaagttgagcccatgctgcctccatcc
agcagcgagcccatccttgtagagcagcgggcagagccagaagaacctggtgccaccagg
agccgggcacagtctgaaagggccctacctgaggctctgcctccccctgccactgcccac
cgaaactttcaccttgaccccaagctggctgacattctgtccccgaggctaatccgccga
ggctccaagaagcgcccagctcggagtagtcaccaggagcttcggagagacgagggcagt
caggaccagactggcagcctgtctcgggcccggccctcctccagacacgttcgccatgcc
agtgtgcccgccacatttatgcctattgtggtgcctgagccaccaacttctgttggtccc
cctgtggctgtgccagaacccataggcttccctacccgagcccatcccacgttgcaggca
ccatcgctcgaggacgtcaccaagcagtacatgctgaacctgcactccggtgaggtccct
gccccagtgccagtggacatgccctgcttgcctctggctgcaccgccctctgctgaggcc
aagccccctgaggcagctcggcctgcagatgagcctacccctgccagcaagtgctgcagc
aagccacaggtgcagatttccatacccacaaagacatctgtacagatatcaggagtgagg
accgagcctgagaccttccctggagcaagctttgatgacagcaaagttggagtagattgg
ggcagccagatgtcaacaagtaacataaacaggcattatccatctggagagatgtga