GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:43:24

Sequence gi568815582r:69656538_69857937 : 201400 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3190   3362  173  1  2   83   86    47 0.153   3.59
 1.02 Term +   8763  10105 1343  1  2   67   39   464 0.117  31.08
 1.03 PlyA +  10930  10935    6                               1.05

 2.00 Prom +  11077  11116   40                              -6.66
 2.01 Init +  34405  34551  147  0  0   72  121    53 0.615   7.09
 2.02 Intr +  35212  37702 2491  0  1   67   55   878 0.845  69.75
 2.03 Term +  38599  38834  236  2  2   97   36    76 0.508  -0.22
 2.04 PlyA +  38974  38979    6                               1.05

 3.07 PlyA -  39326  39321    6                               1.05
 3.06 Term -  54744  54439  306  0  0  108   50   184 0.999  11.72
 3.05 Intr -  56592  56491  102  0  0   81   82    67 0.982   5.77
 3.04 Intr -  58540  58427  114  1  0  107   77   104 0.847  11.84
 3.03 Intr -  61716  61586  131  1  2  156   80    97 0.971  16.11
 3.02 Intr -  61997  61833  165  0  0   77   91   221 0.973  21.23
 3.01 Init -  69902  69896    7  2  1   86  109    11 0.491   3.41
 3.00 Prom -  79933  79894   40                              -2.86

 4.10 PlyA -  82684  82679    6                               1.05
 4.09 Term -  85636  85507  130  0  1   73   34   113 0.950   2.05
 4.08 Intr -  86064  85799  266  0  2   91   67   304 0.935  24.91
 4.07 Intr -  88480  88336  145  0  1  101  100   228 0.991  25.58
 4.06 Intr -  91792  91695   98  1  2   57   55   184 0.767  10.81
 4.05 Intr -  92581  92381  201  1  0   42   94   327 0.981  28.18
 4.04 Intr -  92801  92676  126  2  0   77   81   126 0.999  11.68
 4.03 Intr -  95834  95704  131  0  2   72   94    47 0.762   4.11
 4.02 Intr -  98189  98057  133  2  1   89   92   273 0.996  28.02
 4.01 Init -  98373  98311   63  0  0  117   96    98 0.979  13.06
 4.00 Prom -  98910  98871   40                              -6.26

 5.02 PlyA -  99188  99183    6                              -0.45
 5.01 Sngl - 101020  99998 1023  1  0   58   38   936 0.984  82.67
 5.00 Prom - 105183 105144   40                              -1.76

 6.00 Prom + 110902 110941   40                              -4.56
 6.01 Init + 113512 113560   49  0  1   71   58    61 0.175   0.51
 6.02 Intr + 119652 119704   53  1  2  124   91   -10 0.108   1.53
 6.03 Intr + 130459 130543   85  0  1  109  113     9 0.475   4.89
 6.04 Intr + 142145 142292  148  0  1  120   41   220 0.973  19.79
 6.05 Intr + 142637 142795  159  2  0   82   31   126 0.367   5.40
 6.06 Intr + 161408 161511  104  2  2  109   65    27 0.001   2.32
 6.07 Intr + 167718 167795   78  1  0  103   84    -5 0.000   0.12
 6.08 Intr + 183589 183726  138  2  0   87  101   158 0.145  17.44
 6.09 Intr + 185487 185583   97  1  1  100  111    58 0.702   8.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 176914 176860   55  1  1   80   86    17 0.861   2.15
S.002 Init + 183597 183726  130  2  1   85  101   145 0.834  15.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_1|505_aa XVDCVGILKLRNADVEARIGIAGSKKKSTRARLVFRVNIMRKDGSTLTLQTPSSPILCML EVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYK INVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEI KEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQK RACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIY NYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK TIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKRKSFCTAKETTIRVNRQ PTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPINKWAKDMNRHFSKEDIYAGKKTH EKMLTITGHQRNTNQNHNEIPSHTS >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_1|1518_bp nnggtggactgcgtagggatattgaaattgaggaatgctgatgtcgaagccagaatagga attgctggttccaagaagaaaagcactcgtgccagattggtttttcgagttaatatcatg aggaaagatggctccactttgacactgcaaacaccctcttctccaattttgtgtatgttg gaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaa gaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccatt gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaagggatgtgaaggacctcttcaaggagaactacaagccactgctcaatgaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatc gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagcta
ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcctgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaag gacttcatgtctaaaacaccaaaagcaatggcaaccaaagccaaaattgacaaatgggat ctaattaaacggaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccaga atctacaatgaactcaaacagatttacaagaaaaaaacaaacaaccccatcaacaagtgg gcgaaggacatgaacagacacttctcaaaagaagacatttatgcaggaaaaaaaacacat gaaaaaatgctcaccatcactggccatcagagaaatacaaatcaaaaccacaatgagata ccatctcacaccagttag >gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_2|957_aa MKTTGCNLDKVNIIPNALMTPLIPSSMIKSEDVTPMEVTAEKRSSTIFKTTKSVGSTQQT LENISNIAGNGSFSSPSSSHLPSENEKQQQIQPKAYNPETLTTIQTQDISQPGTFPAVSA SSQLPNSDALLQQATQFQTRETQSREILQSDGTVVNLSQLTEASQQQQQSPLQEQAQTLQ QQISSNIFPSPNSVSQLQNTIQQLQAGSFTGSTASGSSGSVDLVQQVLEAQQQLSSVLFS APDGNENVQEQLSADIFQQVSQIQSGVSPGMFSSTEPTVHTRPDNLLPGRAESVHPQSEN TLSNQQQQQQQQQQVMESSAAMVMEMQQSICQAAAQIQSELFPSTASANGNLQQSPVYQQ TSHMMSALSTNEDMQMQCELFSSPPAVSGNETSTTTTQQVATPGTTMFQTSSSGDGEETG TQAKQIQNSVFQTMVQMQHSGDNQPQVNLFSSTKSMMSVQNSGTQQQGNGLFQQGNEMMS LQSGNFLQQSSHSQAQLFHPQNPIADAQNLSQETQGSLFHSPNPIVHSQTSTTSSEQMQP PMFHSQSTIAVLQGSSVPQDQQSTNIFLSQSPMNNLQTNTVAQEAFFAAPNSISPLQSTS NSEQQAAFQQQAPISHIQTPMLSQEQAQPPQQGLFQPQVALGSLPPNPMPQSQQGTMFQS QHSIVAMQSNSPSQEQQQQQQQQQQQQQQQQQSILFSNQNTMATMASPKQPPPNMIFNPN QNPMANQEQQNQSIFHQQSNMAPMNQEQQPMQFQSQSTVSSLQNPGPTQSESSQTPLFHS SPQIQLVQGSPSSQEQQVTLFLSPASMSALQTSINQQDMQQSPLYSPQNNMPGIQGATSS PQPQATLFHNTAGGTMNQLQNSPGSSQQTSGMFLFGIQNNCSQLLTSGPATLPDQLMAIS QPGQPQNEGQPPVTTLLSQQMPENSPLASSINTNQNIEKIDLLVSLQNQGNNLTGSF >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_2|2874_bp 
atgaaaactactggatgtaatttagataaggtaaatattatccctaatgccctgatgact ccactcataccaagcagtatgattaagagtgaagatgttactccaatggaagtaacagca gaaaaaagatcttccactatttttaagactacaaagtctgttggatcaactcagcaaaca ttagaaaacatctcaaacatagcaggaaatggctctttttcatcaccatcatcttcccac ctaccttctgaaaatgaaaaacagcagcagattcagcccaaggcatacaacccagagacc ctgacaactattcaaacccaggacatctcacagcctggtacttttccagcagtttctgct tctagtcagctgcccaacagcgatgcactattgcagcaggctacacagtttcagacaaga gaaactcagtctagagagatattacagtcagatggtacagtggttaatttgtcacaactg actgaggcatcacaacaacagcagcagtcaccactacaagaacaagcacagactttacag cagcagatttcatcaaatatttttccatcaccaaatagtgtgagtcagcttcagaatact attcagcagctgcaagcagggagtttcacaggcagtactgctagtggcagcagtggaagt gttgacttggtccaacaagttttagaggcacagcagcagttatcttcagttttattttct gctccagatggtaatgagaatgttcaagagcagcttagtgcagatatttttcaacaagtc agtcaaattcagagtggtgtaagccctggaatgttttcctcaacagagccaacagtccat accagaccagataatttattacctggaagagctgaaagtgttcatccacagtctgaaaac acgttatctaatcaacagcagcagcagcagcagcaacagcaagtgatggaatcttcagcc gcaatggtgatggagatgcaacagagtatctgccaggcagctgcccagattcagtcagag ttattcccttcaactgcttcagcaaatggaaaccttcagcaatcgccagtttaccagcag acttctcacatgatgagtgcattgtctaccaatgaggatatgcaaatgcagtgtgaattg ttttcttctcctcctgcagtttctggaaatgaaacttctacaactaccacacagcaggtt gcaacccctggcactaccatgtttcagacatcaagttcaggagatggagaagaaactgga acacaagcaaaacagattcagaacagtgtctttcagaccatggtccaaatgcaacatagt ggggacaatcaacctcaagttaaccttttttcatccacaaaaagtatgatgagtgttcag aatagtggtacccaacaacaaggtaatggtttattccagcaagggaatgagatgatgtca cttcaatctggaaattttttgcagcagtcttctcattcacaggcccaactttttcatcct caaaatcctattgccgatgctcagaacctttcccaggaaactcaaggttctctctttcat agtccaaatcctattgtccacagtcagacttctacaacctcctctgaacaaatgcagcct ccaatgtttcactctcaaagtaccattgctgtgttacagggctcttcagttcctcaagac cagcagtcaaccaacatatttctttcccagagtcccatgaataatcttcagactaacaca gtagcccaagaagcattttttgcagcaccgaactcaatttctccacttcagtcaacatca aacagtgaacaacaagctgctttccaacagcaagctccaatatcacacatccagactcct atgctttcccaagaacaggcacaacccccgcagcagggtttatttcagcctcaggtggcc 
ctgggctcccttccacctaatccaatgcctcaaagccaacaaggaaccatgttccagtca cagcactcaatagttgccatgcagagtaactctccatcccaggaacagcagcagcagcag caacagcagcagcaacagcagcagcaacaacaacagagcattttattcagtaatcagaat accatggctacaatggcgtctccaaagcaaccaccaccaaacatgatattcaacccaaat caaaatccaatggctaatcaggagcaacagaaccagtcaatttttcaccaacaaagtaac atggccccaatgaatcaagagcaacagcccatgcaatttcagagtcagtccacagtttcc tcacttcagaacccaggtcctacccagtcggaatcatcacagacccccttgttccatagc tctcctcagattcagttggtacaagggtcacctagttctcaagagcagcaagtaactctc ttcttatctccagcatccatgtctgccttgcagaccagtataaatcaacaagatatgcaa cagtctcctctttattcccctcagaacaacatgcctggaattcaaggagccacatcttcg cctcaaccacaggctactttatttcacaacacagcaggaggcacaatgaaccaactgcag aattctcctggctcatctcagcagacatcaggaatgttcttatttggcattcaaaataac tgtagtcagcttttaacctctggaccagctacattgcctgatcagttgatggccataagt cagccaggccaaccacaaaacgagggccagccacctgtgacaacacttctttctcagcaa atgccagagaattctccactggcatcctctataaacaccaaccagaacatcgaaaagatt gatttgcttgtttcattgcaaaaccaagggaacaacttgactggctccttttaa >gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_3|274_aa MVGRRALIVLAHSERTSFNYAMKEAAAAALKKKGWEVVESDLYAMNFNPIISRKDITGKL KDPANFQYPAESVLAYKEGHLSPDIVAEQKKLEAADLVIFQFPLQWFGVPAILKGWFERV FIGEFAYTYAAMYDKGPFRSKKAVLSITTGGSGSMYSLQGIHGDMNVILWPIQSGILHFC GFQVLEPQLTYSIGHTPADARIQILEGWKKRLENIWDETPLYFAPSSLFDLNFQAGFLMK KEVQDEEKNKKFGLSVGHHLGKSIPTDNQIKARK >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_3|825_bp atggtcggcagaagagcactgatcgtactggctcactcagagaggacgtccttcaactat gccatgaaggaggctgctgcagcggctttgaagaagaaaggatgggaggtggtggagtcg gacctctatgccatgaacttcaatcccatcatttccagaaaggacatcacaggtaaactg aaggaccctgcgaactttcagtatcctgccgagtctgttctggcttataaagaaggccat ctgagcccagatattgtggctgaacaaaagaagctggaagccgcagaccttgtgatattc cagttccccctgcagtggtttggagtccctgccattctgaaaggctggtttgagcgagtg ttcataggagagtttgcttacacttacgctgccatgtatgacaaaggacccttccggagt aagaaggcagtgctttccatcaccactggtggcagtggctccatgtactctctgcaaggg atccacggggacatgaatgtcattctctggccaattcagagtggcattctgcatttctgt ggcttccaagtcttagaacctcaactgacatatagcattgggcacactccagcagacgcc 
cgaattcaaatcctggaaggatggaagaaacgcctggagaatatttgggatgagacacca ctgtattttgctccaagcagcctctttgacctaaacttccaggcaggattcttaatgaaa aaagaggtacaggatgaggagaaaaacaagaaatttggcctttctgtgggccatcacttg ggcaagtccatcccaactgacaaccagatcaaagctagaaaatga >gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_4|430_aa MAPVEHVVADAGAFLRHAALQDIGKNIYTIREVVTEIRDKATRRRLAVLPYELRFKEPLP EYVRLVTEFSKKTGDYPSLSATDIQVLALTYQLEAEFVGVSHLKQEPQKPKPPQETEKGH SACEPENLEFSSFMFWRNPLPNIDHELQELLIDRGEDVPSEEEEEEENGFEDRKDDSDDD GGGWITPSNIKQIQQELEQCDVPEDVRVGCLTTDFAMQNVLLQMGLHVLAVNGMLIREAR SYILRCHGCFKTTSDMSRVFCSHCGNKTLKKVSVTVSDDGTLHMHFSRNPKVLNPRGLRY SLPTPKGGKYAINPHLTEDQRFPQLRLSQKARQKTNVFAPDYIAGVSPFVENDISSRSAT LQVRDSTLGAGRRRLNPNASRKKFVKKRELDCPMACEHRSAWLPAKEVQLHKNRKNNALE PIFKKGISKG >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_4|1293_bp atggctccagtggagcacgttgtggcggatgctggggctttcctgcggcatgcggctctg caggacatcgggaagaacatttacaccatccgggaggtggtcactgagattcgggacaag gccacacgcaggcggctcgctgtcctgccctacgagctgcggttcaaggagcccttaccg gaatacgtgcggctggtgactgagttttcaaagaaaacaggagactaccccagcctctct gccacggacatccaagtgcttgcactcacataccagttggaagcagagtttgttggggtg tctcacctaaaacaagaaccacagaagcctaaacccccacaagaaacagaaaaaggacac tcagcttgtgagcctgagaacctggaatttagttccttcatgttctggagaaaccctttg cccaacatcgatcatgaactgcaggagctgctgattgacagaggtgaggacgttccaagt gaggaggaggaggaggaagaaaacgggtttgaagacagaaaagatgacagcgatgacgac gggggtggctggataacccccagtaacatcaagcagatccagcaggagctggagcagtgt gacgtccccgaggacgtgcgggttggctgcctgaccacagacttcgccatgcagaatgtt ctgctgcagatggggctgcacgtgctggcggtgaacggcatgctgattcgtgaggcccgg agctacatcttgcgctgccatggctgtttcaagacaacgtctgacatgagccgagtgttc tgctcacactgtgggaacaagaccctgaagaaagtgtccgtgaccgtcagcgacgacggc accctgcacatgcacttctcccgcaaccccaaggtgctgaacccccgcggcctccggtac tcgcttcccactcccaaagggggcaaatacgccatcaacccccatctcaccgaggatcag cgcttccctcagctgcgactctcccaaaaggccaggcagaaaaccaacgtgttcgcccct gactacatcgccggggtgtcaccctttgtcgagaatgacatctccagccgctcagctacc ctgcaggtccgggacagcaccttgggagctgggcggagacgcttaaatcccaacgcttcc 
agaaagaagtttgtgaagaaaagggaactggactgtcccatggcctgtgagcaccggagc gcctggctgcctgccaaggaagtgcaattgcataaaaacagaaagaacaacgccctggag ccaatcttcaagaaaggaatttccaaaggataa >gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_5|340_aa MPLCGKQLRVRFSCHSASLTVRNLPQYVSNEVLEEAFSVFGQVERAVVIVDDRGRPSGKG IVEFSGKPAAGKALDRRSEGSFLLTTFPRPVTMEPMDQLDDEGGLPEKLVIKNQQFHKER EQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKPEMEMEAARHKHQVML MRQNLMRRQEELRRMEELHNQEVQKRKQLELRQEEERRRPEEEMRRQQEEMMQPQQEGFK GTFPDVREQEIWMGQMAMGGAMGINSRGALTPAPVPAGTPAPPGPATMMPDGTLGLTPPT TERFGQAATMEGIGAIGETPPAFNHAAPGAEFAPNKRRRY >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_5|1023_bp atgccactctgtggaaagcagctgcgtgtgcgcttttcctgccatagtgcatcccttaca gttcgaaaccttcctcagtatgtgtccaacgaagtgctggaagaagccttttctgtgttt ggccaggtagagagggctgtagtcattgtggatgatcgaggaaggccctcaggaaaaggc attgttgagttctcagggaagccagctgctgggaaagctctggacagacgcagtgaaggc tccttcctgctaaccacatttcctcgtcctgtgactatggagcccatggaccagttagat gatgaagggggacttccagagaagctggttataaaaaaccagcaatttcacaaggaacga gagcaaccacccagatttgcacagcctggctcctttgagtatgaatatgccatgcgctgg aaggcactcattgagatggagaagcagcagcaggaccaagtggaccgcaacatcaaagag gctcgtgagaagccagagatggagatggaggctgcacgccataagcaccaggtcatgcta atgagacagaatttgatgaggcgccaagaagaacttcggaggatggaagagctgcacaac caagaggtgcaaaaacgaaagcaactggagctcaggcaggaggaagagcgcaggcgccct gaagaagagatgcggcggcagcaagaagaaatgatgcagccacagcaggaaggattcaag ggaaccttccctgatgtgagagagcaggagatttggatgggtcagatggctatgggaggt gctatgggcataaacagcagaggtgccttgacccctgctcctgtgccagctggtacccca gctcctccaggacctgccactatgatgccggatggaactttgggattgaccccaccaaca actgaacgctttggtcaggctgctacaatggaaggaattggggcaattggtgaaactcct cctgcattcaaccatgcagctcctggagctgaatttgctccaaacaaacgtcgccgatac taa >gi568815582r:69656538_69857937|GENSCAN_predicted_peptide_6|304_aa MGFRRVGQAGLELLSSDETERLKRAGVRGLLFHQLHGDDMASASSSRAGVALPFEKSQLT LKVVSAKPKVHNRQPRINSYVEVAVDGLPSETKKTGKRIGSSELLWNEIIILNVTAQSHL DLKVWSCHTLRNELLGTASVNLSNVLKNNGGKSTYDEGGADVILGKFPGMQSLGQKVGVF QVFDACCLCPWQSLDQFTHYYQDSWSIFLSVGGLFFVGEWSVKREVENMQLTLNLQTENK 
GSVVSGGELTIFLDGPTVDLGNVPNGSALTDGSQLPSRDSSGTAVAPENRHQPPSTNCFG GRSR >gi568815582r:69656538_69857937|GENSCAN_predicted_CDS_6|912_bp atgggatttcgccgtgttggccaggctggtctcgaactcctgagctcagatgaaacagaa aggctaaagagggctggagtcaggggacttctcttccaccagcttcacggtgatgatatg gcatctgccagctctagccgggcaggagtggccctgccttttgagaagtctcagctcact ttgaaagtggtgtccgcaaagcccaaggtgcataatcgtcaacctcgaattaactcctac gtggaggtggcggtggatggactccccagtgagaccaagaagactgggaagcgcattggg agctctgagcttctctggaatgagatcatcattttgaatgtcacggcacagagtcattta gatttaaaggtctggagctgccataccttgagaaatgaactgctaggcaccgcatctgtc aacctctccaacgtcttgaagaacaatgggggcaaaagtacgtatgatgaagggggtgcc gacgtgattcttggtaaattcccaggcatgcagtcgctgggtcagaaggtgggcgtgttt caagtttttgatgcctgctgcctgtgtccatggcagagtctggaccaatttacacactac taccaggattcttggagtatttttcttagtgtaggaggcttattctttgtgggagagtgg tccgtcaaaagggaagtggagaacatgcagctgaccctgaacctgcagacggagaacaaa ggcagcgttgtctcaggcggagagctgacaattttcctggacgggccaactgttgatctg ggaaatgtgcctaatggcagtgccctgacagatggatcacagctgccttcgagagactcc agtggaacagcagtagctccagagaaccggcaccagccccccagcacaaactgctttggt ggaagatcccgn