GENSCAN 1.0	Date run: 4-Nov-116	Time: 08:38:19

Sequence gi568815582r:58014810_58229115 : 214306 bp : 47.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2525   2630  106  2  1  117   95    81 0.985  11.92
 1.02 Intr +   4163   4246   84  1  0  143   68    -2 0.906   3.32
 1.03 Term +   5332   5436  105  0  0   82   53   185 0.956  12.81
 1.04 PlyA +   6783   6788    6                               1.05

 2.05 PlyA -   6804   6799    6                               1.05
 2.04 Term -   8169   7927  243  0  0  107   55    60 0.282   0.50
 2.03 Intr -   8516   8419   98  0  2   96   86    37 0.739   4.03
 2.02 Intr -  10952  10838  115  2  1   36   80    47 0.310  -1.28
 2.01 Init -  11697  11356  342  0  0   82   57   300 0.681  23.54
 2.00 Prom -  12889  12850   40                              -9.16

 3.00 Prom +  13245  13284   40                             -11.53
 3.01 Init +  15589  15796  208  0  1   97   25   178 0.819  11.60
 3.02 Intr +  20370  20505  136  1  1   26   78    45 0.167  -3.07
 3.03 Intr +  21928  21961   34  1  1  102   61    36 0.477   0.53
 3.04 Intr +  22663  22811  149  0  2   91   64   221 0.988  19.03
 3.05 Intr +  23457  23585  129  0  0  103   63   222 0.998  21.01
 3.06 Intr +  25066  25373  308  1  2   77   77   514 0.988  45.29
 3.07 Intr +  25728  25889  162  1  0  103   86   309 0.993  32.15
 3.08 Intr +  26808  27061  254  1  2  128   47   203 0.545  17.35
 3.09 Intr +  27422  27560  139  1  1   96  105   178 0.994  20.34
 3.10 Intr +  28401  28551  151  1  1  123   99   329 0.999  36.72
 3.11 Intr +  28703  28818  116  2  2   81   82   146 0.854  13.39
 3.12 Intr +  30198  30584  387  1  0  101   57   785 0.269  71.46
 3.13 Intr +  38185  38316  132  2  0   37   97    65 0.090   2.92
 3.14 Intr +  42590  42724  135  0  0  109   23    49 0.019   1.04
 3.15 Intr +  56843  56960  118  0  1  120   91     6 0.019   3.62
 3.16 Intr +  78976  79152  177  1  0  139   65     4 0.588   2.23
 3.17 Intr +  79217  79304   88  2  1  120   16    65 0.367   2.47
 3.18 Intr +  81939  81998   60  2  0   58   94    44 0.161   0.93
 3.19 Intr +  83728  83887  160  0  1   76   36    83 0.234   1.46
 3.20 Term +  84052  84230  179  2  2  103   41    86 0.281   3.25
 3.21 PlyA +  86093  86098    6                               1.05

 4.08 PlyA -  86657  86652    6                               1.05
 4.07 Term -  99221  99216    6  2  0  122   36     0 0.461  -3.93
 4.06 Intr - 100111 100001  111  1  0   28  121    85 0.934   6.38
 4.05 Intr - 100648 100460  189  1  0  114   84   310 0.999  32.98
 4.04 Intr - 101343 101232  112  2  1   78  111    88 0.998  10.48
 4.03 Intr - 102142 102063   80  1  2  103  100    96 0.999  10.65
 4.02 Intr - 105268 105215   54  1  0  115   64    17 0.613   1.18
 4.01 Init - 114306 114223   84  0  0   68   58   113 0.864   7.12
 4.00 Prom - 129558 129519   40                              -6.26

 5.14 PlyA - 140345 140340    6                               1.05
 5.13 Term - 143544 143364  181  2  1  147   42   143 0.937  12.68
 5.12 Intr - 146256 146085  172  1  1   93    9    57 0.254  -2.70
 5.11 Intr - 150899 150751  149  1  2   91   65   188 0.974  16.68
 5.10 Intr - 151875 151775  101  0  2   73   98   163 0.999  14.71
 5.09 Intr - 152499 152398  102  0  0   88   99    79 0.997   9.37
 5.08 Intr - 152986 152876  111  1  0   75   97    94 0.989   9.58
 5.07 Intr - 153884 153801   84  2  0   91   76    17 0.634   0.82
 5.06 Intr - 159701 159642   60  2  0   64  106    86 0.838   6.93
 5.05 Intr - 172047 171946  102  0  0   41   66   127 0.954   6.17
 5.04 Intr - 182035 181924  112  0  1   93  113   127 0.987  16.08
 5.03 Intr - 183029 182824  206  2  2   98   55   116 0.698   7.30
 5.02 Intr - 186896 186853   44  0  2  102   85     1 0.153  -0.84
 5.01 Init - 195533 195437   97  2  1   61   98    62 0.213   4.97
 5.00 Prom - 214154 214115   40                              -2.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:58014810_58229115|GENSCAN_predicted_peptide_1|98_aa XTFIGLEVTSGHAQFLDLVSEVDRVMEEFNLTTFYQDPSFHLSLAWCVGDARLQLEGQCL QELQAIVDGFEDAEVLLRVHTEQVRCKSGNKFFSMPLK >gi568815582r:58014810_58229115|GENSCAN_predicted_CDS_1|297_bp nngacctttattgggcttgaggtcacttcagggcatgcccagttcctggacctggtttca gaggtggacagagtcatggaggaattcaacctcaccactttctaccaggatccttctttc cacctcagcctggcctggtgtgtgggtgatgcacgtctccagctggaggggcagtgcctg caggaactacaggcaatcgtggatgggtttgaagatgctgaggtgctgctgcgcgtgcac actgagcaagtccgctgcaagtctgggaacaagttcttctcgatgcctttgaagtga >gi568815582r:58014810_58229115|GENSCAN_predicted_peptide_2|265_aa MDLRVFGRYAKAQAAQKHQEQRQQSRPRRLLPVAEEAARPARPSGRARVAAHALGAQPSS EHCTPDPRTRLGNLRRSLAPGPAGAPRTSGSRSPPEPPRSPRQIPKPEEKRAGQPSSPRA AAAALLKGQCAPNARTRAGTGGGAPAAGFSQAGAPRAMHPPDPPSKVKQALGLFVPRQGI NKQAKNCLPCFSSYGESPVMEEPRLLLHCQRERSLNGLTLLQRHTQCLGARQAPAASLLL RPSLPTPPPQRDSCFLPPLPPSILH >gi568815582r:58014810_58229115|GENSCAN_predicted_CDS_2|798_bp atggacctccgcgtcttcggccgctacgccaaggcccaggcagcccagaagcaccaggag cagcggcagcagtcgcggccgcgccgcctcctcccggtcgccgaggaggctgcccgtcca gcccggccgtccgggcgcgctcgggtcgctgcccatgctctcggcgcccagcccagctcg gaacactgcacgccggatcctcgcacgcgccttggaaacttgcgacgctccctggctcct ggcccggcaggcgcgccgcggacctcgggctcccggtccccgccggagccgcctaggtcg cctcggcaaattcccaagcccgaggagaagcgggcagggcagccctcctccccgcgcgcc gcggccgccgcgctcttaaagggccagtgtgctcccaatgcgcggacccgggcggggacc gggggcggtgccccagccgcaggcttctcacaagcaggggcccctcgggcgatgcaccca cccgacccaccatcaaaggtgaagcaggccttgggactcttcgtgccaagacaaggaata aacaaacaagccaagaattgtctcccatgtttctcctcttatggagaatctcctgttatg gaagagccccggctgcttctgcactgtcagcgggagcgctcgctcaacgggcttactctg ctccagaggcacactcagtgtctgggtgcccgccaagctcctgccgccagcctcctcctc aggcccagcctgcccacccctccccctcagcgagactcctgcttccttcctccattgcca ccttccattctccattga >gi568815582r:58014810_58229115|GENSCAN_predicted_peptide_3|1073_aa MELGLDPRGKEGCLFPNPMALVGMEEKVGRDPTQPPVSGQVDKPLVPAISQPLEMPPILL 
PSADGDAGQGWWICCSTPSSTGALGEVSRAWEEQGGYPRHCQYLIELWECALGQRRFKVF GEQCQRNWLRLYGYLPQPSRHMSTMRSAQILASALAEMQRFYGIPVTGVLDEETKEWMKR PRCGVPDQFGVRVKANLRRRRKRYALTGRKWNNHHLTFSIQNYTEKLGWYHSMEAVRRAF RVWEQATPLVFQEVPYEDIRLRRQKEADIMVLFASGFHGDSSPFDGTGGFLAHAYFPGPG LGGDTHFDADEPWTFSSTDLHGNNLFLVAVHELGHALGLEHSSNPNAIMAPFYQWKDVDN FKLPEDDLRGIQQLYGTPDGQPQPTQPLPTVTPRRPGRPDHRPPRPPQPPPPGGKPERPP KPGPPVQPRATERPDQYGPNICDGDFDTVAMLRGEMFVFKGRWFWRVRHNRVLDNYPMPI GHFWRGLPGDISAAYERQDGRFVFFKGDRYWLFREANLEPGYPQPLTSYGLGIPYDRIDT AIWWEPTGHTFFFQEDRYWRFNEETQRGDPGYPKPISVWQGIPASPKGAFLSNDAAYTYF YKGTKYWKFDNERLRMEPGYPKSILRDFMGCQEHVEPGPRWPDVARPPFNPHGGAEPGAD SAEGDVGDGDGDFGAGVNKDGGSRVVVQMEEVARTVNVVMVLVPLLLLLCVLGLTYALVQ MQRKGPQGAHLFIRQKYTENQGSLWGLWHNPTVALRKSPWGSSRQRVEDNRRGAQNVIRR LAACTHAYASSQFKVVQGCRSRLRTLKEHQRRGSHWPVAAASSSSPDTLWILVLLFLPLE SPNTPHSQLTQLRLSSTRSLIPILRASPQVVPPAPQTTPHPPCSSPIAPSPIGVSSSPLH CSSQNPGFILSWSPLAAFFLPLLLAPIFPKNHLLRADMGPWHLICQKGQYHLSSRCICEE QTLSHFCSRAPAALLFFISVFLFKFQREESNSRGLALASPGDHHAIVFSQQPRAGASVFP PGEDFFISPLLAIKPFYAAETQLGAQEGGQPQGPHEDPKTMTTTVTMTTLTVD >gi568815582r:58014810_58229115|GENSCAN_predicted_CDS_3|3222_bp atggagctgggcctggatcccagaggcaaagagggatgcctcttccctaatcccatggcc ctcgtgggaatggaagaaaaagtgggaagagacccaactcagcctccagtatcaggacag gtggataagcccctggtaccagccattagccagcccctggagatgccccccatcctgctg ccatcggccgatggtgatgcaggccaaggatggtggatctgctgctccacaccttccagc acaggagccttgggagaggtttcccgtgcttgggaagaacagggagggtacccacgacat tgccagtatcttattgagctgtgggaatgcgccctgggccagaggagatttaaggtcttt ggggagcagtgccaaagaaactggctgcggctttatggctacctgcctcagcccagccgc catatgtccaccatgcgttccgcccagatcttggcctcggcccttgcagagatgcagcgc ttctacgggatcccagtcaccggtgtgctcgacgaagagaccaaggagtggatgaagcgg ccccgctgtggggtgccagaccagttcggggtacgagtgaaagccaacctgcggcggcgt cggaagcgctacgccctcaccgggaggaagtggaacaaccaccatctgacctttagcatc cagaactacacggagaagttgggctggtaccactcgatggaggcggtgcgcagggccttc cgcgtgtgggagcaggccacgcccctggtcttccaggaggtgccctatgaggacatccgg ctgcggcgacagaaggaggccgacatcatggtactctttgcctctggcttccacggcgac 
agctcgccgtttgatggcaccggtggctttctggcccacgcctatttccctggccccggc ctaggcggggacacccattttgacgcagatgagccctggaccttctccagcactgacctg catggaaacaacctcttcctggtggcagtgcatgagctgggccacgcgctggggctggag cactccagcaaccccaatgccatcatggcgccgttctaccagtggaaggacgttgacaac ttcaagctgcccgaggacgatctccgtggcatccagcagctctacggtaccccagacggt cagccacagcctacccagcctctccccactgtgacgccacggcggccaggccggcctgac caccggccgccccggcctccccagccaccacccccaggtgggaagccagagcggccccca aagccgggccccccagtccagccccgagccacagagcggcccgaccagtatggccccaac atctgcgacggggactttgacacagtggccatgcttcgcggggagatgttcgtgttcaag ggccgctggttctggcgagtccggcacaaccgcgtcctggacaactatcccatgcccatc gggcacttctggcgtggtctgcccggtgacatcagtgctgcctacgagcgccaagacggt cgttttgtctttttcaaaggtgaccgctactggctctttcgagaagcgaacctggagccc ggctacccacagccgctgaccagctatggcctgggcatcccctatgaccgcattgacacg gccatctggtgggagcccacaggccacaccttcttcttccaagaggacaggtactggcgc ttcaacgaggagacacagcgtggagaccctgggtaccccaagcccatcagtgtctggcag gggatccctgcctcccctaaaggggccttcctgagcaatgacgcagcctacacctacttc tacaagggcaccaaatactggaaattcgacaatgagcgcctgcggatggagcccggctac cccaagtccatcctgcgggacttcatgggctgccaggagcacgtggagccaggcccccga tggcccgacgtggcccggccgcccttcaacccccacgggggtgcagagcccggggcggac agcgcagagggcgacgtgggggatggggatggggactttggggccggggtcaacaaggac gggggcagccgcgtggtggtgcagatggaggaggtggcacggacggtgaacgtggtgatg gtgctggtgccactgctgctgctgctctgcgtcctgggcctcacctacgcgctggtgcag atgcagcgcaagggaccacaaggcgcccatctcttcatccgtcaaaagtatactgagaac cagggttccctctggggtctctggcacaaccccacagttgctcttcgcaagagcccctgg ggatcaagtagacaaagggtggaagataacaggaggggggcacagaatgtgattaggagg cttgctgcatgcacgcatgcctatgccagcagtcagttcaaggtggtgcagggctgcagg agcaggcttcggacattaaaggagcatcagagaagaggcagccattggccagtggcagca gcctcctcatcctctccagacacactctggatcttggttctgctcttcttgcctctggag agccccaacactccccattcccagttgactcagctcaggctctccagcactcgtagcctt atacccatcctgcgggcatctccccaagttgttccaccagctcctcagactaccccccac ccaccctgctcctccccaattgctcccagccccattggtgtctcctccagccccctccat tgctcaagtcagaacccagggttcatcctgagctggtccccgctggctgctttcttcctg 
cccctgctcctggctcccatcttccccaaaaatcacctcctaagggccgacatgggacct tggcacctcatctgtcaaaaaggacagtaccacctgtccagcaggtgcatctgtgaggaa cagactctttctcacttctgctccagagcccccgcggctctcctcttcttcatctctgtt ttcctcttcaaattccagagagaggaatccaacagccgtggcctggcactggccagccca ggagatcatcatgcaatcgtcttcagccagcagcccagggcaggagcctcagtcttccca cctggtgaagatttcttcatttctcctctcctcgccataaaacccttctatgcagctgag acacagttgggggcccaagaaggaggccaaccacaaggaccacatgaggaccctaagaca atgacaactacagttaccatgacaacactcactgtggactga >gi568815582r:58014810_58229115|GENSCAN_predicted_peptide_4|211_aa MFKNTFQSGFLSILYSIGSKPLQIWDKKKIQEEYRHLGWEHEMASEVRNGHIKRITDNDI QSLVLEIEGTNVSTTYITCPADPKKTLGIKLPFLVMIIKNLKKYFTFEVQVLDDKNVRRR FRASNYQSTTRVKPFICTMPMRLDDGWNQIQFNLLDFTRRAYGTNYIETLRVQIHANCRI RRVYFSDRLYSEDELPAEFKLYLPVQNKAKQ >gi568815582r:58014810_58229115|GENSCAN_predicted_CDS_4|636_bp atgttcaaaaacacgttccagagcggcttcctctccatcctctacagcatcggcagcaag cctctgcaaatctgggacaaaaagaaaatacaggaggaatatcgccacttaggttgggaa catgagatggcgagtgaggtacggaatggccacatcaaaagaatcactgataatgacatc cagtccctggtgctagagattgaagggacaaatgtaagcaccacatatatcacatgccct gcagaccccaagaagacgctgggaattaaacttcctttccttgtcatgattatcaaaaac ctgaagaagtattttaccttcgaagtgcaggtactagatgacaagaatgtgcgtcgtcgc tttcgggcaagtaactaccagagcaccacccgggtcaaacccttcatctgcaccatgccc atgcggctggatgacggctggaaccagattcagttcaacttgctagacttcacacggcga gcatacggcaccaattacatcgagaccctcagagtgcagatccatgcaaattgtcgcatc cgacgggtttacttctcagacagactctactcagaagatgagctgccggcagagttcaaa ctgtatctcccagttcagaacaaggcaaagcaataa >gi568815582r:58014810_58229115|GENSCAN_predicted_peptide_5|506_aa MVDSRAGTGKVEDEPGANFVPKTRKRSEDDRTESPPYPSSYPSETPSRPAARCLLRPTPR VPRRAAAATLCAPRRPPVPPAMPGPAAGSRARVYAEVNSLRSREYWDYEAHVPSWGNQDD YQLVRKLGRGKYSEVFEAINITNNERVVVKILKPVKKKKIKREVKILENLRGGTNIIKLI DTVKDPVQLYQILTDFDIRFYMYELLKALDYCHSKGIMHRDVKPHNVMIDHQQKKLRLID WGLAEFYHPAQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSLGCMLASMIFRREPFFHG QDNYDQLVRIAKVLGTEELYGYLKKYHIDLDPHFNDILGQHSRKRWENFIHSENRHLVSP EALDLLDKLLRYDHQQRLTAKEAMEHPYFCRVGGWVGEAIECMTQTKLSWAFSPTREALM 
SFIEALKHQSLSQASSQTFCNMSALTRSVAVLPLFHKQNKNQIKRLNAYREITFREQTQN GGRFGEHELDQAKGSPPPYIKPHFRM >gi568815582r:58014810_58229115|GENSCAN_predicted_CDS_5|1521_bp atggttgattccagggctggaacaggaaaagtagaagatgaaccaggagcaaattttgtg ccaaaaacaagaaagcgatcagaggacgataggacagagtcaccaccctatccctcttca tatccaagtgagactccttcgcgcccggcggcccgctgcctcctccgcccgacgccccgc gtcccccgccgcgccgccgccgccaccctctgcgccccgcgccgccccccggtcccgccc gccatgcccggcccggccgcgggcagcagggcccgggtctacgccgaggtgaacagtctg aggagccgcgagtactgggactacgaggctcacgtcccgagctggggtaatcaagatgat taccaactggttcgaaaacttggtcggggaaaatatagtgaagtatttgaggccattaat atcaccaacaatgagagagtggttgtaaaaatcctgaagccagtgaagaaaaagaagata aaacgagaggttaagattctggagaaccttcgtggtggaacaaatatcattaagctgatt gacactgtaaaggaccccgtgcaactctaccagatcctgacagactttgatatccggttt tatatgtatgaactacttaaagctctggattactgccacagcaagggaatcatgcacagg gatgtgaaacctcacaatgtcatgatagatcaccaacagaaaaagctgcgactgatagat tggggtctggcagaattctatcatcctgctcaggagtacaatgttcgtgtagcctcaagg tacttcaagggaccagagctcctcgtggactatcagatgtatgattatagcttggacatg tggagtttgggctgtatgttagcaagcatgatctttcgaagggaaccattcttccatgga caggacaactatgaccagcttgttcgcattgccaaggttctgggtacagaagaactgtat gggtatctgaagaagtatcacatagacctagatccacacttcaacgatatcctgggacaa cattcacggaaacgctgggaaaactttatccatagtgagaacagacaccttgtcagccct gaggccctagatcttctggacaaacttctgcgatacgaccatcaacagagactgactgcc aaagaggccatggagcacccatacttctgcagggtaggtggctgggtaggagaagctatt gagtgtatgacgcagaccaagctttcctgggccttctcaccaactagagaagcgctgatg tcgttcattgaggcacttaaacaccagtcactcagccaggcctcctcccagacattctgt aacatgtcagcactcacaaggtctgttgcggttctcccacttttccataagcagaacaag aaccaaatcaaacgtcttaacgcgtatagagagatcacgttccgtgagcagacacaaaac ggtggcaggtttggcgagcacgaactagaccaagcgaagggcagcccaccaccgtatatc aaacctcacttccgaatgtaa