GENSCAN 1.0 Date run: 16-Aug-121 Time: 14:40:02 Sequence gi568815591f:86665146_86964352 : 299207 bp : 36.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 52617 52657 41 2 2 81 99 37 0.324 3.81 1.02 Term + 61687 61843 157 1 1 77 38 99 0.524 0.22 1.03 PlyA + 61940 61945 6 1.05 2.00 Prom + 63802 63841 40 -4.45 2.01 Init + 68411 68550 140 1 2 62 92 106 0.868 8.06 2.02 Intr + 68894 69036 143 2 2 111 63 39 0.900 2.78 2.03 Term + 70071 70138 68 1 2 92 36 68 0.797 -0.88 2.04 PlyA + 70496 70501 6 1.05 3.00 Prom + 70669 70708 40 -6.65 3.01 Init + 71040 71087 48 2 0 65 96 58 0.733 5.30 3.02 Intr + 73097 73215 119 1 2 95 40 66 0.022 0.84 3.03 Intr + 93169 93261 93 1 0 58 88 79 0.023 3.06 3.04 Intr + 99982 100468 487 1 1 37 110 288 0.045 17.99 3.05 Intr + 100809 100994 186 2 0 -3 119 101 0.545 3.06 3.06 Term + 121116 122021 906 2 0 119 38 1226 0.848 112.00 3.07 PlyA + 123427 123432 6 1.05 4.00 Prom + 124641 124680 40 -7.55 4.01 Sngl + 132025 132369 345 0 0 49 38 211 0.830 7.99 4.02 PlyA + 132488 132493 6 1.05 5.03 PlyA - 133869 133864 6 1.05 5.02 Term - 142632 142451 182 1 2 66 42 132 0.514 3.19 5.01 Init - 160540 160483 58 1 1 62 82 65 0.830 4.82 5.00 Prom - 160590 160551 40 -3.05 6.00 Prom + 163860 163899 40 -4.65 6.01 Init + 167162 167266 105 1 0 73 95 18 0.057 -0.73 6.02 Intr + 167877 167958 82 2 1 101 81 51 0.101 3.99 6.03 Intr + 173694 174760 1067 1 2 51 89 773 0.710 62.76 6.04 Intr + 185225 185399 175 1 1 82 84 87 0.642 6.29 6.05 Term + 188730 188833 104 1 2 38 42 94 0.176 -2.74 6.06 PlyA + 189387 189392 6 1.05 7.24 PlyA - 189990 189985 6 1.05 7.23 Term - 195901 195840 62 2 2 124 35 46 0.526 -0.11 7.22 Intr - 204160 204091 70 1 1 59 108 28 0.004 -0.26 7.21 Intr - 226744 226579 166 0 1 40 100 78 0.470 3.24 7.20 Intr - 227955 227777 179 0 2 65 106 84 0.548 5.70 7.19 Intr - 232487 232361 127 1 1 51 110 42 0.353 2.46 7.18 Intr - 238301 238275 27 1 0 130 92 1 0.099 1.01 7.17 Intr - 243398 243302 97 0 1 48 97 15 0.142 -3.45 7.16 Intr - 244856 244667 190 2 1 73 87 83 0.962 4.94 7.15 Intr - 248059 247796 264 1 0 71 111 240 0.979 21.39 7.14 Intr - 249715 249578 138 1 0 66 95 43 0.812 2.54 7.13 Intr - 253399 253297 103 0 1 41 103 78 0.807 3.86 7.12 Intr - 254165 254075 91 0 1 87 69 13 0.403 -2.57 7.11 Intr - 260511 260383 129 1 0 93 115 9 0.929 3.85 7.10 Intr - 261771 261591 181 0 1 106 97 107 0.985 11.92 7.09 Intr - 273118 272981 138 1 0 77 76 118 0.845 9.24 7.08 Intr - 273815 273739 77 0 2 85 89 11 0.443 -0.78 7.07 Intr - 274970 274865 106 2 1 123 63 55 0.586 5.37 7.06 Intr - 276959 276873 87 2 0 76 84 62 0.933 3.85 7.05 Intr - 279901 279754 148 0 1 106 41 188 0.999 15.22 7.04 Intr - 282777 282582 196 1 1 81 71 200 0.973 15.15 7.03 Intr - 284260 284196 65 0 2 68 77 44 0.220 -1.26 7.02 Intr - 284710 284591 120 0 0 51 84 66 0.158 1.19 7.01 Init - 293516 293344 173 2 2 65 37 132 0.221 4.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_1|65_aa MGIEERAFEILKNRTGQSPSFKSDNYLSGEVSVLAIRNKFEMDNFDIAARIEAGGEKTQV GIKYK >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_1|198_bp atgggcatcgaggaaagggcttttgaaatattaaagaacagaactggtcaatctccgtca ttcaagagtgataactatctgtcaggagaggtctcagtactggcaataaggaacaagttt gaaatggacaactttgacattgcagccaggatagaggcagggggagaaaagacccaagtg ggaatcaaatacaagtaa >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_2|116_aa MEKIRVYLSLQGVGRLIRTGKKVFDEAKMQQRNRECQVLTDLGGVGGEYPVLLRLCLFRK VAKESRGEKRHQTGDSKEYLIGYKEPYLVLRQWEAFSPNLLTADIPAVDLHTPQYK >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_2|351_bp atggagaagataagagtttatttgagtcttcaaggagtaggaagacttataaggacaggg aaaaaggtttttgatgaagcaaagatgcagcagcgaaatagggagtgtcaggttctaaca gatctggggggagttggtggggagtatccagttttactgagactgtgtttatttaggaaa gtagccaaagaaagtaggggggagaaaagacatcagactggggactccaaagaatacctg ataggctataaggagccatatttagttctgagacagtgggaagccttcagtcctaactta ttaacagcagatatacctgctgtagatctccacactcctcagtacaaataa >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_3|612_aa MVIVGEGVVKILMGRQKEQKMELDWKMSNSHMAQQVKSMDAFEQSQGEQNGIFTGLAREK AFEKSSFYTRFMLTNLKLLLDFAAWKRGTETGFMKMLTRLQVLTLALFSKGFLLSLGDHN FLRREIKIEGDLVLGGLFPINEKGTGTEECGRINEDRGIQRLEAMLFAIDEINKDDYLLP GVKLGVHILDTCSRDTYALEQSLEFVRASLTKVDEAEYMCPDGSYAIQENIPLLIAGVIG GSYSSVSIQVVSYPPPGIDHLAQECSLSDGRGARECKLSCTKLFQVSACFAPADIPLAKA THRAQSRVKELVANLLRLFQIPQISYASTSAKLSDKSRYDYFARTVPPDFYQAKAMAEIL RFFNWTYVSTVASEGDYGETGIEAFEQEARLRNICIATAEKVGRSNIRKSYDSVIRELLQ KPNARVVVLFMRSDDSRELIAAASRANASFTWVASDGWGAQESIIKGSEHVAYGAITLEL ASQPVRQFDRYFQSLNPYNNHRNPWFRDFWEQKFQCSLQNKRNHRRVCDKHLAIDSSNYE QESKIMFVVNAVYAMAHALHKMQRTLCPNTTKLCDAMKILDGKKLYKDYLLKINFTGKPR AFKHLLRCKQVK >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_3|1839_bp atggtgattgttggggaaggtgttgtcaagatcctcatgggaaggcagaaagaacagaaa atggagctagactggaagatgtcaaattcacacatggctcagcaagtaaaaagtatggat gcttttgaacaaagtcaaggagaacagaatggcatattcacaggattagctagagagaaa gcttttgagaaatccagtttctacactcggtttatgctgacaaacctgaaacttctatta gactttgcagcttggaagagaggtacagaaacaggattcatgaagatgttgacaagactg caagttcttaccttagctttgttttcaaagggatttttactctctttaggggaccataac tttctaaggagagagattaaaatagaaggtgaccttgttttagggggcctgtttcctatt aacgaaaaaggcactggaactgaagaatgtgggcgaatcaatgaagaccgagggattcaa cgcctggaagccatgttgtttgctattgatgaaatcaacaaagatgattacttgctacca ggagtgaagttgggtgttcacattttggatacatgttcaagggatacctatgcattggag caatcactggagtttgtcagggcatctttgacaaaagtggatgaagctgagtatatgtgt cctgatggatcctatgccattcaagaaaacatcccacttctcattgcaggggtcattggt ggctcttatagcagtgtttccatacaggttgtctcttatcctcctcctggaatcgatcat ctagcccaggaatgttctctgagtgatggcagaggtgcaagagaatgcaaactcagctgt acaaagctctttcaagtttctgcttgcttcgcacctgctgacatcccattagcaaaagca actcacagggctcaatccagagtgaaggagctggtggcaaacctgctgcggctcttccag atccctcagatcagctacgcatccaccagcgccaaactcagtgataagtcgcgctatgat tactttgccaggaccgtgccccccgacttctaccaggccaaagccatggctgagatcttg cgcttcttcaactggacctacgtgtccacagtagcctccgagggtgattacggggagaca gggatcgaggccttcgagcaggaagcccgcctgcgcaacatctgcatcgctacggcggag aaggtgggccgctccaacatccgcaagtcctacgacagcgtgatccgagaactgttgcag aagcccaacgcgcgcgtcgtggtcctcttcatgcgcagcgacgactcgcgggagctcatt gcagccgccagccgcgccaatgcctccttcacctgggtggccagcgacggctggggcgcg caggagagcatcatcaagggcagcgagcatgtggcctacggcgccatcaccctggagctg gcctcccagcctgtccgccagttcgaccgctacttccagagcctcaacccctacaacaac caccgcaacccctggttccgggacttctgggagcaaaagtttcagtgcagcctccagaac aaacgcaaccacaggcgcgtctgcgacaagcacctggccatcgacagcagcaactacgag caagagtccaagatcatgtttgtggtgaacgcggtgtatgccatggcccacgctttgcac aaaatgcagcgcaccctctgtcccaacactaccaagctttgtgatgctatgaagatcctg gatgggaagaagttgtacaaggattacttgctgaaaatcaacttcacgggtaagccaaga gcctttaaacatcttctcagatgcaaacaagtgaaataa >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_4|114_aa MWKRLWNWVTGRGWNSLEGSEEDRKMWESLELPRNLLNGFDQNADSNMDNEVQLELVSDG DEELVGNWSKGNSCYVLAKRLAAFCPCPRDLWSFELERDDLGYLAEEISKQFKM >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_4|345_bp atgtggaagcgactttggaactgggtaacaggcagaggttggaacagtttggagggctca gaagaagacaggaaaatgtgggaaagtttggaacttcctagaaacttgttgaatggtttt gaccaaaatgctgatagtaatatggacaatgaagtccagcttgaattggtctcagatgga gatgaggaacttgttgggaactggagcaaaggcaactcttgttatgttttagcaaagaga ctggcagcattttgtccctgcccaagagatttgtggagctttgaacttgagagagatgat ttagggtatctggcagaagaaatttctaagcagttcaaaatgtga >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_5|79_aa MNFPKTSTAAKHAVTEISMVLEVLARAIRREKEIKGTQLGKEEVKLSLFADDMIVYLENP IISAQNPLKLISNFSKASG >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_5|240_bp atgaactttccgaagacaagcacagcagctaagcatgcagtaactgaaatttccatggtg ttggaagttctggccagggcaatcaggcgagagaaagaaataaagggtactcaattagga aaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaacccc atcatctcagcccaaaatccccttaagctgataagcaacttcagcaaagcctcaggataa >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_6|510_aa MARSQLTATPASRVQAILLPQPPKELGLQVCHHIQFGFFHAVATPVKGEVMYFSISDDAY SHAPFNPNKDADSIVKFDTFGDGMGRYNVFNFQNVGGKYSYLKVGHWAETLSLDVNSIHW SRNSVPTSQCSDPCAPNEMKNMQPGDVCCWICIPCEPYEYLADEFTCMDCGSGQWPTADL TGCYDLPEDYIRWEDAWAIGPVTIACLGFMCTCMVVTVFIKHNNTPLVKASGRELCYILL FGVGLSYCMTFFFIAKPSPVICALRRLGLGSSFAICYSALLTKTNCIARIFDGVKNGAQR PKFISPSSQVFICLGLILVQIVMVSVWLILEAPGTRRYTLAEKRETVILKCNVKDSSMLI SLTYDVILVILCTVYAFKTRKCPENFNEAKFIGFTMYTTCIIWLAFLPIFYVTSSDYRVQ TTTMCISVSLSGFVVLGCLFAPKVHIILFQPQKNVVTHRLHLNRFSVSGTGTTYSQSETE LIWKIHDAFTLIPGILAGSLSMLAPNEQVP >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_6|1533_bp atggcgcgatctcagctcactgcaacccctgcctcccgggttcaagcgattcttctgcct cagcctcccaaggagctgggactacaggtgtgccaccacatccagtttggtttctttcat gctgttgcaacaccagtgaaaggtgaagtgatgtatttttccatttctgatgatgcttac tcccatgctccattcaacccaaataaagatgcagatagcatagtcaagtttgacactttt ggagatggaatggggcgatacaacgtgttcaatttccaaaatgtaggtggaaagtattcc tacttgaaagttggtcactgggcagaaaccttatcgctagatgtcaactctatccactgg tcccggaactcagtccccacttcccagtgcagcgacccctgtgcccccaatgaaatgaag aatatgcaaccaggggatgtctgctgctggatttgcatcccctgtgaaccctacgaatac ctggctgatgagtttacctgtatggattgtgggtctggacagtggcccactgcagaccta actggatgctatgaccttcctgaggactacatcaggtgggaagacgcctgggccattggc ccagtcaccattgcctgtctgggttttatgtgtacatgcatggttgtaactgtttttatc aagcacaacaacacacccttggtcaaagcatcgggccgagaactctgctacatcttattg tttggggttggcctgtcatactgcatgacattcttcttcattgccaagccatcaccagtc atctgtgcattgcgccgactcgggctggggagttccttcgctatctgttactcagccctg ctgaccaagacaaactgcattgcccgcatcttcgatggggtcaagaatggcgctcagagg ccaaaattcatcagccccagttctcaggttttcatctgcctgggtctgatcctggtgcaa attgtgatggtgtctgtgtggctcatcctggaggccccaggcaccaggaggtataccctt gcagagaagcgggaaacagtcatcctaaaatgcaatgtcaaagattccagcatgttgatc tctcttacctacgatgtgatcctggtgatcttatgcactgtgtacgccttcaaaacgcgg aagtgcccagaaaatttcaacgaagctaagttcataggttttaccatgtacaccacgtgc atcatctggttggccttcctccctatattttatgtgacatcaagtgactacagagtgcag acgacaaccatgtgcatctctgtcagcctgagtggctttgtggtcttgggctgtttgttt gcacccaaggttcacatcatcctgtttcaaccccagaagaatgttgtcacacacagactg cacctcaacaggttcagtgtcagtggaactgggaccacatactctcagtcagagactgag ctgatctggaagatccatgatgctttcactcttattcctggcatcttggcaggaagcctg tccatgttggctcctaatgagcaagttccataa >gi568815591f:86665146_86964352|GENSCAN_predicted_peptide_7|977_aa MKVPSGAASGFPGCSVRSEWLALREDVGRARDSGNSIGRLLSVLPRTLDFDLETAIHRLF GQGVPAGQCQAAFSPCFSLSPVLVGGQSLEEAEVAGSCSRLHGSGHSRLAAAAISIALKA FSCASGEYLEMKNQVCSKCGEGTYSLGSGIKFDEWDELPAGFSNIATFMDTVVGPSDSRP DGCNNSSWIPRGNYIESNRDDCTVSLIYAVHLKKSGYVFFEYQYVDNNIFFEFFIQNDQC QEMDTTTDKWVKLTDNGEWGSHSVMLKSGTNILYWRTTGILMGSKAVKPVLVKNITIEGV AYTSECFPCKPGTFSNKPGSFNCQSCKIIDPFLKRFFYVAEEGSSECTERPPCTTKDYFQ IHTPCDEEGKTQIMYKWIEPKICREDLTDAIRLPPSGEKKDCPPCNPGFYNNGSSSCHPC PPGTFSDGTKECRPCPAGTEPALGFEYKWWNVLPGNMKTSCFNVGNSKCDGMNGWEVAGD HIQSGAGGSDNDYLILNLHIPGFKPPTSMTGATGSELGRITFVFETLCSADCVLYFMVDI NRKSTNVVESWGGTKEKQAYTHIIFKNATFTFTWAFQRTNQGQDNRRFINDMVKIYSITA TNAVDGVASSCRACALGSEQSGSSCVPCPPGHYIEKETNQCKECPPDTYLSIHQVYGKEA CIPCGPGSKNNQGKKMALCTNNITDFTVKEIVAGSDDYTNLVGAFVCQSTIIPSESKGFR AALSSQSIILADTFIGVTVETTLKNINIKEDMFPVPTSQIPDVHFFYKTIYRQIIDIKCP AGTCDGCTFYFLWESAEACPLCTEHDFHEIEGACKRGFQETLYVWNEPKWCIKGISLPEK KLATCETVDFWLKVGAGVGAFTAVLLVALTCYFWKKNQKLEYKYSKLVMTTNSKECELPA ADSCAIMEGEDNEEEVVYSNKQSLLGKLKSLATKTAIGVFGGPWSLGYVISCQYSETRSS CIISVHAKNEILAIEEE >gi568815591f:86665146_86964352|GENSCAN_predicted_CDS_7|2934_bp atgaaggtgccaagtggagcagcatctggttttcctgggtgctctgtgagatctgaatgg ctggcactcagggaggatgtgggaagggccagagattctggcaactcaatcggccggctg ctctctgtcctaccaaggactctggactttgacctggagacagccattcacaggctgttc ggtcaaggagtgcctgcaggccagtgccaagctgccttcagcccctgcttcagtctctct cctgtgcttgttggtggccaaagtctggaggaggctgaggtggcaggaagctgtagccgg cttcatggcagtggtcactccaggctggctgctgctgccatcagtatagcccttaaagct ttctcctgtgcttctggagagtatctagaaatgaagaaccaggtatgcagtaagtgtggt gaaggcacctattccttgggcagtggcatcaaatttgatgaatgggatgaattgccggca ggattttctaacatcgcaacattcatggacactgtggtgggcccttctgacagcaggcca gacggctgtaacaactcttcttggatccctcgtggaaactacatagaatctaatcgtgat gactgcacggtgtctttgatctatgctgtgcaccttaagaagtcaggctatgtcttcttt gagtaccagtatgtcgacaacaacatcttctttgagttctttattcaaaatgatcagtgc caggagatggacaccaccactgacaagtgggtaaaacttacagacaatggagaatggggc tctcattctgtaatgctgaaatcaggcacaaacatactctactggagaactacaggcatc cttatgggttctaaggcggtcaagcctgtgctggtaaaaaatatcacaattgaaggggtg gcgtacacatcagaatgttttccttgcaagccaggcacattcagcaacaaaccaggttca ttcaactgccagagttgcaaaataattgacccatttctgaaacgctttttttatgttgca gaggaaggatccagtgagtgtacagagcgccctccctgtaccacaaaagactatttccag atccatactccatgtgatgaagaaggaaagacacagataatgtacaagtggatagagccc aaaatctgccgggaggatctcacagatgctattagattgcccccttctggagagaagaag gattgtccgccttgcaaccctggattttataacaatggatcatcttcttgccatccctgt cctcctggaacattttcagatggaaccaaagaatgtagaccatgtccagcaggaacggag cctgcacttggctttgaatataaatggtggaatgtccttcctggcaacatgaaaacttcc tgcttcaatgttgggaattcaaagtgcgatggaatgaatggttgggaggtggctggagat catatccagagtggggctggaggttctgacaatgattacctgatcttaaacttgcatatc ccaggatttaaaccaccaacatctatgactggagccacgggttctgaactaggaagaata acatttgtctttgagaccctctgttcagctgactgtgttttgtacttcatggtggatatt aatagaaaaagtacaaatgtggtagaatcgtggggtggaaccaaagaaaaacaagcttac acccatatcatcttcaagaatgcaacttttacatttacatgggcattccagagaactaat cagggtcaagataatagacggttcatcaatgacatggtgaagatttattctatcacagcc actaatgcagttgatggggtggcgtcctcatgccgtgcctgtgccctcggttctgaacag tcgggttcatcgtgtgtcccctgccctccaggccactacattgagaaagaaaccaaccag tgcaaggaatgtccacctgacacctacctgtccatacatcaggtctatggcaaagaggct tgtattccatgcgggcctgggagtaaaaacaatcaggggaagaagatggctctctgtacc aacaatataacagactttacagtaaaagaaatagtggcagggtcagatgattacacaaat ttggtaggggcatttgtatgccagtcaacaattattccttctgaaagtaagggtttccga gcagccttatcatcacaatccatcattctggcagatacattcataggagtcacagttgaa accacattgaaaaatattaatataaaagaagatatgttcccagttccaacaagccaaata ccagatgtgcatttcttttataaaacaatttatagacagataatagatatcaagtgccca gcaggtacctgtgatgggtgtacgttctatttcctgtgggagagtgctgaagcttgccct ctgtgtacggagcatgacttccatgagattgagggagcctgcaagagaggatttcaggaa accttgtatgtgtggaatgaacctaaatggtgcattaaaggaatttctttgcctgagaaa aagttggcaacctgtgaaacggttgacttttggctgaaggtgggagccggtgtgggagct tttactgccgttttgctggtggctctgacctgctacttctggaaaaagaatcaaaaactg gaatacaaatattccaagttagtaatgacgactaactcaaaagagtgtgaactcccggct gcagacagttgtgctatcatggaaggagaagataatgaagaggaagttgtatattccaat aaacagtcactactaggaaaactcaaatctttggcaaccaagactgctatcggtgttttt ggtggcccctggagcttaggatatgtcatatcctgtcagtactctgagacaagatcaagt tgtattatttctgtgcatgctaaaaatgaaattttagcgattgaagaggaataa