GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:42:32

Sequence gi568815578f:34293812_34607791 : 313980 bp : 44.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -    345    270   76  0  1   96  114    76 0.972  10.09
 1.08 Intr -   1774   1584  191  2  2   95   78   388 0.999  37.80
 1.07 Intr -   9520   9432   89  0  2   16  109    79 0.570   2.41
 1.06 Intr -   9845   9707  139  0  1   43  117   102 0.388   8.12
 1.05 Intr -  17593  17470  124  1  1   49   36    87 0.287  -0.24
 1.04 Intr -  22385  22275  111  2  0   74   59    83 0.561   4.58
 1.03 Intr -  38953  38794  160  0  1   93   29   126 0.346   7.19
 1.02 Intr -  40083  40053   31  1  1  106   77    30 0.756   0.89
 1.01 Init -  42065  41993   73  2  1   71  109    18 0.707   2.08
 1.00 Prom -  54129  54090   40                              -2.86

 2.00 Prom +  79059  79098   40                              -3.16
 2.01 Init + 100046 100070   25  1  1   79  119    -2 0.117   1.76
 2.02 Intr + 119931 120068  138  1  0   85  106    69 0.925   8.84
 2.03 Intr + 130669 130714   46  2  1   40  121    70 0.093   2.97
 2.04 Intr + 144663 144820  158  0  2  111   72    65 0.369   6.85
 2.05 Intr + 146344 146533  190  2  1   47   92    19 0.247  -3.16
 2.06 Intr + 151476 151650  175  0  1   73   72    56 0.278   2.44
 2.07 Intr + 155600 155669   70  1  1  101   98    32 0.896   4.25
 2.08 Intr + 163579 163663   85  2  1   67   90    47 0.898   1.68
 2.09 Intr + 168282 168410  129  0  0   79   95    96 0.961   9.21
 2.10 Intr + 176237 176309   73  2  1   87   92    95 0.992   9.21
 2.11 Term + 178740 178835   96  2  0   48   53   120 0.959   2.47
 2.12 PlyA + 179294 179299    6                               1.05

 3.02 PlyA - 179523 179518    6                               1.05
 3.01 Sngl - 182663 182199  465  2  0   84   48   397 0.759  29.45
 3.00 Prom - 184191 184152   40                              -9.75

 4.00 Prom + 185679 185718   40                              -5.66
 4.01 Init + 185859 185978  120  2  0   22  115    61 0.787   2.39
 4.02 Intr + 186788 186921  134  1  2   87   97    60 0.994   6.24
 4.03 Intr + 187255 187395  141  1  0   52   94   106 0.876   7.07
 4.04 Intr + 195455 195575  121  2  1   42   80    54 0.200   0.50
 4.05 Intr + 196011 196115  105  2  0   63   72    44 0.564   0.71
 4.06 Intr + 198690 198786   97  2  1   32   71   112 0.839   3.48
 4.07 Intr + 210520 210592   73  2  1   73  119    34 0.901   3.46
 4.08 Intr + 213884 213979   96  2  0   58   -6   139 0.012   0.62
 4.09 Intr + 218338 218389   52  1  1   45   76    30 0.008  -3.59
 4.10 Intr + 222573 222650   78  2  0   57   89    92 0.025   5.95
 4.11 Intr + 232457 232532   76  1  1  103  107   113 0.995  13.79
 4.12 Intr + 240817 240984  168  2  0  117  110   268 0.995  31.82
 4.13 Intr + 244932 245170  239  1  2   64   40    57 0.105  -4.17
 4.14 Intr + 251116 251265  150  0  0   76   86   111 0.747  10.06
 4.15 Intr + 264530 264607   78  1  0   91   85    77 0.828   7.45
 4.16 Intr + 264884 265097  214  1  1   32   85    71 0.650  -0.51
 4.17 Intr + 265397 265452   56  0  2   86  117    95 0.886  10.90
 4.18 Intr + 265536 265642  107  2  2  112   90   255 0.622  27.11
 4.19 Term + 265925 266087  163  2  1   67   52   360 0.623  27.71
 4.20 PlyA + 266508 266513    6                               1.05

 5.09 PlyA - 266757 266752    6                               1.05
 5.08 Term - 267168 267055  114  0  0  126   47   194 0.999  17.67
 5.07 Intr - 270329 270303   27  2  0  100   75    24 0.407   0.61
 5.06 Intr - 272654 272544  111  2  0    9   54   121 0.108   1.38
 5.05 Intr - 281435 281293  143  0  2  108  110   138 0.997  18.07
 5.04 Intr - 287861 287737  125  1  2  113   76   180 0.990  19.53
 5.03 Intr - 291769 291626  144  0  0   84  106   113 0.967  12.10
 5.02 Intr - 294796 294642  155  1  2   96   92    46 0.535   4.67
 5.01 Intr - 301720 301665   56  2  2  101   81    33 0.530   2.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 213884 213983  100  2  1   58   32   146 0.889   3.60
S.002 Init - 221882 221822   61  2  1   86   94    45 0.842   4.93
S.003 Init + 229773 229820   48  2  0   93   85     2 0.892   0.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:34293812_34607791|GENSCAN_predicted_peptide_1|332_aa
MYPQVSSILEAFMLEILSFSFQWAAFPPHKTLKEWREHVSECGIWPATPSTDTGASSMRA
RGHTRCIASRGTWRHPGKDACDPEAPERTPRAAIMKYHAFGGLQQQKFILSQFWKPEVQN
QGIGRKLHWGLDLEKGHQQLTSRAAFSGVGGDPQGSTVSSFLYIQPDEEIEAQGSELAEF
QFRWVLTLNQLAEPARGRPGWLSGLSRRSLPDRSCSQTEAQPPSPVSITSAASMSDKLPY
KVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKPLKGARIAGCLHMTVETAVLIETL
VTLGAEVQWSSCNIFSTQDHAAAAIAKAGIPX

>gi568815578f:34293812_34607791|GENSCAN_predicted_CDS_1|996_bp
atgtacccacaggtctcctctatcctggaggcattcatgcttgagatcctgagcttctca
ttccaatgggcagcttttcctccgcataagaccctgaaggaatggagggagcacgtgagt
gagtgcgggatctggccagccactccaagcactgacacaggagcaagctccatgcgggcc
cgtggacacaccaggtgtattgcctcaagggggacatggcggcatccaggcaaggatgcc
tgcgaccctgaagccccagagaggactcccagagctgccattatgaagtaccacgcattt
ggtggcttacaacaacagaaatttattctctcacagttctggaagccagaagtccaaaat
caaggcattggcaggaagctccactggggcctggacctggagaagggacatcaacagtta
acgtccagggcagctttcagcggcgtgggcggggatccccagggaagcactgttagcagt
ttcttgtacatccagcctgatgaggaaattgaggcacagggaagtgaactggcggaattt
cagttccgctgggttttgacactgaatcaactggctgaaccagcgagaggacggcccggc
tggctctcaggcctctcccggcgctcccttccggaccgttcctgttcccagactgaggcc
cagcccccttcgcccgtttccatcacgagtgccgccagcatgtctgacaaactgccctac
aaagtcgccgacatcggcctggctgcctggggacgcaaggccctggacattgctgagaac
gagatgccgggcctgatgcgtatgcgggagcggtactcggcctccaagccactgaagggc
gcccgcatcgctggctgcctgcacatgaccgtggagacggccgtcctcattgagaccctc
gtcaccctgggtgctgaggtgcagtggtccagctgcaacatcttctccacccaggaccat
gcggcggctgccattgccaaggctggcattccggnn

>gi568815578f:34293812_34607791|GENSCAN_predicted_peptide_2|394_aa
MKSQLQITVEEVVVTLQLGGDKEPTETIGDLSICLDGLQLESEVVTNGETTCSESASQND
DGSRSKDETRVSTNGSDDPEDAGAGENRRVSGNNSPSLSNGGFKPSRPPRPSRPPPPTPR
RPASVNGSPSATSESDGSSTGSLPPTNTNTNTSEGATSGLIIPLTISGGSGPRPLNPVTQ
APLPPGWERRVDNMGRIYYVDHFTRTTTWQRPTLESVRNYEQWQLQRSQLQGAMQQFNQR
FIYGNQDLFATSQSKEFDPLGPLPPGWEKRTDSNGRVYFVNHNTRITQWEDPRSQGQLNE
KPLPEGWEMRFTVDGIPYFVDHNRRTTTYIDPRTGKSALDNGPQIAYVRDFKAKVQYFRF
WCQPDVEVIIFKHQWYCSNYSFKKYGIECKGKRG

>gi568815578f:34293812_34607791|GENSCAN_predicted_CDS_2|1185_bp
atgaaatcacagcttcagatcactgttgaagaagtagttgtgactttgcagcttggaggt
gacaaagagccaacagagacaataggagacttgtcaatttgtcttgatgggctacagtta
gagtctgaagttgttaccaatggtgaaactacatgttcagaaagtgcttctcagaatgat
gatggctccagatccaaggatgaaacaagagtgagcacaaatggatcagatgaccctgaa
gatgcaggagctggtgaaaataggagagtcagtgggaataattctccatcactctcaaat
ggtggttttaaaccttctagacctccaagaccttcacgaccaccaccacccaccccacgt
agaccagcatctgtcaatggttcaccatctgccacttctgaaagtgatgggtctagtaca
ggctctctgccgccgacaaatacaaatacaaatacatctgaaggagcaacatctggatta
ataattcctcttactatatctggaggctcaggccctaggccattaaatcctgtaactcaa
gctcccttgccacctggctgggaacggcgggttgacaacatgggacgtatttattatgtt
gaccatttcacaagaacaacaacgtggcagaggccaacactggaatccgtccggaactat
gaacaatggcagctacagcgtagtcagcttcaaggagcaatgcagcagtttaaccagaga
ttcatttatgggaatcaagatttatttgctacatcacaaagtaaagaatttgatcctctt
ggtccattgccacctggatgggagaagagaacagacagcaatggcagagtatatttcgtc
aaccacaacacacgaattacacaatgggaagaccccagaagtcaaggtcaattaaatgaa
aagcccttacctgaaggttgggaaatgagattcacagtggatggaattccatattttgtg
gaccacaatagaagaactaccacctatatagatccccgcacaggaaaatctgccctagac
aatggacctcagatagcctatgttcgggacttcaaagcaaaggttcagtatttccggttc
tggtgtcagccagatgtggaagtgattattttcaagcaccagtggtattgtagcaactac
agcttcaagaagtacggaattgaatgcaaagggaaaagaggttga

>gi568815578f:34293812_34607791|GENSCAN_predicted_peptide_3|154_aa
MAAAGGARLLRAASAVLGDPAGRWLHHAGSRAGASGLLRSRGPGRSAEASRPLSVSAGAR
SSSEDKVTVHFINCDGETLTTKGKVGDSLLDVVVENNPDIDGFGACEGTLACLTCHLIFE
DHIYEKLDAITDEENHMLDLAYGLTDHSWAAKSV

>gi568815578f:34293812_34607791|GENSCAN_predicted_CDS_3|465_bp
atggctgccgctgggggcgcccggctgctgcgcgccgcttctgcggtcctcggcgacccg
gccggccggtggctgcaccacgccgggtcccgcgctggagccagcggcctgctgaggagc
cggggaccgggccggagcgcggaggcgagccggccgctgagcgtgtcggcaggggcgcgg
agcagctcagaagataaagtgacagtccactttataaactgtgatggtgaaacattaaca
accaaaggaaaagttggtgattctctgctagacgttgtggttgaaaataatccagatatt
gatggctttggtgcatgtgagggaactctggcttgtttaacctgtcatctcatctttgaa
gatcacatatatgagaagttagatgcaatcactgatgaggagaatcacatgctcgatctg
gcatatggactaacagatcacagttgggctgccaaatctgtttga

>gi568815578f:34293812_34607791|GENSCAN_predicted_peptide_4|755_aa
MYCLFEYAGKDNYCLQINPASYINPDHLKYFRFIGRFIAMALFHGKFIDTGFSLPFYKRI
LNKPVGLKDLESIDPEFYNSLIWVKENNIEECDLEMYFSVDKEILGEIKSHDLKPNGGNI
LVTEENKEEYIRMVAEWRLSRGVEEQTQAFFEGFNEILPQQYLQYFDAKELEVLLCGMQE
IDLNDWQRHAIYRHYARTSKQIMWFWQFVKEIDNEKRMRLLQFVTGTCRLPVGGFADLMG
SNGPQKFCIEKVGKENWLPRSHTCFNRLDLPPYKSYEQLKEKLLFAIEETEGFGQDAETV
VMEACVIEIWPATKPLRRRRKAQDSLSVRYAGLPDRSEMAEVEETLKRLQSQKGVQGIIV
VNTEGIPIKSTMDNPTTTQYASLMHSFILKARSTVRDIDPQNDLTFLRIRSKKNEIMVAP
AHLRGKNQALGLALNSHKVTWLRSCHFAHAQQGAADQGSSEYGVKPRSLSPLHLWIYPGL
GKPFNSLRLRVLRVTRGFTELLYRPHSQQKCELAWDHLVSQSRLLLVLLLYVAIGPENMP
WGVSAETKMKPREGAGPLAPAVSHGAQNRVSPDTDSECCDLTSPGELPPAAAAAVLSASP
GALEREARSPRSPQTADTSPRPRAPACAPSRARAMPSDRPFKQRRSFADRCKEVQQIRDQ
HPSKIPVIIERYKGEKQLPVLDKTKFLVPDHVNMSELVKIIRRRLQLNPTQAFFLLVNQH
SMVSVSTPIADIYEQEKDEDGFLYMVYASQETFGF

>gi568815578f:34293812_34607791|GENSCAN_predicted_CDS_4|2268_bp
atgtattgcctgtttgaatatgcagggaaggataactactgcttgcagataaaccccgct
tcttacatcaatccagatcacctgaaatattttcgttttattggcagatttattgccatg
gctctgttccatgggaaattcatagacacgggtttttctttaccattctataagcgtatc
ttgaacaaaccagttggactcaaggatttagaatctattgatccagaattttacaattct
ctcatctgggttaaggaaaacaatattgaggaatgtgatttggaaatgtacttctccgtt
gacaaagaaattctaggtgaaattaagagtcatgatctgaaacctaatggtggcaatatt
cttgtaacagaagaaaataaagaggaatacatcagaatggtagctgagtggaggttgtct
cgaggtgttgaagaacagacacaagctttctttgaaggctttaatgaaattcttccccag
caatatttgcaatactttgatgcaaaggaattagaggtccttttatgtggaatgcaagag
attgatttgaatgactggcaaagacatgccatctaccgtcattatgcaaggaccagcaaa
caaatcatgtggttttggcagtttgttaaagaaattgataatgagaagagaatgagactt
ctgcagtttgttactggaacctgccgattgccagtaggaggatttgctgatctcatgggg
agcaatggaccacagaaattctgcattgaaaaagttgggaaagaaaattggctacccaga
agtcatacctgttttaatcgcctggacctgccaccatacaagagctatgagcaactgaag
gaaaagctgttgtttgccatagaagaaacagaaggatttggacaagatgctgaaacagtt
gttatggaggcctgcgttattgagatctggcctgccacgaaacctttgcgcaggcgcaga
aaggcacaggactcgctaagtgttcgctacgcggggctaccggatcggtcggaaatggca
gaggtggaggagacactgaagcgactgcagagccagaagggagtgcagggaatcatcgtc
gtgaacacagaaggcattcccatcaagagcaccatggacaaccccaccaccacccagtat
gccagcctcatgcacagcttcatcctgaaggcacggagcaccgtgcgtgacatcgacccc
cagaacgatctcaccttccttcgaattcgctccaagaaaaatgaaattatggttgcacca
gcccatcttcgtgggaagaaccaagctttaggcttggctctgaacagccacaaagtgact
tggctgaggtcctgccatttcgctcatgctcagcagggggcagcagaccagggcagttca
gagtatggggtcaaacccaggtccctgtccccactccacctgtggatttaccctggattg
ggcaagccctttaactctctgaggctccgtgttctcagagtgactcgaggattcactgag
ctcctttatagaccacattcccagcagaagtgtgagctcgcttgggaccacctggtgtca
cagtccaggctgctgctggtcctgctcctctacgtggcaatcggcccagagaatatgcct
tggggggtctcagcagaaactaagatgaagccccgtgaaggcgccgggcctttggctccg
gctgtttcccacggcgcccagaaccgcgtcagtcctgacacagactcggaatgttgtgac
ctgacgtcaccgggcgagttacctcccgcagccgcagccgccgtgctcagcgcgagcccc
ggagcccttgagcgcgaggcgcggagcccccggagcccccaaaccgcagacacatccccg
cgccccagagccccggcctgcgcgcccagccgggcccgcgcgatgccctcagaccggcct
ttcaagcagcggcggagcttcgccgaccgctgtaaggaggtacagcagatccgcgaccag
caccccagcaaaatcccggtgatcatcgagcgctacaagggtgagaagcagctgcccgtc
ctggacaagaccaagtttttggtcccggaccatgtcaacatgagcgagttggtcaagatc
atccggcgccgcctgcagctgaaccccacgcaggccttcttcctgctggtgaaccagcac
agcatggtgagtgtgtccacgcccatcgcggacatctacgagcaggagaaagacgaggac
ggcttcctctatatggtctacgcctcccaggaaaccttcggcttctga

>gi568815578f:34293812_34607791|GENSCAN_predicted_peptide_5|291_aa
XNYSLPSVFVVDWFQDLLQRQYIPVKMKSKAFWIFSWEYAMMYVGSLVVIICLSFFLLSS
WDFIPAVYGFILSVPDLTPNIGLFWYFFAEMFEHFSLFFVCVFQINVFFYTIPLAIKLKE
HPIFFMFIQIAVIAIFKSYPTVGDVALYMAFFPVWNHLYRFLRNIFVLTCIIIVCSLLFP
VLWHLWIYAGSANSNFFYAITLTFNVGQLPHELFIMVPEGGGPGALMAAIERNVDRAVPG
GSAERDPIERSTLHILLISDYFYAFLRREYYLTHGLYLTAKDGTEAMLVLK

>gi568815578f:34293812_34607791|GENSCAN_predicted_CDS_5|876_bp
ngtaattacagtcttccctcggtatttgtggtggattggttccaggacctcctgcagcgg
cagtacatacctgtgaaaatgaagagcaaagccttctggatcttttcttgggagtatgcc
atgatgtatgtgggaagcctagtggtaatcatttgcctctccttcttccttctcagctct
tgggatttcatccccgcagtctatggctttatactttctgttccagatctcactccaaac
attggtcttttctggtacttctttgcagagatgtttgagcacttcagcctcttctttgta
tgtgtgtttcagatcaacgtcttcttctacaccatccccttagccataaagctaaaggag
caccccatcttcttcatgtttatccagatcgctgtcatcgccatctttaagtcctacccg
acagtgggggacgtggcgctctacatggccttcttccccgtgtggaaccatctctacaga
ttcctgagaaacatctttgtcctcacctgcatcatcatcgtctgttccctgctcttccct
gtcctgtggcacctctggatttatgcaggaagtgccaactctaatttcttttatgccatc
acactgaccttcaacgttgggcagctgccccatgagctcttcatcatggtcccagaggga
ggcggccctggagccctcatggctgccatagagaggaatgtggaccgagccgtccctggg
ggcagcgctgagagggaccccatcgagaggtccacattacatatcctgctcatctctgat
tacttctatgccttcctgcggcgggagtactacctcacacatggcctctacttgaccgcc
aaggatggcacagaggccatgctcgtgctcaagtag