GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:31:44 Sequence gi568815597r:10361493_10572458 : 210966 bp : 46.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 200 333 134 1 2 94 88 124 0.770 12.44 1.02 Intr + 1791 1852 62 0 2 76 115 4 0.920 0.28 1.03 Intr + 3608 3807 200 0 2 86 3 154 0.514 5.77 1.04 Intr + 4049 4156 108 1 0 83 70 196 0.589 17.78 1.05 Intr + 6975 7046 72 2 0 105 105 69 0.995 9.90 1.06 Intr + 9649 9770 122 0 2 77 100 34 0.982 2.79 1.07 Intr + 12824 12973 150 2 0 79 113 98 0.995 10.68 1.08 Intr + 13362 13554 193 0 1 85 73 156 0.959 13.29 1.09 Intr + 13763 13881 119 1 2 87 107 164 0.962 17.46 1.10 Term + 15053 15095 43 2 1 130 42 29 0.828 -0.87 1.11 PlyA + 15328 15333 6 -0.45 2.00 Prom + 19468 19507 40 -2.86 2.01 Init + 37482 37633 152 2 2 69 81 95 0.761 6.53 2.02 Intr + 38137 38212 76 1 1 114 91 190 0.995 21.42 2.03 Intr + 38901 39080 180 2 0 102 73 203 0.987 20.26 2.04 Intr + 41579 41644 66 1 0 77 98 79 0.987 6.90 2.05 Intr + 42669 42787 119 2 2 76 89 46 0.753 2.76 2.06 Intr + 46579 46618 40 1 1 123 0 59 0.695 -1.17 2.07 Intr + 49926 50060 135 2 0 107 94 247 0.993 27.96 2.08 Intr + 51570 51759 190 2 1 76 23 240 0.973 15.46 2.09 Intr + 55495 55625 131 2 2 75 81 160 0.977 14.41 2.10 Intr + 55776 55859 84 2 0 62 44 70 0.502 0.02 2.11 Intr + 55884 56017 134 2 2 45 115 127 0.991 10.54 2.12 Intr + 57334 57433 100 1 1 75 91 140 0.997 13.11 2.13 Intr + 57925 58047 123 0 0 90 115 59 0.999 9.58 2.14 Term + 58138 58257 120 0 0 117 43 115 0.999 8.37 2.15 PlyA + 58631 58636 6 1.05 3.00 Prom + 61626 61665 40 -3.16 3.01 Init + 64859 64966 108 1 0 82 35 205 0.908 12.83 3.02 Intr + 69011 69076 66 1 0 61 92 66 0.247 3.40 3.03 Intr + 72350 72473 124 1 1 73 48 154 0.911 10.06 3.04 Intr + 73165 73198 34 2 1 121 98 37 0.722 5.28 3.05 Term + 78740 78878 139 2 1 58 41 64 0.138 -3.86 3.06 PlyA + 79598 79603 6 1.05 4.00 Prom + 80743 80782 40 -5.16 4.01 Init + 88732 88830 99 0 0 65 105 209 0.987 18.57 4.02 Term + 92493 92525 33 2 0 82 55 19 0.273 -4.41 4.03 PlyA + 96411 96416 6 1.05 5.08 PlyA - 98453 98448 6 1.05 5.07 Term - 100210 99998 213 1 0 53 43 231 0.994 12.33 5.06 Intr - 101717 101566 152 0 2 118 64 143 0.999 14.78 5.05 Intr - 102128 101939 190 2 1 53 97 139 0.982 10.46 5.04 Intr - 105840 105698 143 1 2 63 103 138 0.995 12.87 5.03 Intr - 107846 107685 162 0 0 61 110 152 0.997 14.65 5.02 Intr - 110432 110373 60 0 0 69 92 71 0.949 4.31 5.01 Init - 110966 110786 181 2 1 69 76 203 0.905 16.55 5.00 Prom - 111567 111528 40 -5.66 6.00 Prom + 111859 111898 40 -6.36 6.01 Init + 113475 113510 36 2 0 91 110 30 0.407 5.55 6.02 Intr + 133782 133829 48 2 0 101 119 25 0.718 5.78 6.03 Intr + 134197 134229 33 0 0 83 115 3 0.477 0.92 6.04 Term + 134421 134432 12 2 0 129 41 17 0.579 -0.70 6.05 PlyA + 135508 135513 6 1.05 7.05 PlyA - 135606 135601 6 1.05 7.04 Term - 146384 146046 339 2 0 57 37 125 0.106 -1.16 7.03 Intr - 149610 149526 85 2 1 10 59 134 0.072 2.52 7.02 Intr - 165422 165190 233 2 2 6 100 145 0.028 4.07 7.01 Init - 172118 172026 93 2 0 88 103 30 0.485 4.88 7.00 Prom - 178118 178079 40 -3.26 8.07 PlyA - 178823 178818 6 1.05 8.06 Term - 179857 179790 68 2 2 123 45 41 0.055 1.40 8.05 Intr - 188085 188047 39 1 0 118 101 9 0.722 3.40 8.04 Intr - 190014 189877 138 1 0 2 100 136 0.738 6.64 8.03 Intr - 192398 192216 183 0 0 35 81 86 0.529 2.36 8.02 Intr - 194374 194324 51 2 0 83 94 14 0.288 0.38 8.01 Init - 199672 199633 40 1 1 82 72 61 0.444 4.25 8.00 Prom - 206770 206731 40 -2.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 85653 85549 105 0 0 31 56 125 0.807 3.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_1|400_aa LDHCIQPAVITKDVCMVFYSRDAKISPPRSLRSLFGSGYSKSPDSNRVTGIYELSLCKMS DTGSPGMQRRRRKILDTSVAYVRGEENLAGWRPRGDSLILEHQWELEKLELLHEVSRGRV VQMQELSDKIAKISTTTFESAITPSESSGYDSGDIESLVDREKELATKCLQLLTHTFNRE FSQVHGSVSDCKLSDISPIGRDPSESSFSSATLTPSSTCPSLVDSRSNSLDQKTPEANSR ASSPCPEFEQFQIVPAVETPYLARAGKNEFLNLVPDIEEIRPSSVVSKKGYLHFKEPLYS NWAKHFVVVRRPYVFIYNSDKDPVERGIINLSTAQVEYSEDQQAMVKTPNTFAVCTKHRG VLLQALNDKDMNDWLYAFNPLLAGTIRSKLSRRCPSQSKY >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_1|1203_bp ctggatcattgcatccagccggctgtcatcaccaaggatgtgtgcatggtcttctactcc cgagatgccaagatctcaccaccacgctctctgcgtagcctctttggcagcggctactca aagtcaccagattcgaatcgagtcactggcatttacgaactcagcttatgcaaaatgtca gacacaggtagtccaggtatgcagagaaggagaagaaaaatcttagatacgtcagtggca tatgtgcggggagaagagaacttagcaggctggcggccccgtggagacagcctcatcctt gagcaccagtgggagctggagaagctggagctcctacatgaggtatccaggggcagggtt gttcagatgcaagaactctcggacaagattgccaaaatctcaaccactacctttgaaagc gccatcacacctagcgagagcagtggctatgattcaggagacatcgaaagcctggtggac cgagagaaagagctggctaccaagtgcctgcaacttctcacccacactttcaacagagaa ttcagccaggtgcacggcagcgtcagtgactgtaagttgtctgatatctctccaattgga cgggatccctctgagtccagtttcagcagtgccaccctcactccctcctccacctgtccc tctctggtagactctaggagcaactctctggatcagaagaccccagaagccaattcccgg gcctctagtccctgcccagaatttgaacagtttcagattgtcccagctgtggaaacacca tatttggcccgagcaggaaaaaacgaatttctcaatcttgttccagatattgaagaaatt agaccaagctcagtggtctctaagaaaggataccttcatttcaaggagcctctttacagt aactgggctaaacattttgttgtcgtccgtcggccttatgtcttcatctataacagtgac aaagaccctgtggagcgtggaatcattaacctgtccacagcacaggtggagtacagtgag gaccagcaggccatggtgaagacaccaaacacctttgctgtctgcacaaagcaccgtggg gtccttttgcaggccctcaatgacaaagacatgaacgactggttgtatgccttcaaccca cttctagctggcacaatacggtcaaagctttcccgcagatgcccgagccagtcgaaatac taa >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_2|549_aa MAPPPSIRLAGAEKPGVSGRSFWREPLRVFPSLVLRASPLFGSALSAAMAQADIALIGLA VMGQNLILNMNDHGFVVCAFNRTVSKVDDFLANEAKGTKVVGAQSLKEMVSKLKKPRRII LLVKAGQAVDDFIEKLVPLLDTGDIIIDGGNSEYRDTTRRCRDLKAKGILFVGSGVSGGE EGARYGPSLMPGGNKEAWPHIKTIFQGIAAKVGDEGAGHFVKMVHNGIEYGDMQLICEAY HLMKDVLGMAQDEMAQAFEDWNKTELDSFLIEITANILKFQDTDGKHLLPKIRDSAGQKG TGKWTAISALEYGVPVTLIGEAVFARCLSSLKDERIQASKKLKGPQKFQFDGDKKSFLED IRKNIGPSGISTADENKTGRHKAVTLLMAILALYASKIISYAQGFMLLRQAATEFGWTLN YGGIALMWRGGCIIRSVFLGKIKDAFDRNPELQNLLLDDFFKSAVENCQDSWRRAVSTGV QAGIPMPCFTTALSFYDGYRHEMLPASLIQAQRDYFGAHTYELLAKPGQFIHTNWTGHGG TVSSSSYNA >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_2|1650_bp atggctccacccccttccattcgattggccggcgccgaaaagccgggcgtgagcggccgc agtttctggagggagccgctgcgggtctttccctcactcgtcctccgcgcgtcgccgctc ttcggttctgctctgtccgccgccatggcccaagctgacatcgcgctgatcggattggcc gtcatgggccagaacttaattctgaacatgaatgaccacggctttgtggtctgtgctttt aataggactgtctccaaagttgatgatttcttggccaatgaggcaaagggaaccaaagtg gtgggtgcccagtccctgaaagagatggtctccaagctgaagaagccccggcggatcatc ctcctggtgaaggctgggcaagctgtggatgatttcatcgagaaattggtaccattgttg gatactggtgacatcatcattgacggaggaaattctgaatatagggacaccacaagacgg tgccgagacctcaaggccaagggaattttatttgtggggagcggagtcagtggtggagag gaaggggcccggtatggcccatcgctcatgccaggagggaacaaagaagcgtggccccac atcaagaccatcttccaaggcattgctgcaaaagtgggagatgagggagcaggccacttc gtgaagatggtgcacaacgggatagagtatggggacatgcagctgatctgtgaggcatac cacctgatgaaagacgtgctgggcatggcgcaggacgagatggcccaggcctttgaggat tggaataagacagagctagactcattcctgattgaaatcacagccaatattctcaagttc caagacaccgatggcaaacacctgctgccaaagatcagggacagcgcggggcagaagggc acagggaagtggaccgccatctccgccctggaatacggcgtacccgtcaccctcattgga gaagctgtctttgctcggtgcttatcatctctgaaggatgagagaattcaagctagcaaa aagctgaagggtccccagaagttccagtttgatggtgataagaaatcattcctggaggac attcggaagaatattggcccttctgggatctccactgctgatgagaataagactggtaga cataaggcggtcactctcctaatggcaatcctagcactctacgcttccaagatcatctct tacgctcaaggctttatgctgctaaggcaggcagccaccgagtttggctggactctcaat tatggtggcatcgccctgatgtggagagggggctgcatcattagaagtgtattcctagga aagataaaggatgcatttgatcgaaacccggaacttcagaacctcctactggacgacttc tttaagtcagctgttgaaaactgccaggactcctggcggcgggcagtcagcactggggtc caggctggcattcccatgccctgttttaccactgccctctccttctatgacgggtacaga catgagatgcttccagccagcctcatccaggctcagcgggattacttcggggctcacacc tatgaactcttggccaaaccagggcagtttatccacaccaactggacaggccatggtggc accgtgtcatcctcgtcatacaatgcctga >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_3|156_aa MRPHSPALGWSMGLGAVEQGATLIGEARATQEPTEWGRPAVMEEEAETEEQQRFSYQQRL KAAVHYTVGCLCEEVALDKEMQFSKQTIAAISELTFRQCENFAKDLEMFARMGYLEIPAS LKSDRELLCFIKTLNNTKHFGDLLLPFLQTCEKNHN >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_3|471_bp atgcgcccacactccccagcccttgggtggtcgatgggactgggcgccgtggagcagggg gccacgctcatcggggaggctcgggccacacaggagcccacggagtggggtcggcccgca gtgatggaggaggaggcggagaccgaggagcagcagcgattctcttaccaacagaggcta aaggcagcagttcactatactgtgggttgtctttgcgaggaagttgcattggacaaagag atgcagttcagcaaacagaccattgcggccatttcggagctgactttccgacagtgtgaa aattttgccaaagaccttgaaatgtttgcaaggatgggctacttagagatccctgctagt ttgaagtctgaccgtgaacttctgtgtttcatcaaaactttaaataataccaaacatttt ggtgacttgcttctgccctttctgcagacatgcgaaaagaaccacaattaa >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_4|43_aa MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEDLAPEQLPNC >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_4|132_bp atgccattgtcccccggcctcctgctgctgctgctctccggggccacggccaccgctgcc ctgcccctggagggtggccccaccggccgagacagcgaggatttggcaccagaacagctc cctaactgctga >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_5|366_aa MEVTGDAGVPESGEIRTLKPCLLRRNYSREQHGVAASCLEDLRSKGWLGTPGEGVGEPGP GTYAGKRELLLTISGGPDKEPCDILAIDKSLTPVTLVLAEDGTIVDDDDYFLCLPSNTKF VALASNEKWAYNNSDGGTAWISQESFDVDETDSGAGLKWKNVARQLKEDLSSIILLSEED LQMLVDAPCSDLAQELRQSCATVQRLQHTLQQVLDQREEVRQSKQLLQLYLQALEKEGSL LSKQEESKAAFGEEVDAVDTGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTK EDPKALAVALNWDIKKTETVQEACERELALRLQQTQSLHSLRSISASKASPPGDLQNPKR ARQDPT >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_5|1101_bp atggaggtgaccggggacgccggggtaccagaatctggcgagatccggactctaaagccg tgtctgctgcgccgcaactacagccgcgaacagcacggcgtggccgcctcctgcctcgaa gacctgaggagcaagggttggctcgggaccccgggcgagggtgtgggggagccagggccg ggaacctatgcaggaaagagggagctgctactgaccatttctggaggtccagacaaggaa ccctgtgacattctggccattgataagtccctgacaccagtcaccctggtcctggcagag gatggcaccatagtggatgatgacgattactttctgtgtctaccttccaatactaagttt gtggcattggctagtaatgagaaatgggcatacaacaattcagatggaggtacagcttgg atttcccaagagtcctttgatgtagatgaaacagacagcggggcagggttgaagtggaag aatgtggccaggcagctgaaagaagatctgtccagcatcatcctcctatcagaggaggac ctccagatgcttgttgacgctccctgctcagacctggctcaggaactacgtcagagttgt gccaccgtccagcggctgcagcacacactccaacaggtgcttgaccaaagagaggaagtg cgtcagtccaagcagctcctgcagctgtacctccaggctttggagaaagagggcagcctc ttgtcaaagcaggaagagtccaaagctgcctttggtgaggaggtggatgcagtagacacg ggtatcagcagagagacctcctcggacgttgcgctggcgagccacatccttactgcactg agggagaagcaggctccagagctgagcttatctagtcaggatttggagttggttaccaag gaagaccccaaagcactggctgttgccttgaactgggacataaagaagacggagactgtt caggaggcctgtgagcgggagctcgccctgcgcctgcagcagacgcagagcttgcattct ctccggagcatctcagcaagcaaggcctcaccacctggtgacctgcagaatcctaagcga gccagacaggatcccacatag >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_6|42_aa MASSEQAEQPSQPSSTPGSENVLPREPLGASASVVFKEQGLL >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_6|129_bp atggcgtcctcggagcaggcagagcagccgagccagccaagctctactccaggaagtgaa aatgtgctgcctcgagagccgctgggtgcgagcgcctcagtggtctttaaagaacagggc ctgctgtaa >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_7|249_aa MECSVISNSFKKNVGDIAKNSINERIQFKHQTHQKTEESQQSRSATNGMPPSPRSCYRMS IPNLKIQNPKCSKIQNFLSANMTVKGHAQRKCSLEHFGFGIFRFEMLNQQTSSSAACTFN DAYRCTTATEQDVENEEVNFAALTLTMRNLWMLIETQIKDLPEGAEKVEIKDPGYCIFFT YADSSDSLETCRHHFKSWSPEQIAQLSNLLPASKRQNNKRLLPPRSYQGHSTLLYRHHMK RSYYLLSIP >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_7|750_bp atggagtgctctgtcattagtaattcatttaagaagaacgtgggggatatagcaaagaac agcataaacgagcgaatccaatttaaacatcagacccaccagaaaacagaagaaagtcaa caatcaagatctgctactaacgggatgccaccaagtcctagatcctgctacaggatgagc atccctaatctgaaaatccaaaatccaaaatgctccaaaatccagaactttttgagtgcc aacatgacagtcaaaggtcatgcacaaaggaaatgctcactggagcatttcggatttggg attttcagatttgagatgctcaaccaacaaaccagcagctctgctgcctgcacatttaac gatgcctatcggtgcaccacagctacggaacaggatgtggagaacgaggaggtgaacttt gctgccttaactctgacaatgaggaatttgtggatgttgatagaaacacagataaaggat ttacctgagggggcagagaaagtagaaataaaagatcctggatattgtattttctttact tatgcggattccagtgactccctggagacctgcaggcaccactttaagagctggagtcca gagcaaattgctcaattgtccaaccttctccccgccagcaagagacagaacaacaagcgg ctgctgccccctagaagctaccaggggcatagcaccttgctatatagacatcacatgaaa agatcctattatctcctctccatcccataa >gi568815597r:10361493_10572458|GENSCAN_predicted_peptide_8|172_aa MAVSEEVYAEVITEGQNEASSAVMMQRPATGVQGSPENCRGQISKQVDCHKQISGNTCRY NREGQTQGRADEETDDKGVKRSFLEAEFSVEESPSRIRAAGIKLPNAPASSALCNDIHAN GDELGFMEVRLLRMDLKGLPKGEPLIGKKKCSKTKPVLSPLVALTPEKGFYR >gi568815597r:10361493_10572458|GENSCAN_predicted_CDS_8|519_bp atggccgtatctgaggaggtatatgctgaagtaattacagagggccagaatgaagcaagt tctgctgttatgatgcaacgcccggccacaggtgtgcagggaagccctgaaaattgccga ggccagatcagcaaacaggttgactgtcacaagcagatcagtggaaacacatgcagatat aacagagaaggacaaactcagggaagagcagatgaggaaacagatgacaaaggagtgaag aggagctttctggaagctgagttttcagtggaagaatctccaagccgaatccgtgctgct gggataaagctgcctaatgcaccagcctccagcgcgctgtgtaatgacatccacgctaat ggagatgagctcggcttcatggaggttcgactcctgagaatggatcttaaaggactgcct aaaggagagcctttgattgggaaaaaaaaatgttccaaaaccaaacccgttttgtctccc ctggtagctctcactcctgagaaaggcttttaccgctaa