GENSCAN 1.0    Date run: 6-Nov-116    Time: 12:02:20

Sequence gi568815577r:28776388_28985345 : 208958 bp : 38.31% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  19301  19440  140  1  2   78   76    96 0.054   7.06
 1.02 Term +  21653  21824  172  2  1   91   55    91 0.078   2.42
 1.03 PlyA +  22045  22050    6                               1.05

 2.00 Prom +  38521  38560   40                              -1.45
 2.01 Init +  40431  40466   36  2  0   71  119    52 0.437   6.67
 2.02 Term +  42224  42355  132  1  0  106   38    71 0.827   1.01
 2.03 PlyA +  42461  42466    6                              -0.45

 3.00 Prom +  44191  44230   40                              -4.55
 3.01 Init +  53991  54033   43  2  1   80  127    -6 0.554   3.38
 3.02 Term +  55064  55410  347  0  2    5   45   254 0.542   6.47
 3.03 PlyA +  56852  56857    6                               1.05

 4.00 Prom +  57280  57319   40                              -9.75
 4.01 Init +  58421  58463   43  1  1   86   68    58 0.723   2.29
 4.02 Intr +  59133  59283  151  1  1   75   88   106 0.541   7.60
 4.03 Term +  62984  63269  286  2  1   28   42   195 0.594   2.79
 4.04 PlyA +  63666  63671    6                               1.05

 5.09 PlyA -  65865  65860    6                               1.05
 5.08 Term -  74496  74410   87  0  0  120   39     8 0.491  -4.22
 5.07 Intr -  74995  74810  186  1  0   53   55   225 0.819  14.66
 5.06 Intr -  76182  76023  160  2  1   61   91   123 0.647   8.97
 5.05 Intr -  97210  97128   83  1  2   19  111    73 0.186   0.22
 5.04 Intr - 103566 103483   84  0  0   47   83    70 0.672   1.50
 5.03 Intr - 105863 105773   91  1  1   90   91   115 0.990  11.08
 5.02 Intr - 106684 106598   87  0  0   99  103    -7 0.503   0.07
 5.01 Init - 108958 108825  134  1  2   62   49   300 0.690  23.16
 5.00 Prom - 109451 109412   40                              -5.75

 6.04 PlyA - 109792 109787    6                               1.05
 6.03 Term - 111491 111268  224  0  2  -18   38   303 0.996  10.80
 6.02 Intr - 111882 111525  358  0  1    8   56   416 0.922  24.30
 6.01 Init - 112766 112323  444  2  0   25   77   556 0.571  44.26
 6.00 Prom - 124252 124213   40                              -3.25

 7.19 PlyA - 126359 126354    6                               1.05
 7.18 Term - 127149 126708  442  2  1    7   44   548 0.926  36.04
 7.17 Intr - 134006 133863  144  1  0   83   45   128 0.246   6.48
 7.16 Intr - 143528 143473   56  2  2   74   72    17 0.065  -4.44
 7.15 Intr - 154935 154768  168  0  0   97  111     3 0.860   2.72
 7.14 Intr - 156277 156083  195  1  0   84  111   106 0.983  11.19
 7.13 Intr - 165019 164833  187  0  1   34   99   101 0.531   4.57
 7.12 Intr - 166949 166875   75  1  0   71   61   115 0.946   4.81
 7.11 Intr - 175877 175773  105  1  0  124   61    55 0.959   4.81
 7.10 Intr - 176963 176830  134  2  2   54  110    46 0.215   1.92
 7.09 Intr - 181039 180945   95  2  2   62   68    90 0.158   3.16
 7.08 Intr - 183310 183071  240  2  0   85   90    81 0.378   4.60
 7.07 Intr - 184319 184130  190  2  1   79   58    46 0.216  -1.06
 7.06 Intr - 190792 189983  810  1  0   78   61   365 0.741  23.50
 7.05 Intr - 193126 193079   48  1  0   54   70    75 0.403   0.26
 7.04 Intr - 195057 194884  174  0  0   84   87    53 0.756   4.01
 7.03 Intr - 204912 204732  181  2  1   39   76   135 0.989   6.35
 7.02 Intr - 205981 205929   53  1  2   85   92    52 0.993   2.09
 7.01 Intr - 208535 208305  231  2  0   52   84   264 0.998  19.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  20843  20761   83  0  2  123   38    72 0.883   2.58
S.002 Init -  23717  23633   85  2  1   90   82    77 0.819   8.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_1|103_aa
MMIHKIYSEEPNVQEMQTICSVNLEHRFENLEADRKGNLVYNFVIIRLSFCQQPVLRIQA
SDPVLSFRSPFSPSFIKQQPLLSVLLCTRSIDRFRVCEAQLGV

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_1|312_bp
atgatgattcataaaatatattcagaggagcccaatgttcaggaaatgcaaactatctgc
agtgtgaatttggagcacaggtttgaaaacttagaggctgatcgaaaaggaaatttggtt
tacaattttgtaattatcaggcttagtttctgccaacagccagtcctgaggatccaggca
agtgaccctgtgttatctttccgctctcccttctctccttcgttcatcaagcagcaaccg
ctactttcagtgcttctctgcacacggagtatagacaggttccgagtatgtgaagctcag
cttggggtttga

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_2|55_aa
MNGASRSFSSCLEFSANPYLRKLQGGVGLMGLPHLGPTAVSLEPHRDCITVRGAQ

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_2|168_bp
atgaacggtgcatccaggagcttcagcagctgcctggaattttcagcaaatccatatctt
agaaagctgcagggaggagttggcctgatgggcttgccacacttgggccccactgcagtt
tccctggagcctcacagggactgcatcacagtccgtggggctcaataa

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_3|129_aa
MVRRACFPFTFCHDERKERKKERKKERKKERKKERKKERRKEGKKERRKEGKKERKKEGK
KERRKEGKKERRKEGRKERKKERKKERKKERKKEGRKEGRKEGRKEGRKERKKERKKERK
KETVSPFKQ

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_3|390_bp
atggtgagacgtgcttgcttccccttcaccttctgccatgacgaaagaaaagaacgaaag
aaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaagg
aaagaaggaaagaaggaaagaaggaaagaaggaaagaaggaaagaaagaaagaaggaaag
aaggaaagaaggaaagaaggaaagaaggaaagaaggaaagaaggaaggaaagaaagaaag
aaagaaagaaagaaagaaagaaagaaagaaagaaagaaggaaggaaggaaggaaggaagg
aaggaaggaaggaaggaaggaaggaaggaaagaaagaaagaaagaaagaaagaaagaaag
aaagaaacagtatcacctttcaaacagtga

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_4|159_aa
MVGVRWALQFARELEPTQMRRNQKTNPGNTTKQGSSTPPKNHISSPAMDPNQEEIPDLPE
KEFRRAIRQEKEIKRIQISKEEVKLSLIANNMIDYLENPKDSSRKLLELIKEFSKVSRYK
INVHKSVALPYTKSNQADNQIKNTTPFTIAAKIKYLGIY

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_4|480_bp
atggtgggagtgagatgggcccttcagttcgcacgggagctggagcctacccaaatgaga
aggaaccagaaaaccaacccaggtaatactacaaaacaaggctcttcaacacccccaaaa
aaccacattagttcaccagcaatggatccaaaccaagaagaaatccctgatttacctgaa
aaagaattcaggagagccatcagacaagagaaagaaataaagcgcattcaaatcagtaaa
gaggaagtcaaactgtcactgattgctaacaatatgattgattaccttgaaaaccctaag
gactcctccagaaagctcctagaactgataaaagaattcagcaaagtttccagatacaag
attaatgtacacaaatcagtggctcttccatacaccaagagcaaccaagcagacaatcaa
atcaagaacacaaccccttttacaatagctgcaaaaataaaatacttaggaatatactaa

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_5|303_aa
MAGENFATPFHGHVGRGAFSDVYEPAEDTFLLLDALEAAAAELAGVEICLEVGSGSGVVS
AFLASMIGPQALYMCTDINPEAAACTLETARCNKVHIQPVITDLVKGLLPRLTEKVDLLV
FNPPYVVTPPQEVNGKLQPNSDRTTNDPDLSGMKVWVTTRTPLASTRATVDWGNGNDQTF
QGLLDTGSELTLTPGDTKCHCRSPVKVGAYGGQWGPEQEKVLQQVQAAVQAVLTLGPYDP
SDPMVLEVSVADRDAVWSLWQAPVGETQWQPLGFGPALIASWGVHYNQSIEEEKTRAWLI
NSP

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_5|912_bp
atggcaggggagaacttcgctacgccgttccacgggcacgtgggccgcggcgccttcagc
gacgtgtacgagcccgcggaggacacgtttctgcttttggacgcgctggaggcagcggct
gccgaactggcaggagtggaaatatgcctggaagtagggtcagggtctggtgtagtatct
gcattcctagcctctatgataggccctcaggctttgtacatgtgcactgatatcaaccct
gaggcagcagcttgtaccctagagacagcacgctgtaacaaagttcacattcaaccagtt
attacagatttggtcaaaggcttgctaccaagattgaccgaaaaagttgatcttctggtg
tttaatcccccctatgtagtgactccacctcaagaggtcaatggaaaactacagcctaat
tcagacaggaccactaatgatccagacctttcaggaatgaaggtttgggtcaccaccagg
acacctctagcttctaccagggcaactgtggattggggaaatggaaatgatcagacattt
cagggactactggacactggctctgagctgacattgactccaggggacacaaaatgtcat
tgtcgttctccagttaaagtaggggcttatggaggtcagtggggtccagaacaagagaag
gttctgcaacaggtccaggctgctgtgcaagctgttcttacacttgggccatatgaccca
tcagatccaatggtgcttgaggtgtcggtggcagatagggatgccgtttggagcctttgg
caggcccctgtaggtgaaacacagtggcagcctctaggatttggacctgcactgatagct
tcttggggagttcactataatcaatcgatagaggaagagaagacaagggcctggttaata
aactccccttag

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_6|341_aa
MLQGVDLLLTAVAITMGPKGKTVITEQNWGSPKVTKDGVTVARSIDLKDKYKNIAAKLVQ
DVANNTNEEAGDGTTTATVAKEVFKKFSKGANPVEIKRSVMLAVDAVIAELKKQSKPVTT
PEEIAQVATISANGDKEIGNLISDAMKKVGNVNSKKISSVQSIVPDLEIANAHRKLLVII
AENVDGETLSTLILNRRKLGLQVVAVKAPGFGDNRKNHLKGMTIATGGAVFGEEELTSNL
EDVQPHDLGEVGEVTVIKDDAMLLKGKEIIEQLDVTTSEYEKEKLNEQLAKLSDGVAVLK
VSGTSHVEVNEKKNRVTDALNATRAAVEEGIVLGGGCTLLR

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_6|1026_bp
atgcttcaaggtgtagaccttttactcactgctgtagccattacaatggggccaaaggga
aaaacagtgattactgagcagaactggggaagtcccaaagtaacaaaagatggtgtgact
gttgcaaggtccattgacttaaaggataaatataaaaatattgcggctaaacttgttcaa
gatgttgccaataacacaaatgaagaggctggggatggcactaccactgctactgttgcc
aaggaagtcttcaagaagtttagcaaaggtgctaatccagtggaaatcaagagaagtgtg
atgttagctgttgatgctgtaattgctgaacttaaaaagcagtctaaacctgtgaccacc
cctgaagaaattgctcaggttgctacaatttctgcaaatggagacaaagaaattggcaac
ctaatttccgatgcaatgaaaaaggtcggaaatgtgaattccaagaaaatttctagtgtc
cagtccattgtacctgatcttgaaattgccaatgctcaccgtaagcttttggtcataatt
gctgaaaatgttgatggagaaactctaagtacactcattttgaataggcgaaaacttggt
cttcaggttgtagcagtcaaagctccagggtttggtgacaatagaaagaaccaccttaaa
ggtatgacaattgctactggtggtgcagtgtttggagaagaggagttgacctcaaatctt
gaagatgttcagcctcatgacttaggagaagttggagaggtcactgtgatcaaagatgat
gctatgctcttaaaaggaaaagaaatcattgagcagttggatgtcacaactagtgaatat
gaaaaggaaaaactgaatgaacagctggcaaaactttcagatggagtagctgtgctgaag
gttagtgggacaagtcacgttgaagtgaatgaaaagaaaaacagagttacagatgccctt
aatgctacaagagctgctgttgaagaaggcattgttctgggagggggttgtaccctactt
cgatga

>gi568815577r:28776388_28985345|GENSCAN_predicted_peptide_7|1175_aa
DHDRRVREATQQAFEKLILKVKKQLAPYLKSLMGYWLMAQCDTYTPAAFAAKDAFEAAFP
PSKQPEAIAFCKDEITSVLQDHLIKETPDTLSDPQTVPEEEREAKFYRVVTCSLLALKRL
LCLLPDNELDSLEEKFKSLLSQNKFWKYGKHSVPQIRSAYFELVSALCQRIPQLMKEEAS
KVSPSVLLSIDDSDPIVCPALWEAVLYTLTTIEQNLGEEEIEQMLVNDQLIPFIDAVLKD
PGLQHGQLFNHLAETLSSWEAKADTEKDEKTAHNLENVLIHFWERLSEICVAKISEPEAD
VESVLGVSNLLQVLQKPKSSLKSSKKKNGKVRFADEILESNKENEKCVSSEGEKIEGWEL
TTEPSLTHNSSGLLSPLRKKPLEDLVCKLADISINYVNERKSEQHLRFLSTLLDSFSSSR
VFKMLLGDEKQSIVQAKPLEIAKLVQKNPAVQFLYQKLIGWLNEDQRKDFGFLVDILYSA
LRCCDNDMERKKVLDDLTKACPSSDKHALVTPWLKGDILGEKLVNLADCLCNEDLESRVS
SESHFSERWTLLSLVLSQHVKNDYLIGDVYVERIIVRLHETLFKTKKLSEAESSDSSVSF
ICDVAYNYFSSAKGCLLMPSSEDLLLTLFQLCAQSKEKTHLPESEDSYLMGVYIGSVMPN
DSEWEKMRQSLPMQWCEELDNPPIFLIGFCEILQKMNITYDNLRVLGNTSGLLQLLFNRS
REHGTLWSLIIAKLILSRSISSDEVKPHYKRKERLMPELPQYDQDNLKSYGDEEEEPALS
PPAALMSLLSIQEDLLENVLGCIPVGQIVTIKPLSEDFCYVLGYLLTWKLILTFFKAASS
QVKARATTREVMATYTIEDIVIELIIQLPSNYPLGSIIVESGKRVGVAVQQWRNWMLQLS
TYLTHQNGSIMEGLALWKNNVDKRFEGVEDCMICFSVIHGFNYSLPKKACRTCKKKFHSA
CLVSQPSPNARNVEKTALLLGTSSKAEEGLEKNMKKKSLSEFKVWQEYFPKPIQLTINDN
SISIKSKRSAKCVEEAYTLLNEYGDDMYGPEKFTDKDQQPSGSEGEDDDVEAVLKKEVGD
IEASTEMSLRGFQSVESGANNVAFIRTLGKEPEKLVHHILQDEYKTKKTQVILRMLPISG
TCKAFLKDMKKYAETFLEPWFKAPNKGTFQIVYKS

>gi568815577r:28776388_28985345|GENSCAN_predicted_CDS_7|3528_bp
gatcatgaccgtcgcgtccgagaagccacacaacaagcttttgaaaaacttatccttaaa
gtaaagaaacagttggctccctacttaaaaagtttaatgggatattggctaatggctcag
tgtgatacttacacaccagctgcgtttgcagcaaaagatgcatttgaagcggcttttcct
ccaagcaagcaacctgaagccatagcattttgtaaggatgaaattacaagtgtgctgcag
gatcatcttataaaagaaacacctgatacactcagtgacccgcaaactgttccagaggaa
gaaagagaagctaaattctaccgggttgtaacttgttccttattggcattaaagagatta
ctttgccttttacctgataatgagcttgattctctggaggagaaatttaagtctctttta
tcacagaataagttttggaagtatggaaaacacagtgtacctcagattcgctcagcttat
tttgagttagtctctgcattgtgccagcgcattccacagttgatgaaagaggaagcatcc
aaagtgagcccatcagttctacttagcattgatgacagtgacccaattgtctgcccagct
ctctgggaagctgtactctatacacttacaactattgagcaaaacttaggtgaggaagag
attgaacagatgctcgtcaatgatcagttgatcccttttattgatgcagttctcaaagac
ccaggattgcaacatgggcagctatttaaccatttagcagaaactctaagttcctgggaa
gccaaagcagacacggaaaaagatgaaaaaacagctcacaacttggagaacgtactgata
catttctgggaaagactgtcagagatctgtgttgcgaaaatcagtgagccagaagctgat
gttgagtccgttttgggtgtatctaacctattacaggtgcttcagaagccgaagagctca
ttgaagtcaagtaaaaaaaaaaatggtaaggttagatttgctgatgagatacttgaaagc
aataaagagaatgaaaaatgtgtatcttcagaaggagagaagattgaaggctgggaatta
acaactgaaccttctctcactcataattcttcaggccttttgtctcctctaaggaaaaaa
cctttggaagacttagtctgtaaactcgcagatataagtattaattatgtcaatgaacga
aagtcagagcaacatctaaggtttctttctactctgcttgactccttttcttcaagccga
gtatttaaaatgctacttggtgatgaaaaacagagtattgtccaagccaaacctcttgaa
atagccaagcttgtacaaaaaaatcctgcggtgcagtttttataccagaaactgataggt
tggctaaatgaagatcaaaggaaggattttggtttcctggtggacattttgtacagtgct
ctccggtgctgtgacaatgatatggaaagaaaaaaagtcttggatgatctaaccaaggca
tgtcctagttcagataaacatgctttagtaactccttggctcaaaggcgatatccttggt
gagaaattggtcaacttggcagattgtctttgtaatgaggacttggaatccagggtatct
tcagaatctcacttctcagaaagatggactcttctaagcttggtattatcccaacatgtt
aaaaatgattacttgattggagacgtatatgttgaaagaatcattgttagacttcatgaa
actttattcaaaacaaagaaattatcagaagctgaaagcagtgactcatcagtgtctttt
atctgtgatgtggcctataactatttcagctcagcgaaaggatgcttgctaatgccatca
tctgaagatttattattaactctctttcagttatgtgctcagagcaaagaaaaaacacat
ttgccagagagtgaagattcttatcttatgggagtttatattggaagtgtaatgccgaac
gacagtgaatgggaaaagatgaggcagtctcttcctatgcagtggtgtgaagaattagat
aacccacctatttttctaattggattttgtgaaatacttcaaaaaatgaatattacgtat
gataacttacgtgtacttggtaatacgtcgggccttttgcagctgttatttaacaggtcc
agagaacatggcacactgtggtctcttattattgctaagttgatcctttcccgaagcatt
tcatctgatgaagtaaaaccacattataagagaaaagaaagattgatgcctgaattacca
cagtatgatcaggataatctaaagtcatacggagatgaagaagaagagccagccttgtca
ccaccagcagcactgatgtctcttcttagcattcaagaggacttactagaaaatgttttg
gggtgtattcctgttggacagatagttactattaaaccactgagtgaagacttctgttat
gttctgggataccttctcacttggaaattaatactaactttcttcaaagctgcatcatca
caggttaaagctcgagctactactcgagaggtaatggctacttatactattgaggacata
gttattgaacttataatacaactgccttcaaattatccactgggttcaataatagtagaa
agtgggaaaagagtaggagtagctgttcagcagtggcggaactggatgctgcagttaagc
acttacctcacccatcagaatggaagtattatggaaggcttagctttatggaaaaataac
gtagacaaacgttttgagggtgttgaagattgcatgatctgtttctcagtcattcacggt
ttcaactattcccttcccaaaaaagcctgtagaacatgcaagaaaaaattccattcagcc
tgcttggtgagccaaccaagtcctaatgctagaaatgttgaaaaaactgccttactcctt
gggacttcttcaaaagcagaggaagggctggagaaaaacatgaaaaagaaaagcttatcg
gaattcaaggtttggcaagaatactttcccaagcccatccagcttaccatcaatgataat
tcaatttcaatcaaatcaaagagaagcgccaagtgtgtggaggaggcttacaccctgctc
aacgaatatggcgatgacatgtatgggccagaaaagtttacagacaaggatcagcagccc
tctggaagtgagggagaagatgatgatgtggaggctgtcttgaagaaagaagttggtgac
attgaggcatctacagagatgagcctaagaggattccagtcagtggaaagtggagcaaat
aacgtcgccttcatcaggacacttgggaaagaacctgagaaactggtgcatcatattctc
caggatgagtacaaaaccaagaagactcaagttattctacgaatgttacccatctcaggc
acatgcaaagcttttttaaaagatatgaaaaaatatgcagaaacatttttggaaccctgg
tttaaagctccaaacaaagggacatttcagattgtgtacaaatcttga