GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:10:37 Sequence gi568815586r:102302510_102580381 : 277872 bp : 40.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 101 96 6 1.05 1.03 Term - 2445 2119 327 0 0 46 44 257 0.819 10.82 1.02 Intr - 2960 2727 234 2 0 36 93 101 0.461 2.36 1.01 Init - 3142 3089 54 1 0 82 47 56 0.512 2.23 1.00 Prom - 5752 5713 40 -3.65 2.03 PlyA - 6268 6263 6 1.05 2.02 Term - 8711 8218 494 0 2 39 39 221 0.227 6.08 2.01 Init - 10370 10070 301 2 1 88 -8 323 0.554 20.16 2.00 Prom - 14249 14210 40 -4.85 3.00 Prom + 22920 22959 40 -5.65 3.01 Init + 30069 30300 232 2 1 46 86 166 0.821 10.67 3.02 Intr + 47992 48068 77 2 2 77 42 65 0.007 -0.88 3.03 Term + 59142 59348 207 2 0 36 42 249 0.231 11.46 3.04 PlyA + 60191 60196 6 1.05 4.00 Prom + 60519 60558 40 -11.14 4.01 Init + 60724 60959 236 0 2 82 96 106 0.310 8.46 4.02 Term + 61695 61866 172 0 1 98 43 136 0.991 6.42 4.03 PlyA + 62958 62963 6 1.05 5.04 PlyA - 63436 63431 6 1.05 5.03 Term - 67546 67385 162 1 0 52 55 72 0.170 -2.75 5.02 Intr - 67870 67680 191 2 2 -60 75 199 0.364 2.18 5.01 Init - 68054 67976 79 2 1 75 88 68 0.496 6.77 5.00 Prom - 68856 68817 40 -4.15 6.04 PlyA - 69014 69009 6 1.05 6.03 Term - 82324 81932 393 1 0 93 42 236 0.982 13.55 6.02 Intr - 100693 100601 93 1 0 25 47 120 0.009 0.84 6.01 Init - 117209 117000 210 2 0 77 121 147 0.905 15.73 6.00 Prom - 118870 118831 40 -6.45 7.00 Prom + 119092 119131 40 -5.85 7.01 Init + 130152 130268 117 2 0 104 19 69 0.273 1.85 7.02 Intr + 135749 136017 269 1 2 63 39 183 0.388 6.31 7.03 Intr + 137755 137988 234 1 0 65 72 142 0.224 6.18 7.04 Term + 145216 145528 313 1 1 48 34 148 0.264 -1.11 7.05 PlyA + 146178 146183 6 1.05 8.04 PlyA - 146600 146595 6 1.05 8.03 Term - 147954 147668 287 1 2 3 48 265 0.023 8.58 8.02 Intr - 173290 173134 157 1 1 111 89 147 0.439 15.76 8.01 Init - 188740 188138 603 1 0 70 92 242 0.204 18.21 8.00 Prom - 192076 192037 40 -9.35 9.09 PlyA - 192157 192152 6 1.05 9.08 Term - 193362 193120 243 0 0 20 32 204 0.249 2.92 9.07 Intr - 198933 198856 78 0 0 106 95 33 0.396 4.63 9.06 Intr - 210480 210324 157 2 1 104 89 81 0.928 8.89 9.05 Intr - 216866 216813 54 1 0 82 108 67 0.103 5.28 9.04 Intr - 217556 217502 55 0 1 104 80 -4 0.047 -2.48 9.03 Intr - 220692 220639 54 1 0 93 89 20 0.231 0.63 9.02 Intr - 221825 221696 130 2 1 73 36 92 0.166 1.85 9.01 Init - 232462 232373 90 1 0 81 72 130 0.611 11.24 9.00 Prom - 241599 241560 40 -6.45 10.03 PlyA - 241913 241908 6 1.05 10.02 Term - 254894 254671 224 0 2 86 38 149 0.846 5.80 10.01 Init - 259493 259277 217 2 1 76 61 95 0.392 4.50 10.00 Prom - 277176 277137 40 -3.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 90669 90848 180 2 0 54 41 177 0.820 4.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_1|204_aa MVALSNRKDEKQKGEREKSHQLSSFPFWGRKKLSVSHDPNSVTHSCQQRVQGRLFQRKQQ LTSRSANPVLSQKGLYQEPSFVNVLQCIVVHSERSTFLSDPVRCKRPLTGSKPVNSRIKS VPGPSPVSVTTPNPVWIRNLPKETQRAQNTNPWSSKIQEGTYPRCPAALRDQWTQVGPAG TLRVYSALLGVTRSSTLDPASDII >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_1|615_bp atggttgccctgagtaacagaaaagatgaaaaacagaaaggagagagagaaaagtcccat cagctttcaagttttcccttttgggggaggaaaaagctctccgtgtcccacgatcctaat tctgtcacccatagttgtcagcaaagagtgcaaggcagattattccaaagaaaacagcag ttgacatcccgtagtgccaaccctgttcttagccaaaagggactttaccaagagccctca tttgtaaatgtacttcagtgcattgttgttcattcggaacgttccactttcctttctgac ccagtcagatgtaagaggcctctaactggatccaagccagttaattcccggatcaaatct gttcctggacccagtccagtttctgtcacaactccaaacccagtttggatcagaaatttg cccaaagaaactcagagagctcaaaacacaaatccgtggagctccaaaatccaagaggga acttacccacgatgcccagctgctctgagagatcaatggacacaagtgggtcctgcaggt accttgcgtgtttactcagcactcctgggggtcactagaagctccactttggatcctgct tctgacattatctga >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_2|264_aa MGKKQKRKTGNSKKQSASPPPKKRSSSPATEQSWMENDFDELREEVFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELPPHHTYSKIDHTVGSKALPS KCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFE ANENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTYSKAS RRQEITKIRAELKEIETQKTLQKN >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_2|795_bp atggggaaaaaacagaaaagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaagaaacgcagttcctcaccagcaacagaacaaagctggatggagaatgactttgac gagctgagagaagaagtcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta ccaccacaccacacctattccaaaattgaccacacagttggaagtaaagctctccccagc aaatgtaaaagaacagagattataacaaactatctctcagaccacagtgcaatcaaacta gaactcaggattaagaatctcactcaaaaccgctcaactacatggaaactgaacaacctg ctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctttgag gccaacgagaacaaagacacaacataccagaatctctgggacgcattcaaagcagtgtgt agagggaaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgac accctaacatcacaattaaaagaactagaaaagcaagagcaaacatattcaaaagctagc agaaggcaagaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacc cttcaaaaaaattaa >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_3|171_aa MKKYSEENTKGVPGLSFNEELMGLYKQQHCQFKLKEMQKGLKEGRLLDSSGFVGEKMKES FGCEHVLSFNKRKNDPKDSTLLKGERSKEGQSINASIGHFLHLEASHRWERERERERKKK EEGRRSRNRRRKEEAEEEEEEEEEEDGAEENDEGRKEKKEKKGEERANFFK >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_3|516_bp atgaaaaagtattcagaagagaacaccaagggtgtgcctggactgtcattcaatgaagag cttatgggactatataagcagcaacattgccagtttaaactgaaggagatgcagaaagga ctaaaggaaggaaggttattagactcatcaggatttgtgggggaaaagatgaaagagtct tttggctgtgaacatgtgctatccttcaataagaggaaaaatgaccctaaagactccaca ctcttaaaaggagagagaagcaaagaaggacagtcaatcaatgccagcattggtcatttt ctgcaccttgaagcatcacacaggtgggagagagagagagaaagagagaggaagaagaag gaagaaggaagaagaagcagaaacagaagaagaaaagaagaagcagaagaagaagaagaa gaggaggaggaggaggacggagcagaagaaaacgacgaaggaaggaaggagaagaaggag aagaaaggagaggagagggctaatttttttaaataa >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_4|135_aa MTCVPAMRPETEYVQPDHGNPRLAVFVLKPALNQIVEFNSVYGQHRETPVPSAGPFGELM PSLLQGAMFPRFHYQLVFRLYCWRKKPSSNTNKHKHKDKTGMDAELQGSHRSIEVMSGSN SITSNFKGQQWLQGN >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_4|408_bp atgacttgtgttccagccatgagacctgagacagagtacgtccagcctgaccatggcaac cccaggcttgcagtattcgtgctgaagcctgctctgaatcagatagttgagttcaatagt gtttatgggcaacatagagagactcctgtgccttctgctggaccctttggagagctaatg ccatctctgttgcagggagctatgtttcccaggttccattaccaactggttttcagactt tactgttggagaaaaaagccttccagtaatacaaataaacacaaacacaaagacaaaaca gggatggatgcagaacttcaaggatcacatagatctattgaagtgatgagtggcagcaac agcattacttcaaatttcaagggacagcaatggctgcagggcaattag >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_5|143_aa MLIVIWTTKSRVISDGDEKILGNYSKEEISKWQNIREKAQHKSLENLQADNTIEKKNPFS GEKFKPAGEICLSNKVQNVNHQDNGENVSRQLQLQPWLKGTKVQLGSWLHKVQAPSLGSF HMVLSLQVHRSQELRFGNLCLDF >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_5|432_bp atgctgatagtgatatggacaacaaagtccagggtgatctcagatggagatgagaaaatt cttgggaactacagcaaagaagaaatttctaagtggcaaaacattcgagagaaagcacag cataaaagtttggaaaatttgcaggctgacaatacgatagaaaagaaaaacccattttct ggggagaaattcaagcctgctggagaaatttgcctaagtaacaaggtgcagaacgttaat caccaagataacggggaaaatgtctccaggcagcttcagctccagccatggctaaaaggg accaaggtacagcttggatcatggcttcataaagtgcaagctccaagccttggcagcttc catatggtgttgagcctgcaggtgcacagaagtcaagaattgaggtttggaaatctctgc ctagatttctga >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_6|231_aa MRPLCLICADKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKSARSVRAQ RHTDMPKTQKFIDEETETQEIATHTSLHSQKIEKLRVELSPLTPHYKGLTYLIHHIQLPK NYKPYQKTKTSFEKTEQASEPNMAGILELSDQEFKTTVINMPRALMDKGDSIQEHMDNVS KQMEILRKNEIEMLDIKNTITEIKNDFDGLLSRLDTVEERISEHEDTAIET >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_6|696_bp atgaggccactctgtttgatttgtgcagacaagcccacagggtatggctccagcagtcgg agggcgcctcagacaggcatcgtggatgagtgctgcttccggagctgtgatctaaggagg ctggagatgtattgcgcacccctcaagcctgccaagtcagctcgctctgtccgtgcccag cgccacaccgacatgcccaagacccagaagtttatagatgaagaaacggaaacacaagag attgccactcatacaagtttacacagccagaaaatagaaaagctacgagttgagctcagc ccacttaccccacactacaaggggcttacttacctgatacatcacatccaactgccaaaa aattataaaccataccaaaagacaaaaacatcatttgaaaagacagagcaagcatcagaa ccaaatatggcagggatattggaattatcagaccaggaatttaaaacaactgtgattaac atgccaagggctctaatggataaaggagatagcatccaagaacacatggacaatgtaagc aaacagatggaaatcctaagaaagaatgaaatagaaatgctagatatcaaaaacaccata acagaaataaagaatgactttgatgggcttcttagtagactggacacagttgaggaaaga atctctgagcatgaggatacagcaatagaaacttaa >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_7|310_aa MGRNQDLSLASKGFLSRSKTKENEIDDKGLWSLNFENHQRHRRLACTLAAGDLESHPKCQ ILCQRTPKDPRDLFSSAFTKELFKKFRAKSLGNGCKRYLILQSLKVHQPIQDGRGTAYSS STVKKSVSCPHRERTALVLMGLPKYWDGPASPSCVGLEDVDSVGDSRFLQESWNVRAARH FRANPDPSPCYSNQETENLEDQMILSRKLKLDRFLTSYTKINSRWIKDFNVRPKTIKTLE ENLGNTIQDIGMDKHFLTKTPIAMARKAKTDKWDVIELKSFCTAKETIIRVNRQPTEWEK IFSIYPSDKG >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_7|933_bp atggggcgtaaccaagacttgagtttggcatccaaaggttttctaagtagaagtaaaacc aaggaaaatgaaattgatgacaagggactctggtctctcaactttgaaaatcaccaaaga catcgacgactggcctgcacattggctgctggtgatttggaatcacatcccaaatgccaa atcctatgtcaaaggacacctaaggatccaagagatcttttcagctcagcattcaccaaa gagttgttcaagaagttccgtgcaaaaagcttaggaaacggctgtaaacgttatcttatt ttgcagagtcttaaagttcaccagcctattcaagatggtcgaggaacagcctattcaagt tccacagtgaagaaatctgtttcatgcccccacagagagaggacagctttggtcctgatg ggcctccccaaatactgggatgggcctgcttctccatcttgtgtgggattggaagatgtg gattcagttggagatagccgatttctccaagaatcatggaatgtcagagctgcaagacat ttcagagcgaatccagacccatctccttgctactcaaatcaggaaactgagaacttggaa gaccaaatgattctctccagaaaattgaaactggaccgcttccttacatcttatacaaaa attaactcaagatggattaaagactttaatgtaagacctaaaaccataaaaactctagaa gaaaacctaggcaataccattcaggacataggcatggacaaacacttcctgactaaaaca ccaatagcaatggcaagaaaagccaaaactgacaaatgggatgtaattgaactaaagagc ttctgcacagcaaaagaaactatcatcagagtgaacaggcaacctacagaatgggagaaa attttttcaatctatccatctgacaaagggtaa >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_8|348_aa MDKLLDTYTLPRLNQEEVESLNKPITSTEIEAVINSLPTKKISGPDGFTAKFFQRYKEEL VPFLLKLYQTREKEGLLPNSFYEASIILIPKPGTDTTKKENFRPISLMNVNAKILNKILA NRIQQHIKKLIHDDQVSFIPGMQGWFSICKPINIIHHINRTDDKNHMIISRDAEKSFNKI QQPFMLKTLNKLGISGTYLKIVKMHTMSSSHLFYLALCLLTFTSSATAGPETLCGAELVD ALQFVCGDRGFYFMEQCTMAVSIRGRELLGPSEQEMLHKESGKQRQKANTIPVTSKIVHL ALYATLLLFVMEQFLGESHKSREIFSFEQQISELGKESMKFSEEKEKE >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_8|1047_bp atggataaattgctggacacatacaccctgccaagactaaaccaggaagaagtcgaatcc ctgaataaaccaataacaagtactgaaattgaggcagtaattaatagcctaccaactaaa aaaatctcaggaccagatggattcacagccaaattcttccagaggtacaaagaggagctg gtaccattccttctgaaactataccaaacaagagaaaaagagggactcctccctaactca ttttatgaagccagcatcattctgataccaaaacctggcacagacacaacaaaaaaagaa aatttcaggccaatatccctgatgaacgtcaatgcaaaaatcctcaataaaatactagca aaccgaatccagcagcacatcaaaaagcttatccatgatgaccaagtcagcttcatccct gggatgcaaggctggttcagcatatgcaaaccaataaacataatccatcacataaacaga accgatgacaaaaaccacatgattatctcaagagatgcagaaaagtccttcaataaaatt caacagcccttcatgctaaaaactctcaataaactaggtatcagtggaacatatctcaaa atagtgaagatgcacaccatgtcctcctcgcatctcttctacctggcgctgtgcctgctc accttcaccagctctgccacggctggaccggagacgctctgcggggctgagctggtggat gctcttcagttcgtgtgtggagacaggggcttttatttcatggagcagtgtacgatggct gtttccattcgtggcagagagctgctggggccatcggagcaagagatgcttcacaaagaa agtggaaagcagcgccaaaaggcaaatacaattcctgtcacatccaaaatagttcattta gctctttatgctactctcttactctttgtcatggagcaatttctaggagagtcgcataaa agtagggagatttttagctttgagcagcaaataagtgaactgggcaaggagagcatgaaa tttagtgaggaaaaagaaaaagaatga >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_9|286_aa MWDELTKPTRESNESNGSWTSIGSQTMERGPLTLGTVWAEMRNGDEHFARHRTSPTTGIY PAPNVNNAKVENPDLSGHDVRQTIYFTKIMTVRWVRWDRSNVTIWVHTLSPRELFQAHGN IGVKLRTKIRVLVLISVSVLGAMLPAEQAVLLWIYLHTFPSKLFPLPIPLPLPTAANTQM SFTKALWRHQINMPKSLSTLNEKAFPLPLDVFAFQSSGVNLITLAPRREAHQSRMWDCYL DGNPSWERACQWVGSALTPARSKSEVTIFLVKSQGCEAREQLIKGN >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_9|861_bp atgtgggatgaattaactaagccaaccagggaaagcaacgaaagcaacggcagttggact tcaatcggttcccagacaatggagcgtgggcctcttactcttggtactgtgtgggcagag atgaggaatggtgatgaacattttgcaaggcacaggacatcccccacaacaggtatttat ccagccccaaatgtgaataatgccaaggttgaaaatccagatctatcaggacacgatgtt aggcagacaatttattttactaagataatgacagtcagatgggttaggtgggaccgatca aatgtcacaatttgggttcacacactcagtccacgagaacttttccaggcacatggaaat attggcgtcaaattaagaaccaaaatcagggtcctggtccttatctcagtctctgtgctt ggggccatgctccctgctgaacaggcagtccttctctggatctacctccacaccttccca tcaaagctttttccattgccaattccactacctttgcccacagcagccaacacccagatg agcttcactaaagctctttggcgtcatcaaataaacatgcccaaatctctttcaacttta aatgaaaaggctttccctttgcctctggatgtcttcgctttccaaagcagtggagtgaat ttaataactttggcacctcgcagagaagctcaccagtccaggatgtgggattgctacctc gatgggaacccaagctgggaaagagcctgccagtgggttggcagtgctctgacgcctgca agatccaaatctgaagtcaccattttccttgtgaagagtcagggatgtgaagccagagag cagttgattaaagggaactag >gi568815586r:102302510_102580381|GENSCAN_predicted_peptide_10|146_aa MPISCYGGAGKVEAQALENSELQPLNVDSKGCISNLNMDLEQVQTKRMKLLFNKDLSTQR EDFCVKISEIWPDASFQLQRVLNCSAVLELNEKLSANPVPSFSPHRRADTTRFEDSGWLW VRSGFGLVVWHELQKMRNEHTDVNPY >gi568815586r:102302510_102580381|GENSCAN_predicted_CDS_10|441_bp atgcctatttcctgctatggtggtgcagggaaggtagaagcacaggctctagagaattca gaattgcaacctttaaatgtagatagcaaaggctgcatttcgaatctaaatatggacctt gagcaagtacagacaaagagaatgaagcttctttttaataaggatcttagtacccagaga gaagacttttgtgtgaaaatatcagagatttggccagatgcctccttccaactccaaagg gtgctcaactgctcagcagttctagaacttaatgagaaactctctgctaaccctgtccca agtttttctccccaccgcagagcagatacaacccgctttgaagactcgggttggctctgg gttagaagtgggtttggattggttgtgtggcatgagctgcagaagatgaggaatgagcat actgacgtgaacccatattag