GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:04:59 Sequence gi568815575r:21555857_21757788 : 201932 bp : 38.17% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 83 191 109 1 1 88 54 42 0.370 1.23 1.02 Intr + 5615 5704 90 0 0 83 107 36 0.860 4.05 1.03 Term + 9063 9157 95 1 2 108 49 94 0.753 4.51 1.04 PlyA + 10230 10235 6 1.05 2.03 PlyA - 10520 10515 6 1.05 2.02 Term - 18122 17944 179 0 2 -10 41 158 0.231 -1.83 2.01 Init - 18377 18242 136 2 1 53 57 153 0.827 9.05 2.00 Prom - 21132 21093 40 -6.15 3.00 Prom + 22621 22660 40 -6.75 3.01 Init + 23058 23212 155 2 2 57 72 187 0.557 13.50 3.02 Intr + 33813 33856 44 0 2 43 96 9 0.111 -5.83 3.03 Intr + 35166 35338 173 1 2 57 94 107 0.904 6.94 3.04 Intr + 39118 39191 74 0 2 65 92 54 0.956 0.79 3.05 Intr + 39468 39539 72 0 0 125 103 42 0.990 7.00 3.06 Intr + 45426 45493 68 0 2 78 98 70 0.962 4.63 3.07 Intr + 50923 51023 101 2 2 63 123 99 0.995 9.81 3.08 Intr + 53215 53761 547 0 1 47 106 486 0.690 37.83 3.09 Intr + 84205 84290 86 2 2 55 15 91 0.137 -2.88 3.10 Intr + 85677 85803 127 2 1 44 72 123 0.633 5.63 3.11 Intr + 92975 93171 197 0 2 90 116 34 0.621 4.61 3.12 Term + 96450 96665 216 2 0 123 36 122 0.980 6.66 3.13 PlyA + 97711 97716 6 1.05 4.06 PlyA - 98367 98362 6 1.05 4.05 Term - 101954 99998 1957 1 1 66 42 2116 0.381 189.15 4.04 Intr - 102443 102285 159 1 0 58 64 203 0.807 13.08 4.03 Intr - 103322 103216 107 2 2 82 94 26 0.911 0.69 4.02 Intr - 105687 105560 128 1 2 70 78 41 0.626 0.78 4.01 Init - 111704 111644 61 2 1 76 75 95 0.510 8.46 4.00 Prom - 113495 113456 40 -5.45 5.00 Prom + 116292 116331 40 -6.45 5.01 Init + 118522 118802 281 0 2 38 37 129 0.308 -0.67 5.02 Intr + 120832 120978 147 1 0 137 9 90 0.028 4.43 5.03 Intr + 129853 129914 62 1 2 98 115 43 0.339 5.46 5.04 Term + 130956 131299 344 1 2 13 43 194 0.271 1.09 5.05 PlyA + 131374 131379 6 1.05 6.05 PlyA - 131488 131483 6 1.05 6.04 Term - 132491 132333 159 2 0 59 38 143 0.211 3.36 6.03 Intr - 134788 134706 83 2 2 71 110 7 0.272 -0.36 6.02 Intr - 141388 141188 201 2 0 95 66 62 0.784 3.04 6.01 Init - 141726 141642 85 0 1 74 64 72 0.650 4.43 6.00 Prom - 143534 143495 40 -3.95 7.09 PlyA - 145509 145504 6 1.05 7.08 Term - 149393 149283 111 2 0 99 42 78 0.052 1.88 7.07 Intr - 155179 155016 164 2 2 73 65 62 0.019 1.17 7.06 Intr - 156867 156730 138 1 0 113 7 61 0.013 0.01 7.05 Intr - 177901 177800 102 2 0 95 93 77 0.657 8.13 7.04 Intr - 178512 178385 128 2 2 65 46 87 0.721 1.60 7.03 Intr - 181841 181708 134 2 2 88 -1 171 0.469 6.72 7.02 Intr - 187980 187894 87 0 0 127 78 93 0.990 11.45 7.01 Init - 198434 198390 45 2 0 43 111 47 0.395 3.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 103034 102765 270 1 0 44 21 199 0.873 4.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_1|97_aa MGMYVDKKSRDRTSSDLIGKQFCKAQFVRKMKSIGEGKLRPISMPVEYNWVGDYEDPNKM KRDSRRDIQSQPEISTLSRLKGDHFCYYHYSEELYEH >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_1|294_bp atgggaatgtatgtagataagaaatcaagagacagaactagttctgacttgattggcaaa cagttctgtaaagctcagtttgtcagaaaaatgaagagtattggtgagggcaagctacga cctatatctatgccagtggaatataattgggtgggggactatgaagatccaaataagatg aagagagatagtagaagagacattcagagtcagcctgaaatcagtactcttagtcgtttg aaaggggaccacttttgttattaccactattcagaagagctgtatgagcactga >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_2|104_aa MWESLELLRHLLNGFDQNADSDMYNEVQAEVVSDEDEKLVGNWSKEQSVQEEADHKNLEK LQPDNAVEKKNPFSGEKFKPAAEICMSNKEPNVNHQNNGENVSR >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_2|315_bp atgtgggaaagtttggaacttcttagacacttgttgaatggctttgaccaaaatgctgat agtgatatgtacaatgaagtccaggctgaggtggtctccgatgaagatgagaaacttgtg gggaactggagcaaagagcaaagtgttcaagaggaagcagaccataaaaatttggaaaag ttacagcctgacaatgcagtagaaaagaaaaacccattttctggggagaaattcaagcct gctgcagaaatttgcatgagtaacaaggagccgaatgttaatcaccaaaacaatggggaa aatgtctccaggtga >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_3|619_aa MSTNQSISSNFNKKLVPFSMEIHRFMVPYSGAEGGDPIGAEGVVDLAADSHQHIKLFRYL AWSEQTGPIAGKSKRRISCKDLGRGDCEGWLWKKKDAKSYFSQKWKKYWFVLKDASLYWY INEEDEKAEGFISLPEFKIDRASECRKKYAFKACHPKIKSFYFAAEHLDDMNRWLNRINM LTAGYAERERIKQEQDYWSESDKEEADTPSTPKQDSPPPPYDTYPRPPSMSCASPYVEAK HSRLSSTETSQSQSSHEEFRQEVTGSSAVSPIRKTASQRRSWQDLIETPLTSSGLHYLQT LPLEDSVFSDSAAISPEHRRQSTLPTQKCHLQDHYGPYPLAESERMQVLNGNGGKPRSFT LPRDSGFNHCCLNAPVSACDPQDDVQPPEVEEEEEEEEEEGEAAGENIGEKRAKLRRCFQ SLAIEFGNCDSITRALVQFSANTLRELVEPLHAKSDPLLLALNPLDSQIDKLMFREFRSE RVGESREEKLGDSLQDLYRALEQASLSPLGEHRISTKMEYKLSFIKRCNDPVMNEKLHRL RILKSTLKAREGEVAIIDKVLDNPDLTSKEFQQWKQMYLDLFLDICQNTTSNDPLSISSE VDVITSSLAHTHSYIETHV >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_3|1860_bp atgagcacaaaccagagcatttcctccaacttcaacaagaagctggtacccttctccatg gagattcacagattcatggtcccatactcaggtgctgaaggaggagatccgattggagct gaaggagttgtagacctggctgctgactcacaccaacatatcaagctcttcaggtactta gcatggagtgaacagacaggtcctatagcaggcaagagcaaaagacgaatttcttgcaaa gatcttggccgtggtgactgtgagggctggctttggaaaaagaaagatgcgaagagttac ttttcacagaaatggaaaaaatattggtttgtcctaaaggatgcatccctttattggtat attaatgaggaggatgaaaaagcagaaggattcattagcctgcctgaatttaaaattgat agagccagtgaatgccgcaaaaaatatgcattcaaagcctgtcatcctaaaatcaaaagc ttttattttgctgctgaacatcttgatgatatgaacaggtggcttaacagaattaatatg ctgactgcaggatatgcagaaagagagaggattaagcaggaacaagattactggagtgag agtgacaaggaagaagcagatactccatcaacaccaaaacaagatagccctccaccccca tatgatacatacccacgacctccctcgatgagttgcgccagtccttatgtggaagcaaaa catagccgactttcctccacggagacttctcagtctcagtcttctcatgaggagtttcgc caggaagtaactgggagcagtgcagtgtctcccattcgcaagacagccagtcagcgccgc tcctggcaggatttaattgagacgccactgacaagttcaggcttacactatcttcagact ctgcccctggaggattctgtcttctctgactccgcggccatctccccagagcacaggcgg cagtctaccctgccaactcagaaatgccacctgcaggatcactatgggccatacccctta gctgagagtgagaggatgcaagtgctaaatggaaatgggggcaagcctcgaagttttact ctgcctcgagatagcgggttcaaccattgctgtctgaatgctccagttagtgcctgtgac ccacaggatgacgtgcaacccccagaggtggaggaagaggaggaggaggaggaggaggaa ggggaggcagcaggggaaaacataggagaaaaaagagctaaactgaggagatgtttccag tctcttgcaatagaattcggcaattgtgactcgattaccagggctctagttcagtttagt gctaatacactgcgagagttggtagaacctctccatgccaaatcggatccacttctgttg gcactcaacccattggactcacagattgataagctaatgtttagagaatttagatcggag agagtcggtgaaagcagagaagaaaagttaggagactcattgcaagatttatacagggca ctggagcaggccagtctgtcaccactaggagaacatcgtatttcaaccaagatggaatac aagctatcatttataaaaagatgtaatgatcctgtaatgaatgaaaaactacaccggctg agaattctcaaaagcactttaaaggccagagaaggggaagtagccattatcgataaagtc ctagacaatccagacttgacatctaaagaattccaacaatggaagcagatgtacctcgac cttttcttggatatctgtcaaaataccacctcaaatgacccactgagtatttcttctgaa gtagatgtaatcacttcctctctagcacacactcattcatacattgaaacgcatgtctaa >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_4|803_aa MREMEVSAEARIRAGSYREKCWVTKRASREPKSLWSECIRPPEIHMLKLNLQYDSIKKCD FWEEVYNEWQKIQVNHSFLVHAAKGWVGAGSKNRKGKLRTRPPRPPRRGQSPRPALPGRR ICAVGHPAPPESTGNSVLPGDGVQPPPPLACRLLPALGRMSYFLSYCKAHGGALLTGYQA LRAEGFLCDVTLETEGSEFPAHRSLLACSSDYFRALFKSHTQESRARVIHLHVPSAAGLQ RLLDFIYTAWLSLSMDTVEDTLEAASYLQVTEALGLCGRYLERQLAPENCCFAANVAARF GLAHTLDAAERCIVSHLQELLARGAGPAGLLELNPTSLRAVLGAPDVARVPEARLLGLAL AWLRQEPTTERLAHCTELLERVRFGLVPADVLRRVYSGSGLVLPARVKGLIIQALNYHTT PSRQPLMQGEQTSIRSPQTRILLVGGRRAREVVIEEVAAPQRAARGQVAAPEPEEEEEEL EEEEEEEEWELTQNVVAFDVYNHRWRSLTQLPTPLLGHSVCTAGNFLFVLGGESPSGSAS SPLADDSRVVTAQVHRYDPRFHAWTEVPAMREARAHFWCGAVGERLLAVGGLGAGGEVLA SVEMYDLRRDRWTAAGALPRALHGHAGAVGDRGVVYISGGKAGRGEGGASSLRDLYVLGP EEQVWSKKAPMGTARFGHHMAVLRGAVFAFLGRYEPFSEIERYDPGADQWTRLRPLPYDR FCYGLAVVEETALLLGGLKWRDSRQVPTRNVVGYDLDLDRWEDIGCALPWAWSGLRCAVL QLAEGGDDEREGEVGEALDLVLG >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_4|2412_bp atgagggagatggaggtgtcagcagaagccagaattagagcagggagttatcgggaaaaa tgttgggtcacaaagcgtgcttctagggaacccaaatcgctatggtctgaatgtatacgt cctcctgaaattcatatgttgaaacttaatctccaatatgatagtattaagaagtgtgac ttttgggaggaagtgtacaatgagtggcagaaaatccaagtcaaccacagtttccttgtt catgctgcaaagggatgggtcggggcagggtccaagaataggaagggaaaactcagaacc cgtcctcctcgtccaccccggagaggtcagagtcctcgccctgcactcccgggacgccgc atctgcgccgtcggacatccagcacccccggagtccaccggcaactcggttcttccaggt gatggtgtgcagccaccgccgcccctggcgtgcaggctgcttccagccctgggtagaatg agttacttcctgtcttactgcaaagctcatggcggcgcgctgctcaccggctaccaggcc ctgcgcgccgagggcttcctgtgcgacgtgacactggagaccgagggcagcgaattcccg gcgcacaggtcgctcctggcgtgctccagtgactacttcagggccctgttcaagagccac acccaggaatcccgggcgcgcgtgatccacctgcacgtgccatcggcagccggcctgcag cgcctgctggacttcatctacactgcctggctgtcgctttccatggacactgtagaggac actctggaggccgccagctacctgcaggtcactgaggccctggggctctgtgggcgctac ttggagcgccagctggctccagagaactgctgcttcgccgccaacgtggcagcgcgcttt ggcctggctcacacgctggacgcggccgagcgctgcatcgtgagccacttgcaggagctg ctggcgcggggcgcgggccccgcgggactcctggagctcaaccctacatcgctgagggct gtactgggtgcccccgacgtggcgcgggtgcccgaggcccggctactgggcctggcgtta gcttggttgcggcaggagcccacaactgagcgcctggcacactgtacagagttgctggag cgtgtccgctttggcctggttcccgccgacgtactgcggcgcgtgtactcgggctctggc ctcgtgctgcccgcccgggtcaagggcctcatcatccaggccctcaactaccacacgacg ccctcccgccagccgctcatgcagggcgagcagaccagcatccggagcccccagacccgc atcttgttggtgggggggcgcagggcacgggaggtggtgattgaggaggtcgcggccccg cagagggcagctaggggccaggtcgccgccccagagcccgaggaagaagaggaagagttg gaggaagaggaggaggaggaggagtgggagctcacccagaacgtggtggccttcgatgtg tacaatcaccgctggcgcagccttacgcagctacccacaccactgctggggcacagcgtg tgcaccgcgggcaacttcctgtttgtcctgggtggggagagcccttccggcagtgcatcc tctcccctggccgacgactcgcgggtggtcacggcccaagtgcaccgttacgacccgcgc ttccacgcttggacggaagtgcccgccatgcgggaagcgcgggcccacttctggtgcggc gcggtgggcgagaggctcctggccgtcgggggcctgggtgcgggcggtgaggtgctggcc tcggtggagatgtacgacctgcgtcgggaccgctggacggcggctggggcactaccgcgg gctctgcacggtcacgcgggggccgtcggggaccgcggtgttgtgtacatctcggggggc aaggcagggagaggcgagggcggagcgagcagcctccgggacttatacgtcctgggccct gaggagcaggtttggagcaagaaggcacccatgggcaccgcacgtttcgggcaccacatg gcagtgctgcgcggcgctgtgtttgcttttctggggcgatacgagcccttctctgagatc gagcgctacgaccccggcgccgaccagtggactcggttgcggccgctgccctacgaccgc ttctgctatgggctggccgtggtcgaggagacagcgttgctgctgggcggcctcaagtgg cgggactcgcgccaggtgcctacccgcaacgtagtgggctacgacctcgacctggaccgt tgggaggacatcggctgcgcgttgccctgggcctggagcggcctgcggtgcgcagtgctg cagctggccgagggtggggacgacgagagggagggagaggttggagaggcgctagatttg gtgctgggctga >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_5|277_aa MCWKKKKEKEEERKEERRKKERRKERKEGRKEGRKEGRKEGRKEGRKKEKERKKKKSKQA GRQEGKKARGQESKLASGGVIINQLYLYHVSDETKISLISIVHAYVRNSNSLEKYKFKKA DNHPQNTIQKRLSLTLREIKTIAEARDKLTSPMTKLKSNQKMKDRKKESPFPWRSQGTLC STSFLWKARYYPGLPAVIRRPKPLSVVPCFNGHAPCPLSCSSRTPGSSLKFLEDNGRRNS ESLKVFDLSDKYIEENADVCCLPSLLRLPEREGPPVP >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_5|834_bp atgtgctggaaaaaaaagaaagaaaaagaagaagaaagaaaggaagaaagaagaaagaaa gaaagaaggaaagaaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaa ggaaggaaggaaggaaggaagaaagaaaaagaaagaaagaaaaagaaaagcaagcaagca ggaaggcaggaaggcaagaaggcaagaggtcaagaaagcaagctagcttcaggtggggta atcattaaccagctttatttgtatcacgtctcagatgaaacaaaaattagtcttatcagc atagtgcatgcttatgtgaggaattcaaatagtttagaaaagtacaaatttaagaaagct gacaatcacccacaaaataccatccagaagcgactatcactgacattaagagagatcaag acaattgcggaagccagggacaagttgacaagtccaatgactaagctaaaatccaaccag aagatgaaagaccgcaaaaaggagtctccctttccttggaggagtcagggaacactctgc tccaccagcttcttgtggaaggctcgatattatccaggcctgcccgcagtcatccggagg cctaaacccctctctgtggtgccgtgcttcaatggtcacgctccttgtccactttcatgt tcctcccgtactcctggttcctctttgaagttcttagaagataacggtagaagaaatagt gaaagtcttaaagtctttgatctttctgataagtacatagaagaaaacgctgacgtatgc tgccttccctctctgcttcggctacctgaaagggaagggccccctgtcccatga >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_6|175_aa MIMVPRLKIRRVEATGICGSNGLIMGSPALISETPLRDSFYLSKCALGKGKMYIFGGLLD TDSEITLIPEDQESYCDPLIRWTEDKWSLDRSPPNDLIFEIKICYCKSASECQKICLSCK RTQVTINHNQSRTGLTENSVSSGTPLGNLPRADTINPTSLHSSLAISQNSHETVH >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_6|528_bp atgatcatggttcctagactgaaaattagaagagtcgaagccacagggatctgtggcagt aatggactgatcatgggatccccagccctcatcagtgagactccactgcgagactcattt taccttagtaaatgtgcactgggaaagggaaaaatgtacatctttgggggattattggac actgactctgagatcacactaattcctgaggaccaagaaagttactgtgacccattaatc agatggacagaagataaatggagtttggacaggagtccacctaacgatttaatatttgaa ataaaaatctgctattgcaaatcagcaagtgaatgccagaaaatctgtttaagttgcaag cgaacccaggtcaccatcaaccacaaccagtcaaggacaggtctaacagaaaactctgtt tctagtggaacaccacttggcaaccttccgagagcagacaccattaatccaacaagtcta cactcaagccttgccatttctcagaacagtcatgaaacagtccactga >gi568815575r:21555857_21757788|GENSCAN_predicted_peptide_7|302_aa MNMSKQPVSNVRAIQANINIPMGAFRPGAGQPPRRKECTPEVEEGVPPTSDEEKKPIPGA KKLPGPAVNLSEIQNIKSELKYVPKAEQYCSRYHLRLHGQRSGEPAEPLWAGVQTDFVSI SVCSRPLDQSIVVKELSDIKELKRNKISIKVVMTHQLQEKEMISKDVIELCLGREILGQY FSNEVGGPFRDREDWVLMDEEKFPTKSGAACGMLGLLEFYRDPGVGEAEGQGDLQEDTCN GSCFICHLMPGSFCKEESGLFCRLNKRSASKSRAIIAYTCDWPQSQQCLMSLSFPEKVPV WS >gi568815575r:21555857_21757788|GENSCAN_predicted_CDS_7|909_bp atgaatatgtcgaaacagccagtttccaatgttagagccatccaggcaaatatcaatatt ccaatgggagcctttcggccaggagcaggtcaaccccccagaagaaaagaatgtactcct gaagtggaggagggtgttcctcccacctcggatgaggagaagaagccaattccaggagcg aagaaacttccaggacctgcagtcaatctatcggaaatccagaatattaaaagtgaacta aaatatgtccccaaagctgaacagtattgtagcagataccacctcaggcttcatggacaa aggtctggggaacctgcagaaccgctctgggctggtgttcaaacagactttgtgtccatt tctgtttgttcccgacccttagatcagagcatagtggtaaaagaactcagtgatatcaaa gaattgaagaggaacaaaataagtatcaaagtggtaatgacgcatcaattacaagaaaaa gaaatgattagcaaagatgttattgaactctgccttggtagggaaatcttgggtcaatat ttctccaatgaagtcggtggcccctttagagacagggaggactgggttctgatggatgaa gagaaatttcccacaaaatctggagctgcctgtggcatgttgggccttcttgaattctac cgggacccgggagtgggggaagctgagggacagggtgatctgcaggaggacacctgcaat ggctcctgcttcatctgccacctcatgcctggctccttctgcaaagaagaatcaggttta ttttgtaggctgaacaagagatcagcttccaaaagcagggctatcattgcctacacctgt gactggccacagagtcagcagtgcctgatgagtttaagttttcccgaaaaggtgcctgtc tggtcatag