GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:56:09 Sequence gi568815595r:8652980_8868187 : 215208 bp : 44.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5145 5254 110 2 2 63 89 77 0.230 4.38 1.02 Intr + 6001 6062 62 1 2 78 88 20 0.221 -0.62 1.03 Intr + 8591 8721 131 0 2 30 90 102 0.831 5.01 1.04 Intr + 22079 22301 223 1 1 36 38 165 0.385 4.00 1.05 Intr + 22470 22564 95 1 2 32 82 113 0.437 4.78 1.06 Term + 31863 31946 84 2 0 101 49 74 0.358 2.45 1.07 PlyA + 34655 34660 6 1.05 2.00 Prom + 34937 34976 40 -6.86 2.01 Init + 35255 35690 436 1 1 45 -15 326 0.239 13.33 2.02 Intr + 35850 36176 327 1 0 96 72 77 0.110 2.57 2.03 Intr + 39266 39350 85 0 1 57 13 98 0.037 -1.92 2.04 Intr + 44182 44396 215 1 2 4 77 155 0.033 4.36 2.05 Intr + 55928 56116 189 0 0 106 93 -13 0.051 0.56 2.06 Intr + 56987 57099 113 0 2 76 81 -3 0.039 -2.10 2.07 Intr + 61243 61353 111 0 0 97 72 39 0.270 3.78 2.08 Intr + 62275 62379 105 0 0 83 61 64 0.349 3.61 2.09 Intr + 64166 64712 547 1 1 -98 58 304 0.001 1.26 2.10 Intr + 66291 66948 658 1 1 44 5 311 0.021 9.30 2.11 Intr + 67307 68458 1152 2 0 17 53 309 0.103 8.44 2.12 Intr + 80192 80354 163 2 1 99 49 123 0.467 9.48 2.13 Intr + 80835 81011 177 2 0 50 102 234 0.808 21.12 2.14 Intr + 85310 85465 156 1 0 87 42 74 0.625 2.91 2.15 Term + 92547 92888 342 2 0 100 42 618 0.737 52.91 2.16 PlyA + 93760 93765 6 1.05 3.03 PlyA - 97438 97433 6 1.05 3.02 Term - 100245 99998 248 1 2 94 46 355 0.999 27.85 3.01 Init - 100573 100528 46 1 1 50 80 59 0.871 2.15 3.00 Prom - 107389 107350 40 -5.86 4.02 PlyA - 111539 111534 6 1.05 4.01 Sngl - 115208 114171 1038 2 0 83 44 1991 0.982 191.33 4.00 Prom - 118803 118764 40 -4.66 5.00 Prom + 120965 121004 40 -4.36 5.01 Init + 124508 124557 50 1 2 65 99 40 0.760 3.22 5.02 Intr + 125727 125863 137 0 2 83 80 39 0.512 2.81 5.03 Intr + 151191 151297 107 1 2 39 41 130 0.095 3.33 5.04 Intr + 158091 158246 156 2 0 47 92 75 0.104 4.01 5.05 Term + 162927 162968 42 2 0 91 54 16 0.025 -4.34 5.06 PlyA + 164439 164444 6 1.05 6.04 PlyA - 165950 165945 6 1.05 6.03 Term - 169562 169456 107 0 2 107 42 63 0.138 2.17 6.02 Intr - 187318 187217 102 2 0 95 75 19 0.431 1.45 6.01 Init - 202062 201921 142 0 1 86 82 94 0.902 8.90 6.00 Prom - 214764 214725 40 -3.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 64274 65017 744 1 0 49 43 302 0.836 18.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_1|234_aa LLHQKTGDDPHGDIYYRLRDDDMCKQLAECLWLAESHSPKRYYYKAHFLEKETMVQRGGV RDSRIRDKGLYYHSNSNSQTIKVFAPVPSASVSTGPHEEGQPRTKDYRPVQDLCLLNQAT LTLHPTVPNPSTLLGLLPAEDSWFTCLDLKDAFFPIRLSPERQKLFAFQWEDPESVGCPK GTDALHQHLEDCGYKASKKKAQICRQQGIQNNRTGGVYTLCDIESHIILFRSGY >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_1|705_bp cttctacatcagaaaactggggatgaccctcatggagatatttattacaggctaagagat gacgacatgtgtaagcagttagcggagtgcctgtggctagcagaatctcacagccctaag aggtactattacaaggcccattttctagaaaaggaaactatggttcagagaggaggcgtg agagactcccgaatccgagacaaaggactttattatcacagcaacagcaatagccagaca atcaaggtttttgcaccagtgccctcagcctcagtttccacaggaccacatgaagagggc cagccacggaccaaggactaccggccagtacaggatttgtgcttgcttaatcaagctaca ctgactttacatccaacagtacctaacccgtccacattgttgggtttgctgccagctgag gacagctggttcacctgcttggacctgaaagacgctttctttcctatcagattatcccct gagaggcagaagctgtttgcctttcagtgggaagatccggagtcagtcgggtgtcccaag ggaacagatgccctacaccagcacctggaggactgtgggtataaggcgtccaagaagaaa gctcagatctgccgacagcagggtattcagaacaataggacaggaggggtgtacaccctc tgcgatattgagagtcatatcatcctctttcgctctggatattag >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_2|1591_aa MGLSEDPELQPVLAGLSLSMCLVTVLRNLLSILAVSSDSHLHTPMYFFLSNLCWADIGFT SATVPKIIVDMQSHSRVISYVGCLTRMSFLVLFACIEDMLLTVMAYDCFVAICRPLHYPV IVNPHLRVFLVLVSFFLSLLDSQLHRILLSYCKIVPSILRISTSDGKYKAFSTCGSHLAL VCLFYGAGIGVYLTSAVSPPPRNGVVVSVMYTVVTPMLNPFIYSLRNRDIQSTLRRLLSR TVESHDLFHPFSCVGKRKSNQECDTTHLDGCDKQTSIGETRGKEYTLCDSRSNLTQGYKE QIQRMHTCCDISSNIHLGYYEYYHNVNTPCDIRSNIPLQYWEPYHTVYTPCDIREDPGFQ HPEIVQRKRERETPLVPPPWDVPKDHAVAPTPTPRFLVYKGPPCLQSPLRLTRVLRKGRP MNRGSRWPSVAAVTAQHKAKHFYLLYHSKAASACKGSNSPKQAPGPKESPRQGNIECQAQ GIRPSGHLDTVRLQELPPLHQEELTYQGDSLETACPIKAMIPNLTLQLREDIQTKGKEVE NFEKNLEECITRITNTEKCLKELMELKTKARELRKECSSLRSRCHQLEERVSAMEDEMNE MKWEGKFREKRIKRNKQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIVQNFPN LARQTNVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKEIQTTIREYYK HLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGF TAEFYQRYKEELVPFLLQLFQSIEKEGILPNSFYEASIILIPKAGRDTTKKENFRPISLM NVDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMI ISIDAEKAFDKIQQPFMLKTLNKLGIDGTNRQTESQIMSELPFTIASKRIKYLAIQLTRD VKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRVNAIPIKLPMTF FTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWDQNRDI DQWNRTEPSEIMPHIYNCLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPY TKINSRWIKDLNVRPKTIKTLEENLGNTIQDTGMGKDFMSKTPKAMATKDKIDKWDLIKL KSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELQQIYKKKTNNPIKKWAKDM DRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVSMAIIKKSGNNSLAPHHSP STPAPGGNLRNPQSSDLLQVTKQQGQALAIQREAPLHRIPAPEAIPWYFQPQPATQLGSP PVDPPSSAMMAEEHTDLEAQIVKDIHCKEIDLVNRDPKNINEDIVKNACLLQKNKHPFAP RLRLHNTQRNQMPTSPTNTCSRFVLIRDTLLHKEETDLVDFEDVIAEPVGTYSFDGVWKV SYTTFTVSKYWCYRLLSTLLGVPLALLWGFLFACISFCHIWAVVPCIKSYLIEIQCISHI YSLCIRTFCNPLFAALGQVCSSIKVVLRKEV >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_2|4776_bp atgggactctcagaggatccagaactgcagcccgtcctcgctgggctgtccctgtccatg tgtctggtcacggtgctgaggaacctgctcagcatcctggctgtcagctctgactcccac ctccacacccccatgtacttcttcctctccaacctgtgctgggctgacatcggtttcacc tcggccacggttcccaagataattgtggacatgcagtcgcatagcagagtcatctcttat gtgggctgcctgacacggatgtcttttttggtcctttttgcatgtatagaagacatgctt ctgactgtgatggcctatgactgctttgtagccatctgtcgccctctacactacccagtc atcgtgaatcctcacctccgtgtcttcttagttttggtgtcctttttccttagcctgttg gattcccagctgcacaggatccttttgtcttactgtaaaattgttccctccattctaagg atttcaacatcagatgggaaatataaagccttctccacctgtggctctcacctggcactt gtttgcttattttatggagcaggcattggcgtgtacctgacttcagctgtgtcaccaccc cccaggaatggtgtggtggtgtcagtgatgtacactgtggtcacccccatgctgaaccct ttcatctacagcctgagaaacagggacattcaaagcaccctgaggaggctgctcagcaga acagtcgaatctcatgatctgttccatcctttttcttgtgtggggaagcgcaaatccaac caagaatgtgataccacacatttggatggatgtgataaacaaacgagcattggtgaaact agagggaaggagtacaccctgtgtgacagtaggagtaacctcacccaaggatataaggaa caaatacagaggatgcacacgtgttgtgacattagcagtaacatccatttaggatattac gaatattaccacaatgtgaataccccctgtgatattaggagtaacatccccctacaatat tgggaaccatatcacacggtgtacaccccctgtgacattagagaagacccaggctttcag cacccagagattgtgcagaggaagagagagagagagacaccactggtgcccccaccttgg gacgtgcctaaagatcatgcggtagcccctacccccacccccaggtttctggtttataaa ggacctccatgtcttcaatctcccctgagactcacaagagtcctgagaaaagggaggccc atgaaccgaggaagcaggtggccatctgtggcagctgtcacagcccagcataaagctaag cacttctacttattatatcattcaaaagctgcctcagcctgcaagggcagtaattcccca aagcaggctccagggcccaaggaaagccctcggcagggaaacattgagtgtcaggcacag ggaatcaggccatcgggtcacctggacaccgtaaggctccaagagctgccccctctccac caggaggaactgacttaccaaggtgacagtctagagactgcttgcccaattaaagccatg atccccaatctcacactccaactacgggaggacattcaaaccaaaggcaaagaagttgaa aactttgaaaaaaatttagaagaatgtataactagaataaccaatacggagaagtgctta aaggagctgatggagctgaaaaccaaggctcgagaactacgtaaagaatgcagtagcctc aggagccgatgccatcaactggaagaaagggtatcagcaatggaagatgaaatgaatgaa atgaagtgggaagggaagtttagagaaaaaagaataaaaagaaataagcaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctacgtctcattggtgtacctgaaagtgac ggggagaatggaactaagttggaaaacactctgcaggatatcgtccagaacttccccaat ctagcaaggcagaccaatgttcagattcaggaaatacagagaacgccacaaagatactcc tcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaa aaaatgttaagggcagccagagagaaagaaatacaaactaccatcagagaatactacaaa cacctctacgcaaataaactagaaaatctagaggaaatggataaattccttgacacatac actctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctct gaaattgtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattc acagccgaattctaccagaggtacaaggaggaactggtaccattccttctacaactattc caatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctg ataccaaaggctggcagagacacaaccaaaaaagagaattttagaccaatatccttgatg aacgttgatgcaaaaatcctcaataaaatattggcaaaacgaatccagcagcacatcaaa aagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatata cgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgatt atctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaact ctcaataaattaggtattgatgggaccaacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacctagcaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttacagagtcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatc gccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactgggaccaaaacagagatata gatcaatggaacagaacagagccctcagaaataatgccacatatctacaactgtctgatc tttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacacaggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaacta aagagcttctgtacagcaaaagaaactaccatcagagtgaacaggcaacccacaaaatgg gagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatg gacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctca ccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca gttagcatggcaatcattaaaaagtcaggaaacaacagcctggccccacaccacagtcca tcaactcctgctccaggtggaaacctcagaaacccacaaagctcggacctgctccaggtc accaagcagcagggccaggcactggcaatccaacgagaagccccactacatcgcattcca gctcctgaagccattccctggtattttcagccccagccggccacacagctcggatctcct cctgtggatccccccagctctgcgatgatggcagaagagcacacagatctcgaggcccag atcgtcaaggatatccactgcaaggagattgacctggtgaaccgagaccccaagaacatt aacgaggacatagtcaagaacgcctgccttcttcagaaaaacaaacacccatttgccccc cggctcaggctacacaacacccagaggaaccagatgcccaccagccctacaaacacttgt tccaggtttgtgctcatcagggacaccctgttgcacaaagaagaaactgacctggtggat tttgaagacgtgatcgcagagcctgtgggcacctacagctttgacggcgtgtggaaggtg agctacaccaccttcactgtctccaagtactggtgctaccgtctgttgtccacgctgctg ggcgtcccactggccctgctctggggcttcctgttcgcctgcatctccttctgccacatc tgggcggtggtgccatgcattaagagctacctgatcgagatccagtgcatcagccacatc tactcactctgcatccgcaccttctgcaacccactcttcgcggccctgggccaggtctgc agcagcatcaaggtggtgctgcggaaggaggtctaa >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_3|97_aa MHEKVAVMDSASAEGASAFIIVMLLASLNSCCNPWIYMLFTGHLFHELVQRFLCCSASYL KGRRLGETSASKKSNSSSFVLSHRSSSQRSCSQPSTA >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_3|294_bp atgcatgaaaaggtggctgtgatggattcagcttctgctgagggagcctcggccttcatc atcgtcatgctcctggccagcctcaacagctgctgcaacccctggatctacatgctgttc acgggccacctcttccacgaactcgtgcagcgcttcctgtgctgctccgccagctacctg aagggcagacgcctgggagagacgagtgccagcaaaaagagcaactcgtcctcctttgtc ctgagccatcgcagctccagccagaggagctgctcccagccatccacggcgtga >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_4|345_aa MEGALAANWSAEAANASAAPPGAEGNRTAGPPRRNEALARVEVAVLCLILLLALSGNACV LLALRTTRQKHSRLFFFMKHLSIADLVVAVFQVLPQLLWDITFRFYGPDLLCRLVKYLQV VGMFASTYLLLLMSLDRCLAICQPLRSLRRRTDRLAVLATWLGCLVASAPQVHIFSLREV ADGVFDCWAVFIQPWGPKAYITWITLAVYIVPVIVLAACYGLISFKIWQNLRLKTAAAAA AEAPEGAAAGDGGRVALARVSSVKLISKAKIRTVKMTFIIVLAFIVCWTPFFFVQMWSVW DANAPKEGSQGWETQEEGAWWLGEALILLPQNVQGSVDFLGDKRV >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_4|1038_bp atggagggcgcgctcgcagccaactggagcgccgaggcagccaacgccagcgccgcgccg ccgggggccgagggcaaccgcaccgccggacccccgcggcgcaacgaggccctggcgcgc gtggaggtggcggtgctgtgtctcatcctgctcctggcgctgagcgggaacgcgtgtgtg ctgctggcgctgcgcaccacacgccagaagcactcgcgcctcttcttcttcatgaagcac ctaagcatcgccgacctggtggtggcagtgtttcaggtgctgccgcagttgctgtgggac atcaccttccgcttctacgggcccgacctgctgtgccgcctggtcaagtacttgcaggtg gtgggcatgttcgcctccacctacctgctgctgctcatgtccctggaccgctgcctggcc atctgccagccgctgcgctcgctgcgccgccgcaccgaccgcctggcagtgctcgccacg tggctcggctgcctggtggccagcgcgccgcaggtgcacatcttctctctgcgcgaggtg gctgacggcgtcttcgactgctgggccgtcttcatccagccctggggacccaaggcctac atcacatggatcacgctagctgtctacatcgtgccggtcatcgtgctcgctgcctgctac ggccttatcagcttcaagatctggcagaacttgcggctcaagaccgctgcagcggcggcg gccgaggcgccagagggcgcggcggctggcgatggggggcgcgtggccctggcgcgtgtc agcagcgtcaagctcatctccaaggccaagatccgcacggtcaagatgactttcatcatc gtgctggccttcatcgtgtgctggacgcctttcttcttcgtgcagatgtggagcgtctgg gatgccaacgcgcccaaggaaggtagccagggctgggagacccaggaggagggagcctgg tggctgggggaggcccttatcttgctgcctcagaatgtccaggggtctgtggacttcctg ggggataagcgggtttga >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_5|163_aa MEKENTPEESEALKHFCTNPYSQSLADSSQFHCTAASVALAHPICLLSTHADLPVPGLAP QAEWPKSKTLKTPNAGEDVEQKKVSSIAGGKAKWYNYFAFVGFSCPPQGSQHHAGKSAGP CSGDGAPETLPYLDKSLKPLSLHSLICEKQSCDSNSEELRPKL >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_5|492_bp atggaaaaagagaataccccggaggaatctgaagctctcaaacacttctgcacaaacccg tactcccagtccttagcagactcttctcaattccactgcacagctgcttctgtggccctg gcccaccctatctgcctcctttccacccacgctgatcttcctgtaccaggcttggctccc caagcagaatggccaaaatccaaaacactgaaaacaccaaatgctggtgaggatgtggag caaaagaaagtctcatccattgctggtgggaaagcaaaatggtacaactactttgccttc gtgggctttagctgccctccccagggcagtcagcaccatgctggcaagagtgcaggtccg tgctcaggagatggggcaccagaaaccctcccctatctggacaagtcactgaagcccctg agcctccattccctcatctgtgaaaagcagtcatgtgattctaattcagaggaactaaga cctaagctttga >gi568815595r:8652980_8868187|GENSCAN_predicted_peptide_6|116_aa MGGCKEAACDRVKSGQFWDQDPKGHNVFVNPPQKAFHRNFVLQNSSPEDFTFESLLNSKT EQLPEGFCCCDQTLSAGETLSGHNPGTDSEPCSPIDDSLLSLVNSIIKTIQPVFQE >gi568815595r:8652980_8868187|GENSCAN_predicted_CDS_6|351_bp atggggggctgtaaggaggcagcatgtgacagggtgaagagtgggcagttttgggatcaa gatcccaaggggcacaacgtctttgtaaatcctccacaaaaggccttccacagaaacttt gtacttcaaaattccagccctgaggactttacatttgaaagccttttaaactccaagact gaacaactgccagagggattttgctgctgtgaccagacactttcagctggagagacactt tcaggacacaatccagggactgattcagaaccttgttcaccaatagatgattccttgctt tccttggtaaactccatcatcaaaaccatccaaccagtcttccaggagtag