GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:53:53 Sequence gi568815595f:8633877_8845864 : 211988 bp : 44.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 793 866 74 0 2 69 94 45 0.483 3.75 1.02 Intr + 2882 2949 68 2 2 126 46 38 0.201 1.95 1.03 Intr + 6558 6684 127 1 1 68 76 50 0.152 1.54 1.04 Intr + 15867 15953 87 0 0 54 95 39 0.124 0.29 1.05 Intr + 18160 18293 134 1 2 57 12 129 0.382 2.49 1.06 Intr + 19050 19256 207 1 0 118 42 90 0.233 6.45 1.07 Intr + 23430 23557 128 1 2 42 75 6 0.170 -4.90 1.08 Intr + 24248 24357 110 1 2 63 89 77 0.257 4.38 1.09 Intr + 24587 24635 49 2 1 87 96 -15 0.022 -2.12 1.10 Intr + 26896 26964 69 0 0 67 117 26 0.013 2.78 1.11 Intr + 41182 41404 223 0 1 36 38 165 0.385 4.00 1.12 Intr + 41573 41667 95 0 2 32 82 113 0.437 4.78 1.13 Term + 50966 51049 84 1 0 101 49 74 0.358 2.45 1.14 PlyA + 53758 53763 6 1.05 2.00 Prom + 54040 54079 40 -6.86 2.01 Init + 54358 54793 436 0 1 45 -15 326 0.239 13.33 2.02 Intr + 54953 55279 327 0 0 96 72 77 0.110 2.57 2.03 Intr + 58369 58453 85 2 1 57 13 98 0.037 -1.92 2.04 Intr + 63285 63499 215 0 2 4 77 155 0.033 4.36 2.05 Intr + 75031 75219 189 2 0 106 93 -13 0.051 0.56 2.06 Intr + 76090 76202 113 2 2 76 81 -3 0.039 -2.10 2.07 Intr + 80346 80456 111 2 0 97 72 39 0.270 3.78 2.08 Intr + 81378 81482 105 2 0 83 61 64 0.349 3.61 2.09 Intr + 83269 83815 547 0 1 -98 58 304 0.001 1.26 2.10 Intr + 85394 86051 658 0 1 44 5 311 0.021 9.30 2.11 Intr + 86410 87561 1152 1 0 17 53 309 0.103 8.44 2.12 Intr + 99295 99457 163 1 1 99 49 123 0.467 9.48 2.13 Intr + 99938 100114 177 1 0 50 102 234 0.808 21.12 2.14 Intr + 104413 104568 156 0 0 87 42 74 0.625 2.91 2.15 Term + 111650 111991 342 1 0 100 42 618 0.737 52.91 2.16 PlyA + 112863 112868 6 1.05 3.03 PlyA - 116541 116536 6 1.05 3.02 Term - 119348 119101 248 0 2 94 46 355 0.999 27.85 3.01 Init - 119676 119631 46 0 1 50 80 59 0.871 2.15 3.00 Prom - 126492 126453 40 -5.86 4.02 PlyA - 130642 130637 6 1.05 4.01 Sngl - 134311 133274 1038 1 0 83 44 1991 0.982 191.33 4.00 Prom - 137906 137867 40 -4.66 5.00 Prom + 140068 140107 40 -4.36 5.01 Init + 143611 143660 50 0 2 65 99 40 0.760 3.22 5.02 Intr + 144830 144966 137 2 2 83 80 39 0.512 2.81 5.03 Intr + 170294 170400 107 0 2 39 41 130 0.095 3.33 5.04 Intr + 177194 177349 156 1 0 47 92 75 0.103 4.01 5.05 Term + 209623 209751 129 0 0 131 48 37 0.025 2.18 5.06 PlyA + 210493 210498 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 27694 27824 131 2 2 30 90 102 0.837 5.01 S.002 Sngl + 83377 84120 744 0 0 49 43 302 0.836 18.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:8633877_8845864|GENSCAN_predicted_peptide_1|484_aa MEEKGVASGSSRARSPREKPLMLPRYLQETPCTSNAILVSASWVTQTAKSKLLNLAGKVP TLDALTHKRGRKDSHLPTCQGKASYMCRPGHFLMGLLSVALALTKPMLNSELRLIPLHLA SQCELFYSVSRMHSDDQETVEVNHPSPEGKSNNHMIRMTHSVQSHLPGSGEETGLLFVLG IGYLNALKLCHCTWLPEPAPHPLGERESTTKNRNDSHKNGHLFGSKLRITWRCEWMNNQS KGWYKSWKVRRQRSNTKWHEEAQDSAPTLKQLSTQLLHQKTGDDPHGDIYYRLRDDDMCK QLAECLWLAESQPASIGDWLGPPGWVGQSTYKNCAQSLFNAQLFGYESTEPPRTKDYRPV QDLCLLNQATLTLHPTVPNPSTLLGLLPAEDSWFTCLDLKDAFFPIRLSPERQKLFAFQW EDPESVGCPKGTDALHQHLEDCGYKASKKKAQICRQQGIQNNRTGGVYTLCDIESHIILF RSGY >gi568815595f:8633877_8845864|GENSCAN_predicted_CDS_1|1455_bp atggaggagaagggggttgcttctggcagcagcagggccagaagcccccgggagaagcct ctcatgctgcccaggtatcttcaagaaaccccttgcacatctaatgccatcttggtgtct gcttcatgggtgacccaaactgctaaatccaagctcctgaatttggcaggtaaggtccct acattagatgcccttacccacaagagaggaagaaaggattctcatctgcccacatgccaa ggaaaagcctcctacatgtgcagaccaggtcacttcctcatgggcctcctatctgttgcc ctcgccctgacaaagcccatgctcaattcagaactcagactgatccctctacaccttgcc agccagtgtgagctgttctacagcgtgtcccgcatgcacagtgatgaccaggaaacggtg gaggtcaaccatccatccccagaaggaaaatcaaacaatcacatgatcaggatgacacac agtgtccaaagccatctccctgggagtggggaggagactgggctcctctttgtcctgggg atcggctacctaaatgccctgaagctctgccattgtacctggcttccagagccagctcca catccacttggagaaagagaatccacaaccaaaaaccgaaatgatagccacaagaatgga cacctttttggatctaaattgagaatcacatggagatgtgagtggatgaataatcaatcc aaagggtggtacaagtcatggaaggttaggaggcagaggagtaatacaaagtggcatgaa gaagcccaagactcagcccctaccctcaagcagctttcaacccagcttctacatcagaaa actggggatgaccctcatggagatatttattacaggctaagagatgacgacatgtgtaag cagttagcggagtgcctgtggctagcagaatctcaaccagcctccattggcgactggctt gggccgccaggctgggtggggcagtcaacttataaaaactgtgctcagtctttgttcaat gctcagctttttggatatgaatccactgagccgccacggaccaaggactaccggccagta caggatttgtgcttgcttaatcaagctacactgactttacatccaacagtacctaacccg tccacattgttgggtttgctgccagctgaggacagctggttcacctgcttggacctgaaa gacgctttctttcctatcagattatcccctgagaggcagaagctgtttgcctttcagtgg gaagatccggagtcagtcgggtgtcccaagggaacagatgccctacaccagcacctggag gactgtgggtataaggcgtccaagaagaaagctcagatctgccgacagcagggtattcag aacaataggacaggaggggtgtacaccctctgcgatattgagagtcatatcatcctcttt cgctctggatattag >gi568815595f:8633877_8845864|GENSCAN_predicted_peptide_2|1591_aa MGLSEDPELQPVLAGLSLSMCLVTVLRNLLSILAVSSDSHLHTPMYFFLSNLCWADIGFT SATVPKIIVDMQSHSRVISYVGCLTRMSFLVLFACIEDMLLTVMAYDCFVAICRPLHYPV IVNPHLRVFLVLVSFFLSLLDSQLHRILLSYCKIVPSILRISTSDGKYKAFSTCGSHLAL VCLFYGAGIGVYLTSAVSPPPRNGVVVSVMYTVVTPMLNPFIYSLRNRDIQSTLRRLLSR TVESHDLFHPFSCVGKRKSNQECDTTHLDGCDKQTSIGETRGKEYTLCDSRSNLTQGYKE QIQRMHTCCDISSNIHLGYYEYYHNVNTPCDIRSNIPLQYWEPYHTVYTPCDIREDPGFQ HPEIVQRKRERETPLVPPPWDVPKDHAVAPTPTPRFLVYKGPPCLQSPLRLTRVLRKGRP MNRGSRWPSVAAVTAQHKAKHFYLLYHSKAASACKGSNSPKQAPGPKESPRQGNIECQAQ GIRPSGHLDTVRLQELPPLHQEELTYQGDSLETACPIKAMIPNLTLQLREDIQTKGKEVE NFEKNLEECITRITNTEKCLKELMELKTKARELRKECSSLRSRCHQLEERVSAMEDEMNE MKWEGKFREKRIKRNKQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIVQNFPN LARQTNVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKEIQTTIREYYK HLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGF TAEFYQRYKEELVPFLLQLFQSIEKEGILPNSFYEASIILIPKAGRDTTKKENFRPISLM NVDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMI ISIDAEKAFDKIQQPFMLKTLNKLGIDGTNRQTESQIMSELPFTIASKRIKYLAIQLTRD VKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRVNAIPIKLPMTF FTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWDQNRDI DQWNRTEPSEIMPHIYNCLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPY TKINSRWIKDLNVRPKTIKTLEENLGNTIQDTGMGKDFMSKTPKAMATKDKIDKWDLIKL KSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELQQIYKKKTNNPIKKWAKDM DRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVSMAIIKKSGNNSLAPHHSP STPAPGGNLRNPQSSDLLQVTKQQGQALAIQREAPLHRIPAPEAIPWYFQPQPATQLGSP PVDPPSSAMMAEEHTDLEAQIVKDIHCKEIDLVNRDPKNINEDIVKNACLLQKNKHPFAP RLRLHNTQRNQMPTSPTNTCSRFVLIRDTLLHKEETDLVDFEDVIAEPVGTYSFDGVWKV SYTTFTVSKYWCYRLLSTLLGVPLALLWGFLFACISFCHIWAVVPCIKSYLIEIQCISHI YSLCIRTFCNPLFAALGQVCSSIKVVLRKEV >gi568815595f:8633877_8845864|GENSCAN_predicted_CDS_2|4776_bp atgggactctcagaggatccagaactgcagcccgtcctcgctgggctgtccctgtccatg tgtctggtcacggtgctgaggaacctgctcagcatcctggctgtcagctctgactcccac ctccacacccccatgtacttcttcctctccaacctgtgctgggctgacatcggtttcacc tcggccacggttcccaagataattgtggacatgcagtcgcatagcagagtcatctcttat gtgggctgcctgacacggatgtcttttttggtcctttttgcatgtatagaagacatgctt ctgactgtgatggcctatgactgctttgtagccatctgtcgccctctacactacccagtc atcgtgaatcctcacctccgtgtcttcttagttttggtgtcctttttccttagcctgttg gattcccagctgcacaggatccttttgtcttactgtaaaattgttccctccattctaagg atttcaacatcagatgggaaatataaagccttctccacctgtggctctcacctggcactt gtttgcttattttatggagcaggcattggcgtgtacctgacttcagctgtgtcaccaccc cccaggaatggtgtggtggtgtcagtgatgtacactgtggtcacccccatgctgaaccct ttcatctacagcctgagaaacagggacattcaaagcaccctgaggaggctgctcagcaga acagtcgaatctcatgatctgttccatcctttttcttgtgtggggaagcgcaaatccaac caagaatgtgataccacacatttggatggatgtgataaacaaacgagcattggtgaaact agagggaaggagtacaccctgtgtgacagtaggagtaacctcacccaaggatataaggaa caaatacagaggatgcacacgtgttgtgacattagcagtaacatccatttaggatattac gaatattaccacaatgtgaataccccctgtgatattaggagtaacatccccctacaatat tgggaaccatatcacacggtgtacaccccctgtgacattagagaagacccaggctttcag cacccagagattgtgcagaggaagagagagagagagacaccactggtgcccccaccttgg gacgtgcctaaagatcatgcggtagcccctacccccacccccaggtttctggtttataaa ggacctccatgtcttcaatctcccctgagactcacaagagtcctgagaaaagggaggccc atgaaccgaggaagcaggtggccatctgtggcagctgtcacagcccagcataaagctaag cacttctacttattatatcattcaaaagctgcctcagcctgcaagggcagtaattcccca aagcaggctccagggcccaaggaaagccctcggcagggaaacattgagtgtcaggcacag ggaatcaggccatcgggtcacctggacaccgtaaggctccaagagctgccccctctccac caggaggaactgacttaccaaggtgacagtctagagactgcttgcccaattaaagccatg atccccaatctcacactccaactacgggaggacattcaaaccaaaggcaaagaagttgaa aactttgaaaaaaatttagaagaatgtataactagaataaccaatacggagaagtgctta aaggagctgatggagctgaaaaccaaggctcgagaactacgtaaagaatgcagtagcctc aggagccgatgccatcaactggaagaaagggtatcagcaatggaagatgaaatgaatgaa atgaagtgggaagggaagtttagagaaaaaagaataaaaagaaataagcaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctacgtctcattggtgtacctgaaagtgac ggggagaatggaactaagttggaaaacactctgcaggatatcgtccagaacttccccaat ctagcaaggcagaccaatgttcagattcaggaaatacagagaacgccacaaagatactcc tcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaa aaaatgttaagggcagccagagagaaagaaatacaaactaccatcagagaatactacaaa cacctctacgcaaataaactagaaaatctagaggaaatggataaattccttgacacatac actctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctct gaaattgtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattc acagccgaattctaccagaggtacaaggaggaactggtaccattccttctacaactattc caatcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctg ataccaaaggctggcagagacacaaccaaaaaagagaattttagaccaatatccttgatg aacgttgatgcaaaaatcctcaataaaatattggcaaaacgaatccagcagcacatcaaa aagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatata cgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgatt atctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaact ctcaataaattaggtattgatgggaccaacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacctagcaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttacagagtcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatc gccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactgggaccaaaacagagatata gatcaatggaacagaacagagccctcagaaataatgccacatatctacaactgtctgatc tttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaacc ctagaagaaaacctaggcaataccattcaggacacaggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaacta aagagcttctgtacagcaaaagaaactaccatcagagtgaacaggcaacccacaaaatgg gagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatg gacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctca ccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca gttagcatggcaatcattaaaaagtcaggaaacaacagcctggccccacaccacagtcca tcaactcctgctccaggtggaaacctcagaaacccacaaagctcggacctgctccaggtc accaagcagcagggccaggcactggcaatccaacgagaagccccactacatcgcattcca gctcctgaagccattccctggtattttcagccccagccggccacacagctcggatctcct cctgtggatccccccagctctgcgatgatggcagaagagcacacagatctcgaggcccag atcgtcaaggatatccactgcaaggagattgacctggtgaaccgagaccccaagaacatt aacgaggacatagtcaagaacgcctgccttcttcagaaaaacaaacacccatttgccccc cggctcaggctacacaacacccagaggaaccagatgcccaccagccctacaaacacttgt tccaggtttgtgctcatcagggacaccctgttgcacaaagaagaaactgacctggtggat tttgaagacgtgatcgcagagcctgtgggcacctacagctttgacggcgtgtggaaggtg agctacaccaccttcactgtctccaagtactggtgctaccgtctgttgtccacgctgctg ggcgtcccactggccctgctctggggcttcctgttcgcctgcatctccttctgccacatc tgggcggtggtgccatgcattaagagctacctgatcgagatccagtgcatcagccacatc tactcactctgcatccgcaccttctgcaacccactcttcgcggccctgggccaggtctgc agcagcatcaaggtggtgctgcggaaggaggtctaa >gi568815595f:8633877_8845864|GENSCAN_predicted_peptide_3|97_aa MHEKVAVMDSASAEGASAFIIVMLLASLNSCCNPWIYMLFTGHLFHELVQRFLCCSASYL KGRRLGETSASKKSNSSSFVLSHRSSSQRSCSQPSTA >gi568815595f:8633877_8845864|GENSCAN_predicted_CDS_3|294_bp atgcatgaaaaggtggctgtgatggattcagcttctgctgagggagcctcggccttcatc atcgtcatgctcctggccagcctcaacagctgctgcaacccctggatctacatgctgttc acgggccacctcttccacgaactcgtgcagcgcttcctgtgctgctccgccagctacctg aagggcagacgcctgggagagacgagtgccagcaaaaagagcaactcgtcctcctttgtc ctgagccatcgcagctccagccagaggagctgctcccagccatccacggcgtga >gi568815595f:8633877_8845864|GENSCAN_predicted_peptide_4|345_aa MEGALAANWSAEAANASAAPPGAEGNRTAGPPRRNEALARVEVAVLCLILLLALSGNACV LLALRTTRQKHSRLFFFMKHLSIADLVVAVFQVLPQLLWDITFRFYGPDLLCRLVKYLQV VGMFASTYLLLLMSLDRCLAICQPLRSLRRRTDRLAVLATWLGCLVASAPQVHIFSLREV ADGVFDCWAVFIQPWGPKAYITWITLAVYIVPVIVLAACYGLISFKIWQNLRLKTAAAAA AEAPEGAAAGDGGRVALARVSSVKLISKAKIRTVKMTFIIVLAFIVCWTPFFFVQMWSVW DANAPKEGSQGWETQEEGAWWLGEALILLPQNVQGSVDFLGDKRV >gi568815595f:8633877_8845864|GENSCAN_predicted_CDS_4|1038_bp atggagggcgcgctcgcagccaactggagcgccgaggcagccaacgccagcgccgcgccg ccgggggccgagggcaaccgcaccgccggacccccgcggcgcaacgaggccctggcgcgc gtggaggtggcggtgctgtgtctcatcctgctcctggcgctgagcgggaacgcgtgtgtg ctgctggcgctgcgcaccacacgccagaagcactcgcgcctcttcttcttcatgaagcac ctaagcatcgccgacctggtggtggcagtgtttcaggtgctgccgcagttgctgtgggac atcaccttccgcttctacgggcccgacctgctgtgccgcctggtcaagtacttgcaggtg gtgggcatgttcgcctccacctacctgctgctgctcatgtccctggaccgctgcctggcc atctgccagccgctgcgctcgctgcgccgccgcaccgaccgcctggcagtgctcgccacg tggctcggctgcctggtggccagcgcgccgcaggtgcacatcttctctctgcgcgaggtg gctgacggcgtcttcgactgctgggccgtcttcatccagccctggggacccaaggcctac atcacatggatcacgctagctgtctacatcgtgccggtcatcgtgctcgctgcctgctac ggccttatcagcttcaagatctggcagaacttgcggctcaagaccgctgcagcggcggcg gccgaggcgccagagggcgcggcggctggcgatggggggcgcgtggccctggcgcgtgtc agcagcgtcaagctcatctccaaggccaagatccgcacggtcaagatgactttcatcatc gtgctggccttcatcgtgtgctggacgcctttcttcttcgtgcagatgtggagcgtctgg gatgccaacgcgcccaaggaaggtagccagggctgggagacccaggaggagggagcctgg tggctgggggaggcccttatcttgctgcctcagaatgtccaggggtctgtggacttcctg ggggataagcgggtttga >gi568815595f:8633877_8845864|GENSCAN_predicted_peptide_5|192_aa MEKENTPEESEALKHFCTNPYSQSLADSSQFHCTAASVALAHPICLLSTHADLPVPGLAP QAEWPKSKTLKTPNAGEDVEQKKVSSIAGGKAKWYNYFAFVGFSCPPQGSQHHAGKSAGP CSGDGAPETLPYLDKSLKPLSLHSLICEKQVAPESLQPNPKDFKTVYKRTDPSVCPKCPA KLVQLRTLFSSS >gi568815595f:8633877_8845864|GENSCAN_predicted_CDS_5|579_bp atggaaaaagagaataccccggaggaatctgaagctctcaaacacttctgcacaaacccg tactcccagtccttagcagactcttctcaattccactgcacagctgcttctgtggccctg gcccaccctatctgcctcctttccacccacgctgatcttcctgtaccaggcttggctccc caagcagaatggccaaaatccaaaacactgaaaacaccaaatgctggtgaggatgtggag caaaagaaagtctcatccattgctggtgggaaagcaaaatggtacaactactttgccttc gtgggctttagctgccctccccagggcagtcagcaccatgctggcaagagtgcaggtccg tgctcaggagatggggcaccagaaaccctcccctatctggacaagtcactgaagcccctg agcctccattccctcatctgtgaaaagcaggtggcccctgagtcattacaaccaaaccca aaggacttcaagacagtgtacaaacgaactgaccccagtgtttgtccaaaatgtcctgcc aagctggttcaactcagaacgctgtttagctccagctga