GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:59:24

Sequence gi568815592f:151352546_151569107 : 216562 bp : 41.78% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +     48   1195 1148  1  2   40   41  1360 0.980 117.59
 1.02 PlyA +   1455   1460    6                               1.05

 2.03 PlyA -   3155   3150    6                               1.05
 2.02 Term -  14347  12976 1372  0  1   91   42  1320 0.955 117.52
 2.01 Init -  21092  20920  173  2  2   57   93   111 0.963   7.47
 2.00 Prom -  22937  22898   40                             -10.05

 3.00 Prom +  25793  25832   40                              -5.75
 3.01 Init +  31927  31964   38  0  2   85   93    13 0.433   1.03
 3.02 Intr +  35710  35758   49  1  1  100  113    26 0.417   4.06
 3.03 Term +  38259  39059  801  2  0   87   45   426 0.992  30.45
 3.04 PlyA +  40750  40755    6                               1.05

 4.04 PlyA -  41650  41645    6                               1.05
 4.03 Term -  45077  45060   18  2  0  111   49    24 0.409  -1.86
 4.02 Intr -  53291  53175  117  2  0   81   94   103 0.048   9.94
 4.01 Init -  57713  57711    3  2  0   92   81     0 0.037  -0.25
 4.00 Prom -  58281  58242   40                              -3.25

 5.07 PlyA -  58420  58415    6                               1.05
 5.06 Term -  61360  61218  143  2  2   97   42    78 0.023   1.21
 5.05 Intr -  71086  70980  107  0  2   50  119    69 0.410   5.14
 5.04 Intr -  75037  74937  101  1  2   93   63    93 0.371   5.29
 5.03 Intr -  77632  77593   40  0  1   44   93    39 0.715  -2.69
 5.02 Intr -  80685  80610   76  1  1   71  110    67 0.545   4.85
 5.01 Init -  84093  83901  193  0  1   72   65    87 0.543   3.99
 5.00 Prom -  88340  88301   40                              -3.25

 6.00 Prom +  93575  93614   40                              -3.05
 6.01 Init +  93965  94012   48  1  0   60   92    21 0.422   0.70
 6.02 Intr +  99922 100041  120  0  0   46   32   198 0.875   9.77
 6.03 Intr + 105783 106027  245  2  2   99   94   133 0.982  10.37
 6.04 Intr + 111908 112073  166  2  1   84   99   132 0.986  12.94
 6.05 Term + 115798 116565  768  0  0   98   38   185 0.972   6.91
 6.06 PlyA + 117501 117506    6                               1.05

 7.03 PlyA - 117572 117567    6                               1.05
 7.02 Term - 126254 126200   55  1  1   72   50   106 0.427   1.25
 7.01 Init - 130540 130377  164  1  2   62   67   105 0.058   3.11
 7.00 Prom - 133470 133431   40                              -4.95

 8.00 Prom + 135940 135979   40                              -2.75
 8.01 Init + 141584 141640   57  1  0   83   80   102 0.313   8.28
 8.02 Term + 171802 171900   99  0  0  124   47    76 0.622   4.25
 8.03 PlyA + 171977 171982    6                               1.05

 9.00 Prom + 179524 179563   40                              -3.75
 9.01 Init + 182503 182577   75  0  0   36   89    42 0.711   0.24
 9.02 Intr + 183710 183901  192  1  0   49   94   185 0.856  13.97
 9.03 Intr + 185500 185756  257  0  2   47   70   249 0.974  14.32
 9.04 Intr + 192027 192171  145  0  1   91   34   117 0.750   5.96
 9.05 Intr + 195759 195944  186  2  0   60   80   204 0.935  15.76
 9.06 Term + 200059 200127   69  0  0   83   47    84 0.853   0.76
 9.07 PlyA + 200851 200856    6                               1.05

10.02 PlyA - 200992 200987    6                               1.05
10.01 Term - 212117 211875  243  2  0  102   48   185 0.277  10.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_1|382_aa
XQEEAVCTKIQVQSSEASFTLTAAAEEEKVLGETANILETGETLEPAGAHLVLEEKSSEK
NEDFAAHPGEDAVPTGPDCQAKSTPVIVSATTKKGLSSDLEGEKTTSLKWKSDEVDEQVA
CQEVKVSVAIEDLEPENGILELETKSSKLVQNIIQTAVDQFVRTEETATEMLTSELQTQA
HVIKADSQDAGQETEKEGEEPQASAQDETPITSAKEESESTAVGQAHSDISKDMSEASEK
TMTVEVEGSTVNDQQLEEVVLPSEEEGGGAGTKSVPEDDGHALLAERIEKSLVEPKEDEK
GDDVDDPENQNSALADTDASGGLTKESPDTNGPKQKEKEDAQEVELQEGKVHSESDKAIT
PQAQEELQKQERESAKSELTES

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_1|1149_bp
ngtcaagaggaggcagtatgcaccaaaattcaagttcagagctctgaggcatcattcact
ctaacagcggctgcagaggaggaaaaggtcttaggagaaactgccaacattttagaaaca
ggtgaaacgttggagcctgcaggtgcacatttagttctggaagagaaatcctctgaaaaa
aatgaagactttgccgctcatccaggggaagatgctgtgcccacagggcccgactgtcag
gcaaaatcgacaccagtgatagtatctgctactaccaagaaaggcttaagttccgacctg
gaaggagagaaaaccacatcactgaagtggaagtcagatgaagtcgatgagcaggttgct
tgccaggaggtcaaagtgagtgtagcaattgaggatttagagcctgaaaatgggattttg
gaacttgagaccaaaagcagtaaacttgtccaaaacatcatccagacagccgttgaccag
tttgtacgtacagaagaaacagccaccgaaatgttgacgtctgagttacagacacaagct
cacgtgataaaagctgacagccaggacgctggacaggaaacggagaaagaaggagaggaa
cctcaggcctctgcacaggatgaaacaccaattacttcagccaaagaggagtcagagtca
accgcagtgggacaagcacattctgatatttccaaagacatgagtgaagcctcagaaaag
accatgactgttgaggtagaaggttccactgtaaatgatcagcagctggaagaggtcgtc
ctcccatctgaggaagagggaggtggagctggaacaaagtctgtgccagaagatgatggt
catgccttgttagcagaaagaatagagaagtcactagttgaaccgaaagaagatgaaaaa
ggtgatgatgttgatgaccctgaaaaccagaactcagccctggctgatactgatgcctca
ggaggcttaaccaaagagtccccagatacaaatggaccaaaacaaaaagagaaggaggat
gcccaggaagtagaattgcaggaaggaaaagtgcacagtgaatcagataaagcgatcaca
ccccaagcacaggaggagttacagaaacaagagagagaatctgcaaagtcagaacttaca
gaatcttaa

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_2|514_aa
MDLANHGLILLQQLNAQREFGFLCDCTVAIGDVYFKAHKSVLASFSNYFKMLFVHQTSEC
VRLKPTDIQPDIFSYLLHLMYTGKMAPQLIDPVRLEQGIKFLHAYPLIQEASLASQGAFS
HPDQVFPLASSLYGIQIADHQLRQATKIASAPEKLGRDPRPQTSRISQEQVPEASQLSQL
TSNLAQVNRTNMTPSDPLQTSLSPELVSTPVPPPPPGEETNLEASSSDEQPASLTIAHVK
PSIMKRNGSFPKYYACHLCGRRFTLRSSLREHLQIHTGVPFTSSQQGESRVPLTLCSNAA
DLGKDAMEVPEAGMISDSELQHISDSPIIDGQQQSETPPPSDIADIDNLEQADQEREVKR
RKYECTICGRKFIQKSHWREHMYIHTGKPFKCSTCDKSFCRANQAARHVCLNQSIDTYTM
VDKQTLELCTFEEGSQMDNMLVQTNKPYKCNLCDKTFSTPNEVVKHSCQNQNSDVFALDE
GRSILLGSGDSEVTEPDHPVLASIKKEQETVLLD

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_2|1545_bp
atggatttggccaaccatggacttattctactgcaacagttaaacgctcagcgagagttt
ggtttcctgtgtgactgcacggttgcaatcggcgatgtatacttcaaggcacacaaatca
gttcttgcttcattctccaattactttaagatgttgtttgtccatcagaccagtgagtgt
gtccgcttgaaaccaactgacatacagcccgacatcttcagctatctcttacacttgatg
tacactgggaagatggcgcctcagctcatcgacccggttcgattagaacagggcatcaag
tttctgcacgcctacccgctcattcaggaagccagcctcgccagccagggagccttttct
caccctgaccaagttttcccactggcttcttcattgtatggcattcagattgcagatcat
cagttgagacaagccaccaagattgcttcagcacctgaaaaactcgggcgagatccacgg
ccacagacctccaggataagccaggagcaggtccctgaggcctcacagctctcccagctg
acttcaaatctggcccaggtgaatcggacaaatatgactccctcagaccccctgcagacc
tcgctgtctccagaacttgtttccactcctgttcctccccctcctcccggggaggagacc
aatctggaagcatcttcctccgatgagcagcctgcgtccctcacaatagcccacgtcaag
ccaagcatcatgaagaggaatgggagctttccaaagtactatgcctgccacctgtgtgga
cggcgcttcactctccggagcagcttacgtgaacacctccagatccacacaggagtacct
ttcacatctagccaacagggagaaagtcgcgtccccctgactctctgtagcaatgcagct
gacctcgggaaagatgccatggaagtgcctgaagccgggatgataagtgacagtgagctg
cagcacatctctgattctcccatcatcgatgggcagcagcagtcggaaaccccacccccc
tcagacattgctgacattgacaacctggagcaggccgaccaggagagggaggtgaagagg
cggaagtacgagtgcacaatatgtggacgcaaatttatccagaaaagccactggagggaa
cacatgtacatacacaccgggaagcctttcaagtgcagcacttgtgacaaaagcttttgc
agggccaaccaggctgcccgccacgtgtgcctcaaccagagcatcgacacttacaccatg
gtggacaaacagactctggaactctgcacatttgaggaagggagtcagatggacaacatg
ctggtgcaaacaaacaaaccctacaaatgcaacttgtgtgacaaaacattctccactccc
aatgaggttgttaaacattcatgccaaaaccagaactcggatgtttttgccctagacgaa
gggcgatccattctcctgggcagtggggactcggaagtaacggagcctgaccacccagtg
ttagcttccatcaaaaaggaacaagaaaccgtcttactagactga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_3|295_aa
MEEIDRIQKIFRRLVPRDLPSFSKFLQGKLAALPPKQQRRQQQEAAARRRKDYAPDGPAS
EGSSAGRSPAAAAQGALRGGGGKGGTGHGEPNANRPPPLPPSHPPSDPPPALSASVPSPS
HKTPDGGAADSPPPGSGGWFCQAARAPAPGAAAAAPARRPCPRFPEPGPRVARAFRLPAP
VALVRGSFVVFVATASAGTVCELRWPPPPHRLPGPWAGGGTWAPAPGGGGGARLPVAGAA
AAGGVEEAPLGSLGRAVLPAAAASASRCCCCCCRRGRCLRRGGGGGGGAFPVRLR

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_3|888_bp
atggaggaaattgacaggattcaaaagatatttagaaggctggtgccacgggacttgccc
agcttttccaagttcctacagggaaagctcgcggccttgccgccgaagcagcagcggcgg
cagcagcaggaggcggcggcgaggcgtcggaaggattacgcacctgacgggcccgcctct
gagggctccagcgcgggacgctcaccggccgccgccgcccagggcgcacttcgcggtggc
ggcgggaagggagggaccggccacggggaaccgaacgcgaatcgcccccctccccttcct
ccctcccaccctccctccgatccgccgcccgccctctcggcttccgtcccctccccctcc
cacaaaaccccggatggaggcgccgccgactcgccgccgcctggctccgggggatggttt
tgtcaagcggccagagccccggcgccaggcgcagccgccgccgcccccgcgaggcgcccc
tgccctcgcttcccagaaccgggtccccgagtggcccgggccttccgcctgcctgcaccc
gtagccctggtacgagggtcattcgtcgtctttgttgccaccgcctctgccggaaccgtt
tgcgagctccggtggcccccgcccccccaccgccttcccggaccctgggctgggggtggc
acgtgggcaccggccccgggaggcggaggcggtgcgcgcttacctgtcgccggtgctgct
gcggcggggggtgtggaggaggcgcctctgggcagcctcggccgtgctgtcctccccgcc
gccgccgcctctgcctctcgctgctgctgctgctgctgccgccgcggtcggtgtctccgg
cgcggcggcggcggcggcggcggggcttttcctgtccggttacgttaa

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_4|45_aa
MVMNEKLQHCMELTDLMRNHLNEKRALRLEWMIVILITIEWTVTT

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_4|138_bp
atggtcatgaatgaaaaacttcagcactgcatggaactaacagatctaatgcggaatcac
ctgaatgagaagagggcactccgcttggagtggatgattgtcatcctcattaccatagag
tggactgtgaccacctga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_5|219_aa
MESSRVLSSRHRCFSLLVTHVMLISLEQDLMHCTAFATADEYHLGNLSQDLASHGYVEVT
SLPRDAANILVMGVENSAKEGDPGTIFFFREGAAVFWNVKDKTMKHVMKVLEKHEIQPYE
IALVHWENEELNYIKIEGQSKLHRGEIKLNSELDLDDAILEKFAFSNALCLSGVFQHRTV
EIITTRQENQAMLLLADTERLISDGLGIKFSGLHQHSFR

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_5|660_bp
atggagtccagtagggtgctcagcagccgtcacagatgcttcagcctcttggtaacccat
gttatgctcatttccctggaacaggacctaatgcactgcacagcatttgcaacggcagat
gagtatcatctgggaaatctgtctcaagatctggcctcccacggatatgttgaagtaaca
agcttgcctagagatgcagcaaatattttggtgatgggtgtggaaaattctgcaaaagaa
ggtgatcctggaacaatattcttcttcagggaaggagctgctgtgttttggaatgtgaaa
gacaaaactatgaagcatgtgatgaaagttctagaaaaacatgaaattcagccctatgaa
atcgcactggtacactgggaaaatgaagaacttaactacataaaaatagagggacagtca
aaacttcacaggggggaaatcaagttaaattcagagctggatttagatgatgccattcta
gagaagtttgctttctccaatgctctatgcctttctggtgtcttccagcacaggacagta
gagatcatcaccactcggcaagaaaaccaagctatgttgcttttagctgacacagaaaga
ttgattagtgatggtttaggaatcaaattcagtggactccatcaacacagttttagataa

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_6|448_aa
MGVRGMSDTHMNIPDKLLLRGGTASVSAAIEQPSFAAGIAESDGCRPGVSLRTGRGEGVE
AEKKAISLLSKLRNELQTDKPFIPLVEKFVDTDIWNQYLEYQQSLLNESDGKSRWFYSPW
LLVECYMYRRIHEAIIQSPPIDYFDVFKESKEQNFYGSQESIIALCTHLQQLIRTIEDLD
ENQLKDEFFKLLQISLWGNKCDLSLSGGESSSQNTNVLNSLEDLKPFILLNDMEHLWSLL
SNCKKTREKASATRVYIVLDNSGFELVTDLILADFLLSSELATEVHFYGKTIPWFVSDTT
IHDFNWLIEQVKHSNHKWMSKCGADWEEYIKMGKWVYHNHIFWTLPHEYCAMPQVAPDLY
AELQKAHLILFKGDLNYRKLTGDRKWEFSVPFHQALNGFHPAPLCTIRTLKAEIQVGLQP
GQGEQLLASEPSWWTTGKYGIFQYDGPL

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_6|1347_bp
atgggagtgcgaggcatgtcagatacacacatgaacattccagacaagctcctccttcgc
ggcggtaccgcctctgtttctgcggcgattgaacagccgagctttgcggccgggatcgcg
gaaagtgatggctgtcgtcccggcgtctctctcaggacaggacgtggggaaggcgtggaa
gctgaaaagaaagctatctctctcctttctaaattacggaatgaattgcaaacagataaa
ccatttatccccttggttgagaaatttgttgatactgatatatggaatcagtacctagaa
tatcaacagagtcttttaaatgaaagtgatggaaaatcaagatggttctactcaccgtgg
ttgttggtagaatgttacatgtatcgaagaattcatgaagcaattatccagagtccacca
atcgattactttgatgtatttaaagaatcaaaagagcaaaatttctatgggtcacaggaa
tccatcattgctttatgtactcacctgcaacaattgataagaactattgaagacctagat
gaaaatcagctgaaagatgagttttttaaacttctgcagatttcactgtggggaaataag
tgtgatctgtctctctcaggtggagaaagtagttctcagaataccaatgtactaaattca
ttggaagacctaaaacctttcattttattgaatgatatggaacatctttggtcattgctt
agcaattgcaagaaaacaagagaaaaagcttctgctactagagtgtatattgttctcgat
aattctggatttgagcttgttacagatttaatattagccgacttcttgttgtcctctgaa
ctggctactgaggttcatttttatggaaaaacaattccatggtttgtttctgatactact
atacatgattttaattggttaattgaacaggtaaaacacagtaatcataagtggatgtcc
aagtgtggggctgactgggaagagtatattaaaatgggtaaatgggtttaccacaatcat
atattttggactctgcctcatgagtactgtgcaatgcctcaggttgcacctgacttatat
gctgaactacagaaggcacatttaattttattcaagggtgatttgaattacaggaagttg
acaggtgacagaaaatgggagttttctgttccatttcatcaggctctgaatggcttccat
cctgcaccactctgtaccataagaacattaaaagctgaaattcaggttggtctgcagcct
gggcaaggggaacagctcctggcctctgagcccagctggtggaccactggaaaatatgga
atatttcagtacgatggtcccctttga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_7|72_aa
MKRLDWLSLTAYIFLLCWMLPALKHRTPKFFSLETWTGFLAPQLADGLLWDLVIVSFDGM
ELDKKGFALSEI

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_7|219_bp
atgaaaaggctagactggcttagcctcacagcctacatctttctcctgtgctggatgctt
cctgcactcaaacatcgcactcccaagttcttcagccttgagacttggactggcttcctt
gctcctcagcttgcagatggcctgttgtgggaccttgtgatcgtatcttttgacggcatg
gagctggacaagaaaggcttcgcattaagtgaaatttga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_8|51_aa
MSLDCTSHIALGAASPAPELELSDSPFDFYFDHAGVQVQDFLWPLKPIKNS

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_8|156_bp
atgagcctggactgcaccagccatatcgcgctgggtgccgcttcgccagcgcccgagctg
gaactcagtgattctccttttgacttctactttgaccatgcaggggtccaagttcaggat
ttcctgtggcctctgaagccaataaagaattcatga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_9|307_aa
MSQMETGPEGSVCSPQAKPTKSALQLHGKDRLTLFFIFCILGEFYQETYDHLSEVPVTRE
QLNHYRNVAQNARSELAATLVKFECAQSELQDLRSKMLSKEVSCQELKAEMESYKENNAR
KSSLLTSLRDRVQELEEESAALSTSKIRTEITAHAAIKENQELKKKVVELNEKLQKCSKE
NEENKKQVSKNCRKHEEFLTQLRDCLDPDERNDKASDEDLILKLRDLRKENEFVKGQIVI
LEETINVHEMEAKASRETIMRLASEVNREQKKAASCTEEKEKLNQCSKFSKFVRQKLLQD
VPCDLVA

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_9|924_bp
atgagtcagatggaaacaggaccagagggttcagtctgcagcccacaggccaaacccacc
aaatctgctctgcagctgcacggaaaagatcgtttaacactcttctttatattttgtatt
ttgggggaattctaccaggaaacttacgatcatctttcggaagtcccggtcacgcgggag
cagttaaaccactatcggaatgtggctcaaaatgctcgaagtgaacttgcagcaactttg
gtcaaatttgaatgtgctcagtctgagcttcaagacctccgatccaagatgctttctaaa
gaagtctcctgtcaagaactgaaagctgaaatggagagctacaaggaaaacaatgccaga
aaatcatctctccttacctctttgagagacagagttcaggaactagaagaagaatcagca
gcactttccacttctaaaatcagaacagaaatcacagctcacgctgcaatcaaggagaac
caggaattaaagaagaaagttgtagagttaaatgaaaaattacaaaagtgttcaaaagaa
aatgaggagaataagaaacaagtttcaaagaattgcaggaaacatgaggaatttctgact
caactgcgtgactgcttggatccagatgagaggaatgacaaggcatcagatgaagattta
attttaaagcttagagacctgcgcaaagaaaatgaattcgtgaaaggacaaattgttatt
cttgaagagactataaatgtccatgagatggaagcaaaagctagcagagaaacgatcatg
aggctggcttcagaagtcaacagagagcagaaaaaagctgcctcctgtactgaagagaaa
gagaagctgaaccagtgctccaaattctccaaatttgtccggcagaagctccttcaggat
gtaccctgtgaccttgtagcatga

>gi568815592f:151352546_151569107|GENSCAN_predicted_peptide_10|80_aa
MAPTCTHNLQAQGLACPAITATVNTSTDSLRAKALSHNCYYDSPCHTAAQGLKDPPAAAT
ASIDPPRPAKTSASKCHPAA

>gi568815592f:151352546_151569107|GENSCAN_predicted_CDS_10|243_bp
atggcacccacctgcacacacaacctgcaggcgcagggcctggcctgcccagccatcaca
gccactgtcaacaccagcacagacagcttgagagccaaagcattgtcccacaactgctac
tatgatagcccatgccacactgctgcccagggactgaaagacccacctgctgctgccact
gccagtatagacccacctagacctgctaagaccagtgccagcaaatgccaccctgcggcc
taa