GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:47:07 Sequence gi568815581f:16117227_16425895 : 308669 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 Intr - 801 662 140 1 2 79 87 24 0.458 1.58 1.15 Intr - 2259 2197 63 1 0 95 94 88 0.625 8.79 1.14 Intr - 4043 3826 218 1 2 51 81 197 0.961 13.45 1.13 Intr - 8983 8856 128 1 2 34 89 223 0.708 16.48 1.12 Intr - 20986 20932 55 0 1 51 116 -9 0.046 -2.82 1.11 Intr - 21960 21782 179 0 2 69 30 165 0.088 7.52 1.10 Intr - 26470 26380 91 0 1 63 94 12 0.920 -0.70 1.09 Intr - 29322 29150 173 0 2 40 101 149 0.964 10.14 1.08 Intr - 31982 31955 28 1 1 92 113 -21 0.791 -1.08 1.07 Intr - 34772 34720 53 2 2 60 105 74 0.857 4.01 1.06 Intr - 36169 36113 57 1 0 73 119 31 0.932 3.78 1.05 Intr - 41647 41534 114 1 0 72 102 132 0.981 13.64 1.04 Intr - 47935 47753 183 1 0 68 38 104 0.801 3.38 1.03 Intr - 54769 54577 193 0 1 62 94 60 0.630 3.49 1.02 Intr - 69461 69328 134 2 2 63 95 87 0.791 6.34 1.01 Init - 77343 77236 108 0 0 34 110 41 0.478 1.12 1.00 Prom - 92858 92819 40 -3.96 2.00 Prom + 93204 93243 40 -3.16 2.01 Init + 100001 100235 235 1 1 86 91 179 0.969 14.30 2.02 Intr + 116745 116844 100 1 1 68 90 92 0.614 6.57 2.03 Intr + 150709 150875 167 1 2 64 65 61 0.009 1.00 2.04 Intr + 169384 169582 199 2 1 -16 36 168 0.046 -0.39 2.05 Intr + 182662 182752 91 1 1 68 95 71 0.065 5.80 2.06 Intr + 196321 196388 68 0 2 112 64 51 0.187 2.80 2.07 Intr + 199455 199486 32 0 2 129 116 -5 0.694 4.17 2.08 Intr + 200549 200682 134 0 2 102 100 203 0.995 23.26 2.09 Term + 208574 208672 99 1 0 94 48 81 0.979 2.83 2.10 PlyA + 210577 210582 6 1.05 3.06 PlyA - 214191 214186 6 1.05 3.05 Term - 225715 225591 125 2 2 78 49 178 0.954 11.45 3.04 Intr - 227485 227371 115 1 1 58 109 80 0.994 7.12 3.03 Intr - 231459 231390 70 2 1 58 86 65 0.729 2.48 3.02 Intr - 232803 232705 99 2 0 59 89 52 0.712 1.63 3.01 Init - 236210 235801 410 2 2 94 70 552 0.984 50.03 3.00 Prom - 241977 241938 40 -3.86 4.00 Prom + 257402 257441 40 -4.66 4.01 Init + 261856 261888 33 0 0 107 77 75 0.141 6.23 4.02 Term + 264676 265371 696 0 0 67 31 1342 0.168 119.95 4.03 PlyA + 265492 265497 6 1.05 5.04 PlyA - 266256 266251 6 1.05 5.03 Term - 274387 274269 119 2 2 105 47 168 0.962 13.10 5.02 Intr - 288103 288053 51 2 0 108 63 20 0.287 0.38 5.01 Init - 289867 289789 79 1 1 58 89 38 0.404 0.78 5.00 Prom - 292366 292327 40 -8.66 6.00 Prom + 293448 293487 40 -4.96 6.01 Init + 297233 297236 4 1 1 97 68 0 0.276 -0.94 6.02 Intr + 298546 298615 70 2 1 103 60 43 0.694 1.24 6.03 Intr + 300475 300642 168 1 0 62 94 174 0.700 14.46 6.04 Intr + 302889 303022 134 0 2 53 74 233 0.999 18.69 6.05 Intr + 305373 305663 291 1 0 117 109 323 0.997 34.41 6.06 Intr + 306243 306541 299 1 2 93 83 323 0.963 28.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 21960 21778 183 0 0 69 28 177 0.900 7.34 S.002 Init + 97696 97755 60 0 0 51 44 96 0.813 2.75 S.003 Sngl + 264682 265371 690 0 0 82 31 1336 0.821 123.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_1|639_aa MSSSGYPPNQGAFSTEQSRYPPHSVQYTFPNTRHQQEFAVPDYRSSHLEVSQASQLLQQQ QQQQLRRRPSLLSEFHPGSDRPQERRTSYEPFHPGPSPVDHDSLESKRPRLEQVSDSHFQ RVSAAVLPLVHPLPEGLRASADAKKDPAFGGKHEAPSSPISGQPCGDDQNASPSKLSKEE LIQSMDRVDREIAKVEQQILKLKKKQQQLEEEAAKPPEPEKPVSPPPVEQKHRSIVQIIY DENRKKAEEAHKIFEGLGPKVELPLYNQPSDTKVYHENIKTTFSFSQNKTEQKICQRYDQ LMEAWEKKVDRIENNPRRKAKESKTREYYEKQFPEIRKQREQQERFQRVGQRGAGLSATI ARSEHEISEIIDGLSEQENNEKQMRQLSVIPPMMFDAEQRRVKFINMNGLMEDPMKVYKD RQFMNVWTDHEKEIFKDKFIQHPKNFGLIASYLERKQQIARPSQEEKVEEKEEDKAEKTE KKEEEKKDEEEKDEKEDSKENTKEKDKIDGTAEETEEREQATPRGRKTANSQGRRKGRIT RSMTNEAAAASAAAAAATEEPPPPLPPPPEPISTEPVETSRWTEEEMEVAKKGLVEHGRN WAAIAKMVGTKSEAQCKNFYFNYKRRHNLDNLLQQHKQK >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_1|1917_bp atgtcaagttcaggttatcctcccaaccaaggagcattcagcacagaacaaagtcgttat cctcctcactctgtccagtatacatttcccaacacccgccaccagcaggagttcgcagtc cctgattatcgttcctctcatcttgaagtgagtcaggcatcacagcttttgcagcaacag cagcagcaacagcttcgaaggcgaccttccttgctttcagaatttcacccaggttctgac aggcctcaagaaaggagaactagttatgaaccgtttcatccaggcccatccccagtggat catgattcactggaatcgaagcgaccacgtctggaacaggtttctgattctcattttcag cgtgtcagtgctgcggttttgcctttagtgcacccgctgccagaagggctgagggcttct gcagatgctaagaaggatccagcattcggaggcaaacatgaagctccatcctctccaatt tcggggcaaccatgtggagatgatcaaaatgcttcaccttcaaaactctcaaaggaagag ttaatacagagtatggatcgtgtagatcgagaaattgcaaaagtagaacagcagatcctt aaactgaaaaagaaacaacaacagcttgaagaagaggcagctaaacctcctgagcctgag aagcccgtgtcccctcctcctgtggagcagaaacaccgcagtattgtccaaattatttat gatgagaatcggaaaaaagcagaagaagctcataaaatttttgaaggtcttggcccaaaa gttgaactgccactgtataaccagccatcagataccaaggtgtaccatgagaacatcaag acaacattttcatttagtcaaaataaaacggaacaaaaaatctgccagcgttatgatcag ctcatggaggcatgggagaaaaaagtggacagaatagaaaataatcctcggaggaaagct aaagaaagcaaaacaagggaatactatgaaaagcagtttccagaaattcgaaaacaaaga gaacagcaagaaagatttcagcgagttgggcagaggggagctggtctttcagccaccatt gctaggagtgagcatgagatttctgaaattattgatgggctctctgagcaggagaataat gagaaacaaatgcggcagctctctgtgattccacctatgatgtttgatgcagaacaaaga cgagtcaagttcattaacatgaatgggcttatggaggaccctatgaaagtgtataaagat aggcagtttatgaatgtttggactgaccatgaaaaggagatctttaaggacaagtttatc cagcatccaaaaaactttggactaattgcatcatacttggagaggaagcagcaaattgct cgaccctcgcaagaagaaaaagtagaagaaaaagaagaggataaagcagaaaaaacagaa aaaaaagaagaagaaaagaaagatgaagaggaaaaagatgaaaaagaagactccaaagaa aataccaaggaaaaggacaagatagatggtacagcagaagaaactgaggaaagagagcaa gccacaccccgggggcgaaagactgccaacagtcagggccgccgtaagggccggatcacc aggtccatgacaaacgaagctgcagctgccagtgctgcagccgcagcggctactgaagag cccccaccacctctgccaccgccaccagaacccatttctacagagcctgtggagacctct cgatggacagaagaagaaatggaagttgctaaaaaaggtctagtagaacatggtcgtaac tgggcagcaattgctaaaatggtgggaacgaaaagtgaagctcaatgtaaaaacttctat tttaactataaaaggcgacacaatcttgacaacctcttacagcagcataaacagaaa >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_2|374_aa MEAMWLLCVALAVLAWGFLWVWDSSERMKSREQGGRLGAESRTLLVIAHPDDEAMFFAPT VLGLARLRHWVYLLCFSAGNYYNQGETRKKELLQSCDVLGIPLSSVMIIDNRQEQTKSRL FRLVSRTDSSLKELPDQGGEETVNIHHAVVTAEMEHKGKFRERSQKRGRDFEFGAFSRGI NTVFAKYTPGKESSGFYESHLKDWQELASSVVFLASKIVVSPQSQTRNAIMLMRDFPDDP GMQWDTEHVARVLLQHIEVNGINLVVTFDAGGVSGHSNHIALYAAVRALHSEGKLPKGCS VLTLQSVNVLRKYISLLDLPLSLLHTQDVLFVLNSKEVAQAKKAMSCHRSQLLWFRRLYI IFSRYMRINSLSFL >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_2|1125_bp atggaagcaatgtggctcctgtgtgtggcgttggcggtcttggcatggggcttcctctgg gtttgggactcctcagaacgaatgaagagtcgggagcagggaggacggctgggagccgaa agccggaccctgctggtcatagcgcaccctgacgatgaagccatgttttttgctcccaca gtgctaggcttggcccgcctaaggcactgggtgtacctgctttgcttctctgcaggaaat tactacaatcaaggagagactcgtaagaaagaacttttgcagagctgtgatgttttgggg attccactctccagtgtaatgattattgacaacaggcaagagcagacaaagagcagactc tttagactggtaagcagaacagactcgtccctcaaggagctcccagaccagggaggggag gagactgtaaatattcaccatgccgtggtaacagcagagatggagcacaaggggaagttc agagaaagatctcagaagagagggcgggacttcgagtttggggccttttccagaggtatt aacactgtgtttgctaaatacaccccagggaaggagtcgtcgggcttctacgagtcacac ttaaaagactggcaagagttggcttcttctgttgtgttcttagcttccaaaatagtagtc tctcctcagagccagacgcgcaatgccatcatgttaatgagggatttcccagatgaccca ggcatgcagtgggacacagagcacgtggccagagtcctccttcagcacatagaagtgaat ggcatcaatctggtggtgactttcgatgcagggggagtaagtggccacagcaatcacatt gctctgtatgcagctgtgagggccctgcactcagaagggaagttacctaaagggtgctct gtgctcacgcttcagtctgtgaatgtgctgcgcaagtacatctcccttctggatctgccc ttgtctctgcttcatacgcaggatgtcctcttcgtgctcaacagcaaagaagtggcacag gccaagaaagccatgtcctgccaccgcagccagctcctctggttccgccgcctctacatt atcttctcccggtacatgagaatcaactcactgagcttcctctga >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_3|272_aa MRRSRSSAAAKLRGQKRSGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPPSEK PRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWETFQK RQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNR HFIVPASRFKLLKGAEHITTYTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDE GTVRSMVTEEFNGSDWEKAMKEHKTIKNMSKE >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_3|819_bp atgcggcgatcgaggagctctgcggccgccaagctgcgcgggcagaagcggtccggggcc tccgcggcccccgcggcctccgcggccgctgccttggcacccagcgccacccgcacacgg cgctccgctagccaggccgggagcaagagccaggcggtggagaagccgccgtcggagaag ccgcggctgaggcgctcgtcgccgcgggcccaggaggagggcccgggggagccgccgccg cctgagctggcgttgctcccgccaccgccgccgccgccgccgactcccgcgaccccgacg tcctcggcgtccaacctggacctgggcgagcagcgggagcgctgggagacgttccagaag cggcagaagcttacctccgagggtgccgccaagctcctgctagacacctttgaataccag ggcctggtgaagcacacaggaggctgccactgtggagcagttcgttttgaagtttgggcc tcagcagacttgcatatatttgactgcaattgcagcatttgcaagaagaagcagaataga cacttcattgttccagcttctcgcttcaagctcctgaagggagctgagcacataacgact tacacgttcaatactcacaaagcccagcataccttctgtaagagatgtggcgttcagagc ttctatactccacgatcaaaccccggaggcttcggaattgccccccactgcctggatgag ggcactgtgcggagtatggtcactgaggaattcaatggcagcgattgggagaaggccatg aaagagcacaagaccatcaagaacatgtctaaagagtga >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_4|242_aa MGSSLASLGLVVKMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAG KQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQ DKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITL EVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRG GC >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_4|729_bp atgggctccagcctcgcctctctgggcctggtggtcaaaatgcagatcttcgtgaaaacc cttaccggcaagaccatcacccttgaggtggagcccagtgacaccatcgaaaatgtgaag gccaagatccaggataaggaaggcattccccccgaccagcagaggctcatctttgcaggc aagcagctggaagatggccgtactctttctgactacaacatccagaaggagtcgaccctg cacctggtcctgcgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaag accatcaccctggaagtggagcccagtgacaccatcgaaaatgtgaaggccaagatccag gataaagaaggcatccctcccgaccagcagaggctcatctttgcaggcaagcagctggaa gatggccgcactctttctgactacaacatccagaaggagtcgaccctgcacctggtcctg cgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaagaccatcactctg gaggtggagcccagtgacaccatcgaaaatgtgaaggccaagatccaagataaagaaggc atcccccccgaccagcagaggctcatctttgcaggcaagcagctggaagatggccgcact ctttctgactacaacatccagaaagagtcgaccctgcacctggtcctgcgcctgaggggt ggctgttaa >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_5|82_aa MQADGGTTAFPSGLEGGGRPGSALWSVYKTGEKGESKKLERNRSPAEERAAMLALHLPET LGQFTFQSHIVTAEVEAQRDAS >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_5|249_bp atgcaggcggacggcgggaccacagcatttccctccggcttggagggcggaggccgaccc gggagcgcgctctggagcgtttataagacaggagaaaagggagaaagcaaaaagttggaa agaaacagaagtcctgcagaagagcgggctgccatgctggctttgcatcttccagagacg cttggccagttcaccttccagagccacatcgtcacggctgaagtagaagcccagagagat gccagttga >gi568815581f:16117227_16425895|GENSCAN_predicted_peptide_6|322_aa MAPATEKLRDPSSRHALASACGVSRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGE DRKFAPQIRVNLNYRKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLT DSEYTEGSTGKTCLMKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSA LHIAIEKRSLQCVKLLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVS YLLENPHQPASLQATDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQL EDIRNLQDLTPLKLAAKEGKIE >gi568815581f:16117227_16425895|GENSCAN_predicted_CDS_6|966_bp atggcccctgctactgagaagctccgggatcccagcagccgccacgccctggcctcagcc tgcggggtaagtcggttggagacattagatggaggccaagaagatggctctgaggcggac agaggaaagctggattttgggagcgggctgcctcccatggagtcacagttccagggcgag gaccggaaattcgcccctcagataagagtcaacctcaactaccgaaagggaacaggtgcc agtcagccggatccaaaccgatttgaccgagatcggctcttcaatgcggtctcccggggt gtccccgaggatctggctggacttccagagtacctgagcaagaccagcaagtacctcacc gactcggaatacacagagggctccacaggtaagacgtgcctgatgaaggctgtgctgaac cttaaggacggagtcaatgcctgcattctgccactgctgcagatcgacagggactctggc aatcctcagcccctggtaaatgcccagtgcacagatgactattaccgaggccacagcgct ctgcacatcgccattgagaagaggagtctgcagtgtgtgaagctcctggtggagaatggg gccaatgtgcatgcccgggcctgcggccgcttcttccagaagggccaagggacttgcttt tatttcggtgagctacccctctctttggccgcttgcaccaagcagtgggatgtggtaagc tacctcctggagaacccacaccagcccgccagcctgcaggccactgactcccagggcaac acagtcctgcatgccctagtgatgatctcggacaactcagctgagaacattgcactggtg accagcatgtatgatgggctcctccaagctggggcccgcctctgccctaccgtgcagctt gaggacatccgcaacctgcaggatctcacgcctctgaagctggccgccaaggagggcaag atcgag