GENSCAN 1.0    Date run: 3-Nov-116    Time: 05:21:29

Sequence gi568815587f:123477464_123722592 : 245129 bp : 45.35% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   1470   1509   40                               -0.76
 1.01 Init +   3108   3148   41  2  2   95   64     1 0.358  -1.78
 1.02 Intr +   3353   3430   78  2  0   70  116    74 0.542   7.07
 1.03 Intr +   4348   4466  119  1  2   59   75    64 0.623   2.31
 1.04 Intr +   6452   6540   89  0  2   59   96    12 0.311  -1.21
 1.05 Term +   6987   7241  255  2  0  113   35    49 0.130  -2.31
 1.06 PlyA +   7845   7850    6                                1.05

 2.13 PlyA -   8137   8132    6                                1.05
 2.12 Term -  10000   9886  115  0  1   79   48    68 0.372  -0.06
 2.11 Intr -  14061  13950  112  1  1  112   91    -7 0.271   1.44
 2.10 Intr -  14792  14678  115  2  1   92   51    52 0.388   1.92
 2.09 Intr -  32824  32784   41  2  2  100   95    21 0.399   1.94
 2.08 Intr -  34889  34683  207  0  0   58   62   132 0.677   6.65
 2.07 Intr -  40051  39995   57  2  0   58  108    49 0.170   2.76
 2.06 Intr -  48482  48377  106  2  1  103   30    42 0.349  -0.31
 2.05 Intr -  49620  49423  198  0  0  106   88    -9 0.173   0.45
 2.04 Intr -  50833  50697  137  2  2   78   11    85 0.144   0.09
 2.03 Intr -  53977  53911   67  1  1   91   92    66 0.711   5.78
 2.02 Intr -  79431  79354   78  0  0   69   88    31 0.052   0.95
 2.01 Init -  82339  82277   63  1  0   89   71    60 0.194   3.62
 2.00 Prom -  89580  89541   40                               -4.26

 3.04 PlyA -  91767  91762    6                                1.05
 3.03 Term -  93872  93733  140  0  2   92   43    96 0.917   3.63
 3.02 Intr -  95635  95287  349  1  1   58    9   253 0.628   9.33
 3.01 Init -  96506  96441   66  2  0   39   84    26 0.455  -1.56
 3.00 Prom -  97897  97858   40                               -7.66

 4.00 Prom +  99510  99549   40                               -4.06
 4.01 Init +  99638  99735   98  1  2   53   62    24 0.230  -3.81
 4.02 Intr +  99904 100251  348  1  0   82   53   352 0.227  25.57
 4.03 Intr + 103997 104156  160  2  1   87  110     5 0.103   2.59
 4.04 Intr + 106849 106869   21  0  0  122  101    -8 0.031   1.64
 4.05 Intr + 113489 113605  117  1  0   99   92    54 0.045   7.56
 4.06 Intr + 116619 116703   85  2  1   83   97   120 0.693  11.79
 4.07 Intr + 117272 117375  104  0  2  112   63    48 0.888   4.59
 4.08 Intr + 118479 118574   96  2  0   43  115    97 0.838   8.11
 4.09 Term + 119437 119508   72  0  0  -21   42    99 0.239  -7.69
 4.10 PlyA + 120164 120169    6                               -1.95

 5.02 PlyA - 120275 120270    6                                1.05
 5.01 Sngl - 121989 121054  936  0  0   95   47   545 0.998  47.59
 5.00 Prom - 123376 123337   40                               -9.95

 6.00 Prom + 123812 123851   40                               -6.66
 6.01 Init + 125622 125668   47  2  2   43  100    -3 0.069  -3.45
 6.02 Intr + 125911 126078  168  1  0   37   84   168 0.172  10.36
 6.03 Intr + 127859 128015  157  2  1   78   64   195 0.990  16.11
 6.04 Intr + 129146 129335  190  1  1   66   86   296 0.999  26.36
 6.05 Intr + 131196 131339  144  1  0  101   38   363 0.999  32.85
 6.06 Intr + 132332 132450  119  0  2   85   99    86 0.877   9.58
 6.07 Intr + 132733 132875  143  0  2  106   64   275 0.826  26.05
 6.08 Intr + 135298 135401  104  1  2  108   71   171 0.948  17.22
 6.09 Intr + 135992 136195  204  0  0   81   89   170 0.996  15.57
 6.10 Intr + 137282 137372   91  0  1  112   47   156 0.980  12.95
 6.11 Intr + 141230 141337  108  2  0   98   84    80 0.986   8.00
 6.12 Intr + 141644 141761  118  2  1  122   94    20 0.974   6.47
 6.13 Term + 145043 145132   90  1  0   96   48   179 0.994  12.42
 6.14 PlyA + 145567 145572    6                                1.05

 7.10 PlyA - 145965 145960    6                                1.05
 7.09 Term - 156743 156680   64  1  1  122   42    59 0.906   2.06
 7.08 Intr - 160861 160723  139  2  1   98   96   128 0.962  14.12
 7.07 Intr - 165208 164983  226  1  1  126   72   417 0.998  41.46
 7.06 Intr - 168287 168124  164  0  2   91   36   184 0.950  13.19
 7.05 Intr - 176409 176251  159  1  0    2   92    86 0.014   0.36
 7.04 Intr - 192204 192156   49  0  1  108   83    35 0.048   3.25
 7.03 Intr - 194549 194471   79  1  1   70   87    44 0.088   2.05
 7.02 Intr - 200835 200744   92  0  2   53   73    66 0.007   0.39
 7.01 Init - 223124 223065   60  2  0   71  110     6 0.143   2.35
 7.00 Prom - 235495 235456   40                               -2.26

 8.02 PlyA - 237059 237054    6                                1.05
 8.01 Term - 244287 244111  177  0  0  100   55   106 0.918   6.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_1|193_aa
MGLFCFGWQQTVCSSQQSSQQSSHDDDSSRFLSPRAREERKLMVLTLTKRKSQLAYRRGS
AFGLALFIVFINKLGEKAEVQLINLDRRIWRHGTSKEQRLIILVKMLNQEMLQRFSHLNL
CPEKLSRFLPGSFVPILAHSQSPKLFMVDGDVVLHQLALESMRNRENVVRSMLPLVVPVT
GPPSNPQNLRNWL

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_1|582_bp
atgggcctcttctgttttggctggcaacagacagtctgcagcagccaacagagcagtcag
cagagcagccacgatgatgattctagcaggttcctgagtcctcgagcgcgggaagaaaga
aagttgatggtattgacgttaaccaaaaggaagtctcagctggcctaccgcaggggctct
gcttttggtctagctctgtttatcgtttttatcaacaagctgggtgaaaaagcagaagtt
cagcttatcaatttagaccgacgtatatggagacatggcacttccaaggagcaaagacta
atcattcttgtcaagatgcttaatcaggaaatgttacaaaggttctcccatctaaacctc
tgcccagaaaaactctctcgatttcttcccgggtcctttgttcccatcctggcccactcg
cagtcccctaagctctttatggtggatggggatgttgttttgcatcaactcgctttggag
agtatgaggaacagagagaatgtggtcagaagtatgcttcctctggtggtccctgtgacc
ggccccccctccaacccccaaaacctcagaaattggctctag

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_2|431_aa
MAEGLFSVAAPRLPGCSLLPQRHSYFLPLSFLKGQHKLWGDQLIQEQSSNDDDNNTTKQI
MFIAAYFKPDSATKVVNRFGHLASIKSPCWKSREYPEYQYQSISISTLSVPEYQRGESSL
LSPRSLSPLKEASGSCTCHPAGVPDHWYGKVFQAKSAVSPPDEKPIHLPVQSPYWVKKQR
ELKAGLGGKGQILQIQPGAGAACPSCGCRCQFLMRMGEDDKHPSGCYAERMTLRAKHLPP
PSPENQLVDNKPPTTSRASSLEALQGPGEDRQQRSHQRANCLQQFGIQTQLQMPAFMVAT
AHCQGQDEGGMWEEASWQDFQVLYGLEQLIIKEKSNGPWPVMKKDPEAKAPRWEAHNKED
EAVPACDVSCWCWGKGRGREHCSQRVCNLALSPWIWRCSNTEVFVVPEHALLSPASMLMY
ILFPGLQMLSM

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_2|1296_bp
atggccgagggcctcttcagtgtagctgctcccagacttccaggatgctcgcttctcccg
cagaggcactcatactttctgcccttgtccttcttgaaaggacagcacaaactctggggt
gaccagctgatccaagagcagagtagtaatgatgacgacaataacacaacaaagcaaata
atgtttattgctgcttacttcaagccagacagtgctaccaaggtagtcaatagatttggt
catcttgcttctatcaagtctccttgttggaagagtagggagtaccctgagtaccagtac
cagagtatcagcattagtaccctatcagtacctgagtaccagagaggagaaagctccctc
ctcagtccaaggagcctttctcctctgaaggaggcctcaggcagctgcacatgccaccct
gcaggggtaccggatcactggtatggcaaagttttccaagcaaaatctgcagtttctcca
ccagatgaaaagccaattcacctgcccgtgcaatctccttattgggtcaaaaaacagaga
gagctgaaagctgggcttggaggcaaaggacagattctgcaaatccagcctggggctgga
gccgcctgcccgagctgtggctgccgctgccagtttctgatgaggatgggtgaggatgac
aagcacccatcaggttgctatgcggagcggatgacactaagagcaaagcacttgccacca
ccatcccctgagaaccagctggttgataacaaaccacctacaacctcaagagctagttca
ctagaggctttgcaaggccctggtgaagacagacagcagcgttcccaccagagagcaaac
tgcttgcagcagtttgggatccaaacacagttgcagatgcctgctttcatggtcgccact
gcccactgtcaaggtcaggatgaggggggcatgtgggaagaggccagctggcaggatttc
caagttctttatggactggagcaacttatcataaaggagaaaagtaatggtccctggcca
gttatgaagaaggacccggaggccaaggcacctcgttgggaggcacacaacaaggaggat
gaagctgtacctgcatgtgatgtcagctgctggtgctgggggaaggggaggggcagggag
cactgctcccagagggtttgcaacttggctctctccccatggatatggcgctgcagcaac
actgaagtatttgtggttcccgaacatgccttgctttctcccgcctccatgcttatgtac
atcttgttccctggcttgcaaatgctgtccatgtga

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_3|184_aa
MKDLQTPGGSALCTPHTTQQGSPSGPTCAPAPAIGAYLRLCPSHRGLPGPLPQPSGPICA
SASAIGVYLRPCPSHRGLFVPLPQPSGSTCAPVPAIGAYLCPCPSHRGLPAPLPQPSGPT
CASAPAIGAYLRLCLSHWDTVAATPLCLAFFLILIPTVVLQAQGRPCQQASSLISYPCIF
QNPE

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_3|555_bp
atgaaggacctacaaactcctggaggctcagccctctgcacgccccacacgacccagcag
ggctctccatcggggcctacctgcgcccctgccccagccatcggggcctacctgcgcctc
tgccccagccatcggggcctacctgggcctctgccccagccatcggggcctatctgcgcc
tctgcctcagctatcggggtctacctgcgcccctgccccagccatcggggcctctttgtg
cctctgcctcagccatcggggtctacctgcgcccctgtcccagccatcggggcctacctg
tgcccctgccccagccatcggggcctacctgcgcccctgccccagccatcggggcctacc
tgcgcctctgccccagccatcggggcctacctgcgcctctgcctcagccattgggacacg
gtggccgctacaccactttgcttggctttcttcctcatccttatccccactgtggtcctg
caggcacaaggcagaccttgtcagcaagccagctccctgatcagctacccgtgcattttt
cagaatcctgagtag

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_4|366_aa
MDGCMSRCRLPVGSCVCVHMGTAALPWPTAVLRTASNSNRSTPACSPILRKRSRSPTPQN
QDGDTMVEKGSDHSSDKSPSTPEQGVQRSCSSQSGRSGGKNSKVSGTPLRRYLLVRGCGE
RYWGGEPENICEGPHVMNSPDGSETVIIQAFGTHFRECRTTGIECASGQSSGTPAHSGQA
RGLLWTWLEGGPIFPVAPQWQQKSQSWYNNLLVHPGACLPDLQLALGTCHAICILWTLGV
TVCTDQVEVLSPTYKQRNEDFRKLFKQLPDTERLIVDYSCALQRDILLQGRLYLSENWIC
FYSNIFRWETLLTVRLKDICSMTKEKTARLIPNAIQVCTDSEKIWILGQLLNSTQLYHGY
LGLENM

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_4|1101_bp
atggacggatgcatgtctagatgtcgattgcccgttggctcgtgcgtctgtgtccacatg
ggcactgctgccctcccgtggccaacggcagtattacgcactgccagtaactccaaccgc
agcacgccggcctgctcgcccatcctccggaagcggtctcgctcgccaaccccgcagaac
caggacggagacaccatggtggagaagggctcagatcactcctcggacaagtccccgtcc
acaccggagcagggcgtgcagcgcagctgctcctcccagtccggccggagcggcggcaag
aattccaaggtgagcgggaccccgttgaggcggtacctccttgtcaggggctgcggggag
cgatattggggtggtgagccggagaacatctgcgagggtcctcacgtgatgaacagccct
gatggctcagagacagtcattatccaagcctttgggactcatttccgggaatgcagaacc
acggggattgaatgtgcctctgggcagagttcagggacccctgcccacagtggacaggct
cgggggctgctttggacgtggctggaggggggacccatttttcctgtggcccctcagtgg
cagcagaaaagccagagttggtataataatctgcttgtccacccaggggcctgcctcccg
gacctccagctggccctgggcacctgccatgccatctgcatcctctggacccttggggtg
actgtctgcactgaccaggtggaggtgttaagccccacctacaagcagagaaatgaagac
ttcagaaagctctttaagcagcttccagacacggagcgcctcattgttgattactcatgt
gcactccaaagagacattctccttcagggccgactctacctctctgaaaattggatctgc
ttctacagcaacatcttccgctgggaaactctgctgacagtccgtttgaaagacatctgt
tccatgactaaagaaaaaacagctcgcctcattcccaatgccatccaagtttgcactgat
tcagaaaagatttggatcctggggcagctgctcaactctacccaactctaccatggctac
ctgggcttggagaacatgtga

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_5|311_aa
METILEQQQCYHEEKEWLMDVMAKEMLTKKSMLWDQINSDHCTRATQDRYMEVSGNPRDL
YDDKDGLRKEELGAISGPKEFSDFCNRLKQIKEFHRKHPNEIYVAMSVEFEELLKARENP
SEEAQNSVEFTDEEGYGRYLDLHDCCLKYINLKASEKLDYITYLSILDQLFDIPKDRRNA
EHKRYLEMLLEYLQDYTDRVKPLQDQNELSGKIQAEFEKKWENGIFPGWPKETSSALTQA
GAHLDLSAFSSWEELASLGLDRLKSALLALGLKCGRIPEERAQRLFSTKGKSLESLDTSL
FAKNPKSKGTK

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_5|936_bp
atggagacaatactggagcagcagcagtgctatcatgaggagaaggaatggctcatggat
gtcatggccaaagagatgctcactaaaaagtccatgctctgggaccagatcaattctgat
cactgcactcgggccacgcaagataggtatatggaggtcagtgggaacccaagggatttg
tatgatgataaggatggattacgaaaggaggagctcggtgccatttcaggacccaaggaa
ttttctgatttctgtaacagactcaagcaaataaaggaattccaccggaagcacccaaat
gagatctatgtggcaatgtcagtggaatttgaggagctcctgaaggctcgagagaatcca
agtgaagaggcacaaaactcggtggagttcacagatgaagagggatatggtcgttacctc
gatctccatgactgttgcctcaagtacattaacctgaaggcatctgagaagctggattat
atcacatacctgtccatcttagaccaattatttgacattcctaaagacaggaggaatgca
gagcataagagatacctagagatgctgcttgagtaccttcaggattacacagatagagtg
aagcctctccaagatcagaatgaactttctgggaagattcaggctgaatttgagaagaaa
tgggagaatgggatctttcctggatggccgaaagagacaagcagtgctctgacgcaggct
ggagcccatcttgacctctctgcattctcctcctgggaggagttggcctctctgggtttg
gacagattgaaatccgctctcttagctttaggactgaaatgtggcaggatcccagaagag
cgagcccagagactattcagcaccaaaggaaagtccctagagtcacttgatacctctttg
tttgccaaaaatcccaagtcaaagggcaccaagtaa

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_6|560_aa
MGLCHSVLDLFHKSKRVGGPEGARLSRSFSPLKPLCPKELWHFVHQCYGNELGLTSDDED
YVPPDDDFNTMGYCEEIPVEENEVNDSSSKSSIETKPDASPQLPKKSITNSTLTSTGSSE
APVSFDGLPLEEEALEGDGSLEKELAIDNIMGEKIEMIAPVNSPSLDFNDNEDIPTELSD
SSDTHDEGEVQAFYEDLSGRQYVNEVFNFSVDKLYDLLFTNSPFQRDFMEQRRFSDIIFH
PWKKEENGNQSRVILYTITLTNPLAPKTATVRETQTMYKASQESECYVIDAEVLTHDVPY
HDYFYTINRYTLTRVARNKSRLRVSTELRYRKQPWGLVKTFIEKNFWSGLEDYFRHLESE
LAKTESTYLAEMHRQSPKEKASKTTTVRRRKRPHAHLRVPHLEEVMSPVTTPTDEDVGHR
IKHVAGSTQTRHIPEDTPNGFHLQSVSKLLLVISCVLVLLVILNMMLFYKLWMLEYTTQT
LTAWQGLRLQERLPQSQTEWAQLLESQQKYHDTELQKWREIIKSSVMLLDQMKDSLINLQ
NGIRSRDYTSESEEKRNRYH

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_6|1683_bp
atgggtttatgccactcagtgctggaccttttccataagagcaaaagagtcgggggacct
gagggagcaaggctcagtcgttccttttctcccctgaagcctctgtgtcccaaggagctc
tggcactttgttcaccagtgctatgggaacgaattgggcctgaccagtgatgacgaggac
tacgtgccccctgacgacgacttcaacacaatgggatactgtgaagagatccctgtggaa
gagaatgaagtgaatgacagctcatccaagagcagcatagagaccaagccagatgccagt
ccacagctgcccaagaaatccatcaccaacagcacactaacatccacagggagcagtgag
gcccccgtctcgtttgatgggctgcccctggaggaagaggcgctggagggagacgggtcc
ctggaaaaggagctcgccattgacaacatcatgggggagaagattgagatgatcgctcct
gtgaactccccttcactggacttcaatgacaatgaggacatccccactgagctcagtgac
tcttccgacacacacgatgaaggagaggtccaggccttctatgaggacctgagtggccgg
cagtacgtgaatgaagtcttcaacttcagcgtggacaagctctatgacctcctcttcacc
aactcgcccttccagcgggatttcatggagcagcggcgcttctctgatatcatcttccat
ccatggaaaaaggaggagaatggaaaccagagccgagtgattctttacaccatcaccctt
accaaccctctggctcccaaaactgccactgtcagggagacacagaccatgtacaaggcg
agccaggagagtgaatgttacgtgatagatgccgaagtcctcacccacgacgtgccctac
catgactacttctacacaatcaatcgctacacgctcacccgtgtggctcggaacaagagc
cgactcagggtctccacagagctgcgctatcgaaaacagccctgggggttagtgaaaacg
ttcatcgagaagaacttctggagtgggctggaggactacttccgccatttagagagcgag
ctggccaaaacggagagcacttatttggctgagatgcacagacaatctcccaaagagaag
gccagcaagactacaacggtgcggaggaggaagcgtccccatgcccacctgcgagtccct
cacctggaagaggtgatgagcccggtcaccacgcccacagatgaggatgtgggccacagg
atcaaacatgtggcaggttccacacagacgcggcatatcccggaggacacccccaacggt
ttccacctgcagagcgtgtccaagctgctgctggttatcagctgtgttctggtgctgctg
gtcatccttaacatgatgctcttctacaaactctggatgttggaatacaccacgcagacc
ctcactgcctggcagggtctaaggctccaagaaaggttaccccagtctcagacagaatgg
gcccagctcttagagtcccaacaaaagtaccacgatactgagctccaaaaatggagggaa
atcatcaaatcctcagtgatgctccttgaccagatgaaggactcgctcatcaaccttcag
aacggcatcaggtcccgcgactacacgtcggaaagtgaagaaaagaggaatcgctatcat
tga

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_7|343_aa
MPSLITLIQHTVGSSGQGNQPGAAVGHRGPQSGGQAECRAEEGQNHLRPTRGSQQADKNS
NSQIKVKMTPSLWAEGQRTAKHPITFNPVQQNIQSERAQSLTEGISLCSLGSRQPQKMPA
FNRLFPLASLVLIYWGKYQHLAMPDAVSVCFPVCVEVPSETEAVQGNPMKLRCISCMKRE
EVEATTVVEWFYRPEGGKDFLIYEYRNGHQEVESPFQGRLQWNGSKDLQDVSITVLNVTL
NDSGLYTCNVSREFEFEAHRPFVKTTRLIPLRVTEEAGEDFTSVVSEIMMYILLVFLTLW
LLIEMIYCYRKVSKAEEAAQENASDYLAIPSENKENSAVPVEE

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_7|1032_bp
atgccctctctcatcactcttattcaacatactgttggaagttctggccagggcaatcag
cctggagctgctgttggtcatcggggcccacagtctggaggacaagctgaatgcagagca
gaggagggccagaaccacctaagacccacaagagggagccaacaggctgacaagaacagc
aattcccagatcaaggtcaaaatgacaccatcgctgtgggcagaagggcagagaactgcc
aaacatccaataactttcaatccagtccagcaaaacatacaatctgagagggcgcagtcc
ttgaccgagggaatctctctgtgtagccttggaagccgccagccccagaagatgcctgcc
ttcaatagattgtttcccctggcttctctcgtgcttatctactggggtaagtaccagcac
ctcgccatgccggatgcagtcagtgtctgcttccctgtgtgtgtggaagtgccctcggag
acggaggccgtgcagggcaaccccatgaagctgcgctgcatctcctgcatgaagagagag
gaggtggaggccaccacggtggtggaatggttctacaggcccgagggcggtaaagatttc
cttatttacgagtatcggaatggccaccaggaggtggagagcccctttcaggggcgcctg
cagtggaatggcagcaaggacctgcaggacgtgtccatcactgtgctcaacgtcactctg
aacgactctggcctctacacctgcaatgtgtcccgggagtttgagtttgaggcgcatcgg
ccctttgtgaagacgacgcggctgatccccctaagagtcaccgaggaggctggagaggac
ttcacctctgtggtctcagaaatcatgatgtacatccttctggtcttcctcaccttgtgg
ctgctcatcgagatgatatattgctacagaaaggtctcaaaagccgaagaggcagcccaa
gaaaacgcgtctgactaccttgccatcccatctgagaacaaggagaactctgcggtacca
gtggaggaatag

>gi568815587f:123477464_123722592|GENSCAN_predicted_peptide_8|58_aa
LYISPRQVQGTTNEKISGRGLQQLLGAWVHLRNLGNRSTEKQFHGKRAVGQLNNPSKQ

>gi568815587f:123477464_123722592|GENSCAN_predicted_CDS_8|177_bp
ttgtacatctctcctagacaagtccaaggaactactaacgagaagatttcaggaagaggc
ctacagcaattgcttggtgcttgggttcatttgcggaatcttggcaacaggtctacagag
aagcagttccacggcaaaagagctgtggggcagttgaataatccatccaaacaatga