GENSCAN 1.0    Date run: 3-Nov-116    Time: 14:50:19

Sequence gi568815596f:42145504_42430204 : 284701 bp : 40.28% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    640    635    6                               1.05
 1.02 Term -   2337   2180  158  1  2   40   42   160 0.291   3.71
 1.01 Init -  15468  15222  247  0  1  114   49   168 0.730  13.30
 1.00 Prom -  19121  19082   40                              -5.85

 2.00 Prom +  19698  19737   40                              -8.85
 2.01 Init +  24109  24133   25  0  1   90  109    27 0.594   4.81
 2.02 Term +  24337  24881  545  2  2   91   37   156 0.606   4.24
 2.03 PlyA +  28344  28349    6                               1.05

 3.03 PlyA -  28389  28384    6                               1.05
 3.02 Term -  37059  36937  123  0  0   92   54   116 0.951   6.00
 3.01 Init -  43429  43367   63  1  0   65   44    78 0.384   0.30
 3.00 Prom -  58112  58073   40                              -2.35

 4.00 Prom +  62271  62310   40                              -4.75
 4.01 Init +  62454  62579  126  2  0   65   87   127 0.995  10.51
 4.02 Intr +  72789  72846   58  2  1  102   47     4 0.004  -4.76
 4.03 Intr + 100002 100184  183  1  0   79   86   157 0.926  13.44
 4.04 Intr + 110998 111127  130  2  1   80   75   127 0.967   9.43
 4.05 Intr + 117675 117803  129  0  0   52  108   105 0.955   7.79
 4.06 Intr + 119203 119228   26  1  2  104   94    50 0.912   3.95
 4.07 Intr + 135347 135464  118  0  1   84  -12   105 0.169  -1.30
 4.08 Intr + 139131 139200   70  0  1   81   96    56 0.897   3.97
 4.09 Intr + 140766 140876  111  2  0  100   92    49 0.976   6.16
 4.10 Intr + 142724 142819   96  1  0   85   94    53 0.959   4.89
 4.11 Intr + 149622 149756  135  2  0   79   78    22 0.512   0.14
 4.12 Intr + 149878 150013  136  0  1   76   87    74 0.956   5.32
 4.13 Intr + 151970 152137  168  0  0   57   89    59 0.525   1.90
 4.14 Intr + 155738 155889  152  0  2   64   94    62 0.984   3.36
 4.15 Intr + 157601 157726  126  1  0   88  115    93 0.999  11.96
 4.16 Intr + 158981 159048   68  1  2  107   91    39 0.926   2.88
 4.17 Intr + 170459 170547   89  2  2   77   68    62 0.639   1.80
 4.18 Intr + 180651 180749   99  1  0   44   87    92 0.033   3.86
 4.19 Intr + 183383 183513  131  0  2   41   92    98 0.777   4.99
 4.20 Term + 184231 184704  474  0  0   56   41   483 0.997  34.30
 4.21 PlyA + 187014 187019    6                               1.05

 5.00 Prom + 187476 187515   40                              -6.85
 5.01 Init + 187898 187982   85  1  1   52   47    77 0.501   1.03
 5.02 Intr + 188107 188349  243  2  0   31   26   191 0.023   3.75
 5.03 Intr + 190917 191069  153  1  0   60   37    97 0.021   0.92
 5.04 Intr + 192290 192457  168  0  0   57  -11   219 0.005   7.90
 5.05 Intr + 195973 196096  124  2  1  115   31    68 0.003   2.52
 5.06 Intr + 196806 196910  105  0  0  123   96    28 0.057   5.51
 5.07 Intr + 198410 198545  136  2  1   47    2    96 0.181  -3.45
 5.08 Intr + 200592 200793  202  2  1  111   82   115 0.924  11.14
 5.09 Term + 202822 202961  140  2  2   50   42   108 0.347  -0.46
 5.10 PlyA + 204115 204120    6                               1.05

 6.07 PlyA - 204233 204228    6                               1.05
 6.06 Term - 205856 205716  141  2  0   72   44   193 0.998  10.25
 6.05 Intr - 207840 207709  132  0  0  126   94    97 0.997  14.02
 6.04 Intr - 215769 215587  183  0  0  -18   96   160 0.041   5.26
 6.03 Intr - 223642 223365  278  2  2   80  111    69 0.119   4.71
 6.02 Intr - 239443 239359   85  1  1   60   80    99 0.008   4.77
 6.01 Init - 253492 253430   63  1  0   51   72    77 0.134   3.60
 6.00 Prom - 253741 253702   40                              -3.25

 7.00 Prom + 255608 255647   40                              -6.85
 7.01 Init + 262087 262536  450  0  0   74   69   270 0.575  17.66
 7.02 Intr + 263493 263685  193  2  1   25   89    97 0.610   1.64
 7.03 Term + 266440 266654  215  2  2   21   44   163 0.635   1.61
 7.04 PlyA + 270719 270724    6                               1.05

 8.03 PlyA - 273026 273021    6                               1.05
 8.02 Term - 279670 279430  241  0  1   86   55   194 0.917  10.41
 8.01 Init - 283888 283860   29  1  2   95   96    20 0.821   2.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_1|134_aa
MPSHGGIFASAHEVSHDPGPESPSCEMGCPYPKVRKDSRGRAQVPPRAKIVGEASFCYCQ
QLLVSVIQYPACNQAYQVGQTAGCLLPCIVSLLSGPAWHRSTVERPDPGPLEERAGIIAE
ELSQCWLAVVRSSP

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_1|405_bp
atgcccagccatggtggcatttttgcatcagcccatgaggtgagccatgacccaggtcca
gagagtccttcatgtgaaatgggctgtccttacccaaaagtgagaaaagattcccgtgga
agagcccaagttcctcccagggccaaaattgtgggggaagcatctttctgttattgccag
cagttgctggtctcagttatccagtatcctgcatgcaaccaggcttatcaagttggccag
accgcaggctgtctgctgccctgcattgtgagtttgctcagtgggccagcgtggcacaga
agcacagttgagagaccagacccggggcctctggaggaacgtgctggtatcatcgctgaa
gagctttctcaatgctggctggctgttgtcaggagcagtccttag

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_2|189_aa
MDGFAGSLDPPFLPEPLSVRGPRSPKVGTPSFSGPPIAGTPLPVHSRLPPLTPLQSWDIS
SNPGSSSGYNLPPPFHIHASLSGWGSPSPEQRLLFRGGTLIPSSGRGHPPPPSLHFRFEG
DFPVSFRVVGNFLFNPSTSLSILQWWRDLEASEHHLLNAGKEEVHLCWGTLFKSESRSVF
WFCIHAGEK

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_2|570_bp
atggacggtttcgccggcagtctcgaccctcctttcctcccggagcccctttcagtgcgg
ggccctcgttcccccaaagtgggaaccccttccttctcaggccccccgatagctggcaca
cccctccccgtacactcacggctccctccacttaccccccttcagagttgggacatttct
tcaaatccaggctcctcttcagggtacaaccttccaccacctttccacatacacgcctct
ctttcagggtggggatcgccttcacccgaacagaggctcctttttaggggagggaccctc
attccctcttcaggaagggggcatcccccgcccccaagtcttcactttagatttgaaggc
gactttccagtatccttcagagtagtaggcaacttccttttcaatccctccacttctctg
agcattttgcagtggtggagagacctagaagcatcggagcatcatttgcttaatgcagga
aaggaagaggtgcatctgtgttgggggaccctctttaagagtgaaagcagaagtgttttc
tggttctgcatacatgctggggagaagtga

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_3|61_aa
MLPTWAQAILLPWTPKALGLQFSPKAIKAASARQVCKEVILAEEQHSTGCQFSAGYQQPG
G

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_3|186_bp
atgctgcctacctgggctcaagcgatcctcctgccttggactcccaaagcgctgggatta
cagttcagtcctaaagctatcaaggcagccagtgcaaggcaagtgtgtaaggaggtcatt
ctggcagaggagcagcacagtacggggtgtcagtttagtgcagggtatcaacaaccaggt
ggatga

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_4|874_aa
MAVFGYEYATVLTLSESRTSVCESVLRDSSSHFQFHLFFGGQMGPSSCRKTSSGLPLILH
YNDSISAASTSDVQDRLSALESRVQQQEDEITVLKAALADVLRRLAISEDHVASVKKSVS
SKGQPSPRAVIPMSCITNGSGANRKPSHTSAVSIAGKETLSSAAKSIKRPSPAEKSHNSW
ENSDDSRNKLSKIPSTPKLIPKVTKTADKHKDVIINQEGEYIKMFMRGRPITMFIPSDVD
NYDDIRTELPPEKLKLDLAIHPDKIRIATGQIAGVDKDGRPLQPHVRVWDSVTLSTLQII
GLGTFERGVGCLDFSKADSGVHLCIIDDSNEHMLTVWDWQKKAKGAEIKTTNEVVLAVEF
HPTDANTIITCGKSHIFFWTWSGNSLTRKQGIFGKYEKPKFVQCLAFLGNGDVLTGDSGG
VMLIWSKTTVEPTPGKGPKGNRKSNLVNCWAHEMACVVLCHMTNPIMGKVAKNLRFSIGW
EEFGERSEINGSPDQSVYQISKQIKAHDGSVFTLCQMRNGMLLTGGGKDRKIILWDHDLN
PEREIEVPDQYGTIRAVAEGKADQFLVGTSRNFILRGTFNDGFQIEVQEPGHCADFHPSG
TVVAIGTHSGRWFVLDAETRDLVSIHTDGNEQLSVMRYSIGDIPNGCKLIRNRSDCKDID
WTTYTCVLGFQVFGVWPEGSDGTDINALVRSHNRKVIAVADDFCKVHLFQYPCSKAKAPS
HKYSAHSSHVTNVSFTHNDSHLISTGGKDMSIIQWKLVEKLSLPQNETVADTTLTKAPVS
STESVIQSNTPTPPPSQPLNETAEEESRISSSPTLLENSLEQTVEPSEDHSEEESEEGSG
DLGEPLYEEPCNEISKEQAKATLLEDQQDPSPSS

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_4|2625_bp
atggcagtatttggatatgagtatgctacagtattgacactttctgaaagtcggacatca
gtatgcgagagcgtacttcgtgatagcagctcacactttcaatttcacctcttctttggt
ggacagatgggaccatctagttgcaggaaaacaagctcagggctcccactgattctacat
tataatgatagtatttctgctgcaagtacttctgatgttcaagatcgcctgtcagctctt
gagtcacgagttcagcaacaagaagatgaaatcactgtgctaaaggcggctttggctgat
gttttgaggcgtcttgcaatctctgaagatcatgtggcctcagtgaaaaaatcagtctca
agtaaaggccaaccaagccctcgagcagttattcccatgtcctgtataaccaatggaagt
ggtgcaaacagaaaaccaagtcataccagtgctgtctcaattgcaggaaaagaaactctt
tcatctgctgctaaaagcataaaacgaccatcaccagctgaaaagtcacataattcttgg
gaaaattcagatgatagccgtaataaattgtcgaaaataccttcaacacccaaattaata
ccaaaagttaccaaaactgcagacaagcataaagatgtcatcatcaaccaagaaggagaa
tatattaaaatgtttatgcgcggtcggccaattaccatgttcattccttccgatgttgac
aactatgatgacatcagaacggaactgcctcctgagaagctcaaactggaccttgctata
catcctgacaaaattaggattgcaactggacagatagctggcgtggataaagatggaagg
cctctacaaccccacgtcagagtgtgggattctgttactctatccacactgcagattatt
ggacttggcacttttgagcgtggagtaggatgcctggatttttcaaaagcagattcaggt
gttcatttatgtattattgatgactccaatgagcatatgcttactgtatgggactggcag
aagaaagcaaaaggagcagaaataaagacaacaaatgaagttgttttggctgtggagttt
cacccaacagatgcaaataccataattacatgcggtaaatctcatattttcttctggacc
tggagcggcaattcactaacaagaaaacagggaatttttgggaaatatgaaaagccaaaa
tttgtgcagtgtttagcattcttggggaatggagatgttcttactggagactcaggtgga
gtcatgcttatatggagcaaaactactgtagagcccacacctgggaaaggacctaaagga
aacaggaaatccaatcttgtaaattgctgggcacatgaaatggcctgtgttgtactttgc
cacatgactaaccctattatgggcaaagttgctaagaatttgaggttttcaattggctgg
gaagaatttggtgagcgcagtgaaataaatggaagtccagaccagagtgtatatcaaatc
agcaaacaaatcaaagctcatgatggcagtgtgttcacactttgtcagatgagaaatggg
atgttattaactggaggagggaaagacagaaaaataattctgtgggatcatgatctgaat
cctgaaagagaaatagaggttcctgatcagtatggcacaatcagagctgtagcagaagga
aaggcagatcaatttttagtaggcacatcacgaaactttattttacgaggaacatttaat
gatggcttccaaatagaagtacaggaaccaggacactgtgcagattttcatccaagtggc
acagtggtggccataggaacgcactcaggcaggtggtttgttctggatgcagaaaccaga
gatctagtttctatccacacagacgggaatgaacagctctctgtgatgcgctactcaata
ggggacattccaaatggctgcaaactaatcaggaatcgatcggattgtaaggacattgat
tggacgacatatacctgtgtgctaggatttcaagtatttggtgtctggccagaaggatct
gatgggacagatatcaatgcactggtgcgatcccacaatagaaaggtgatagctgttgcc
gatgacttttgtaaagtccatctgtttcagtatccctgctccaaagcaaaggctcccagt
cacaagtacagtgcccacagcagccatgtcaccaatgtcagttttactcacaatgacagt
cacctgatatcaactggtggaaaagacatgagcatcattcagtggaaacttgtggaaaag
ttatctttgcctcagaatgagactgtagcggatactactctaaccaaagcccccgtctct
tccactgaaagtgtcatccaatctaatactcccacaccgcctccttctcagcccttaaat
gagacagctgaagaggaaagtagaataagcagttctcccacacttctggagaacagcctg
gaacaaactgtggagccaagtgaagaccacagcgaggaggagagtgaagagggcagcgga
gaccttggtgagcctctttatgaagagccatgcaacgagataagcaaggagcaggccaaa
gccacccttctggaggaccagcaagacccttcgccctcgtcctaa

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_5|451_aa
MDQEELPLEEYLLWAQHLPIPRCSGIQGGERHSLDIEGKDQGRFTDLLWKGDKGNRRESV
SVLVLSLGRGESTTQQCRTSVSPSLKPCWKSPFVQDAHDWSLQQPQKQNDLIEGTEKKVS
ARRKGLREEKEFLNMQSLQPGHPISCLTDHSNQALVPPRKGNMEKKMSLTTTKPFETLPA
RVHLQCSTDQSEEAVRTVGPSGAHWLSSARSATMGPSMGLLFASHVVQSSQQCSPSPPAA
VGAHILQLKEQVLTQSDGRAETRDLSSLLGKMPSETTPISAPTYSLALQAGTSFTAKSHW
MKGPRRRAKASQTLKGNLADRKSKARDKWALLYKDKCKPGTKQLTGPVEDTSPKWDNNIL
SLLELRACCQQLDNNFLPNPMRNRESWSAETELNSCIIKRVTERTGYLLTAKGKGYQLDK
GEIRQTQLQITNNGTNWYQVPPDITRTPYHL

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_5|1356_bp
atggaccaagaagagctgcccctggaggagtacctactgtgggcccaacatctcccaata
cccaggtgttcagggatacagggaggagaaagacattcattagacatagaaggaaaggac
cagggtagattcacagacttactttggaaaggagataagggaaacagaagagagtctgtg
tcagtcttggtcctcagtctgggacgaggtgagtccacgacacagcagtgccgcacatct
gtatcaccatcactgaagccctgctggaaaagtccctttgtccaagatgcccacgactgg
agtctccagcagccacagaaacagaatgatctcattgaaggcactgagaagaaagtgagt
gcccgaagaaagggcctcagggaggaaaaggaatttcttaatatgcaaagccttcagcct
ggtcatcccatctcatgcttgactgatcacagtaaccaagcccttgtgcctcctaggaaa
gggaatatggagaagaaaatgagccttactaccacgaagccttttgaaactctccctgca
cgtgtgcacctgcagtgctccactgaccagtcagaggaggcagtgcggaccgtgggacca
agcggtgcccactggctttctagcgcccgctctgctaccatgggtccctccatgggattg
ctttttgcaagtcatgtggttcaatctagccagcagtgctcccccagccctcctgctgcc
gtgggcgcccacatccttcagctcaaagaacaggtcctgactcagagtgatgggagagct
gagacgagggacttgagctctctgcttggcaagatgccctcagaaaccaccccaatttct
gcacccacttattccttggcccttcaggcaggcaccagcttcacagcaaagtcacactgg
atgaaggggccgagaaggagagcgaaagctagtcagaccctgaaagggaatctggctgac
agaaagagcaaagccagggataagtgggcactgctttacaaggacaagtgtaaacctggc
accaaacagctgactggcccagtggaggacacctcacccaaatgggacaataacattctc
tctctcctggaattaagggcatgttgccagcagctggacaacaatttccttccaaatcca
atgagaaacagagaaagctggtctgcagagacagaattaaacagctgcataatcaagaga
gtgacagaaaggacaggctacttattaactgcaaaggggaaagggtatcaacttgacaag
ggagaaatccggcagacacaacttcaaatcaccaataatgggacaaactggtatcaagtg
ccacctgacataacgaggacaccatatcacctataa

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_6|293_aa
MNRQFTKDEPYKAMNIEKMLRLPCSSNSLADDHSPETGKAGTLQSWGKAAASRCPKQEDI
TVIGHREHGWGVCQAGQCREAKNSGRKHNKVRPAQNGKHSSRATTHFPSGSGGESSQGGR
EMDGNCWGCEKPGALTGVVIDLSQHRVLALSLGTTAGEAGSFSGAVALAADAGSRTLGVM
YYKFSGFTQKLAGAWASEAYSPQGLKPVVSTEAPPIIFATPTKLTSDSTVYDYAGKNKVP
ELQKFFQKADGVPVYLKRGLPDQMLYRTTMALTVGGTIYCLIALYMASQPKNK

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_6|882_bp
atgaacaggcagttcacaaaagatgagccttacaaggcaatgaacattgaaaagatgctg
aggcttccttgcagcagcaacagccttgctgacgaccacagtccagaaactggcaaggct
ggcaccttgcagagctgggggaaggcagcagcaagcagatgcccaaaacaggaggacatc
acggtgataggccacagagaacatggctggggcgtgtgccaggcaggacaatgcagggaa
gccaaaaacagtgggagaaaacacaataaagtgagacccgcccagaatggtaaacacagc
agcagagctaccacacacttcccttctgggagcggtggggaaagcagccagggaggtcgg
gagatggacggaaactgctggggatgtgagaagcctggggcactaactggagttgttatt
gacttgtcccagcaccgggtccttgcgctgagtctcgggaccacagccggggaggcgggg
tccttctctggggcggtcgcgttggcagcggatgcgggaagccggactctgggcgtcatg
tactacaagtttagtggcttcacgcagaagttggcaggagcatgggcttcggaggcctat
agcccgcagggattaaagcctgtggtttccacagaagcaccacctatcatatttgccaca
ccaactaaactgacctccgattccacagtgtatgattatgctgggaaaaacaaagttcca
gagctacaaaagtttttccagaaagctgatggtgtgcccgtctacctgaaacgaggcctg
cctgaccaaatgctttaccggaccaccatggcgctgactgtgggagggaccatctactgc
ctgatcgccctctacatggcttcgcagcccaaaaacaaatga

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_7|285_aa
MAVDPGLLLLQGAPLGGQLQLPKSQLQTRASLCSCRCQEQAGALPSLVQLQTRASLCSCR
CQKQAGALPSLVQLQTRASMQQAGTPSPPTLGQLQPPKPWLWTKASLHSWGPRKPHLTLQ
ARKCLLPLPDFSLLLVPTPISEQSWGEDSQPHREPVPVLAPGAAHPTAAASMSDCVQWPD
PTLTHTPLAIPCLTCPWWCGIQAGSKNQMQPARPTNSRYDLALMNQIHTYIQEQEDGKDV
EPLVLLLWWLASQVELTTPGLNIPVSSHYLCGHEKIVVAAAVTAS

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_7|858_bp
atggctgtggacccaggcctcctgctcctccagggagccccgcttggggggcagctgcag
ctgcccaaatcacagctgcagacccgggcctccctgtgctcttgcaggtgccaagagcaa
gcaggagccctgccctccctggtgcagcttcagacccgggcctccctgtgctcttgcagg
tgccaaaagcaagcaggagccctgccctccctggtgcagcttcagacccgggcctccatg
cagcaggcaggaaccccatccccgccaactctggggcagctgcagccacccaaaccgtgg
ctgtggaccaaggcatccctacactcttgggggcccagaaagccccacctgaccttgcag
gctcggaaatgcctgctcccactgcctgacttctccctgctgttggtgcccacgccgatc
tcagaacaaagctggggtgaagacagccagcctcacagagagccagtgcctgtgctggca
cctggagctgcccaccccactgcagcagccagcatgtctgactgtgtgcagtggccagac
cccacactcactcacacaccccttgccattccatgtctgacttgcccttggtggtgtggg
atccaggctggcagcaagaaccaaatgcagcctgccaggccaacaaattctagatatgat
ttagctttgatgaatcaaatacatacatacatacaagagcaggaagatggaaaagacgtg
gagccccttgtcctcctgctgtggtggctggcaagccaggttgaactgactacccctgga
ctcaatattccagtgtctagtcactacctttgtggacatgagaagattgtggtggcagca
gcagtgacagcttcctga

>gi568815596f:42145504_42430204|GENSCAN_predicted_peptide_8|89_aa
MADLWQSLEGGPPVEQLPPDPLTRWCFHPAGSTLCGPANSMAVASPGSRPAAPGGGFLRT
EALVLIVAAGPVDGLNCENHPFRGGCKDF

>gi568815596f:42145504_42430204|GENSCAN_predicted_CDS_8|270_bp
atggctgatttgtggcagtctctagaaggtggcccaccggtggagcagctgcccccagac
cccttgacccggtggtgcttccaccctgccggaagcaccttgtgtggccccgccaacagc
atggcggttgcatccccaggaagcaggcccgcagcgccaggagggggtttcctgaggaca
gaggcccttgtcctgattgtcgcagcaggccctgtcgatggacttaactgtgaaaatcac
cctttcaggggtggatgcaaggacttctga