GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:49:26 Sequence gi568815594f:151177173_151391450 : 214278 bp : 40.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12679 12889 211 0 1 68 57 170 0.564 10.90 1.02 Intr + 13360 13562 203 2 2 38 23 149 0.033 1.48 1.03 Intr + 13718 14006 289 1 1 -15 31 266 0.031 6.50 1.04 Term + 25956 26128 173 1 2 107 38 162 0.953 10.11 1.05 PlyA + 26235 26240 6 1.05 2.00 Prom + 30903 30942 40 -7.45 2.01 Sngl + 35354 35974 621 1 0 56 49 288 0.340 17.65 2.02 PlyA + 36016 36021 6 1.05 3.00 Prom + 38076 38115 40 -4.95 3.01 Init + 44252 44424 173 1 2 61 50 147 0.624 7.16 3.02 Term + 53009 53087 79 2 1 65 41 80 0.189 -2.74 3.03 PlyA + 53969 53974 6 1.05 4.03 PlyA - 54338 54333 6 1.05 4.02 Term - 59229 58918 312 0 0 38 42 263 0.915 10.72 4.01 Init - 60060 60040 21 0 0 60 101 28 0.210 -0.11 4.00 Prom - 69060 69021 40 -3.75 5.00 Prom + 69945 69984 40 -4.65 5.01 Init + 77345 77465 121 1 1 68 74 77 0.879 4.70 5.02 Intr + 82344 82464 121 1 1 61 86 96 0.546 5.33 5.03 Intr + 84187 84371 185 1 2 56 54 107 0.966 2.61 5.04 Term + 85145 85401 257 0 2 90 46 147 0.478 5.46 5.05 PlyA + 85981 85986 6 1.05 6.00 Prom + 96116 96155 40 -6.55 6.01 Init + 100001 100052 52 1 1 86 99 103 0.914 10.72 6.02 Intr + 102624 102786 163 1 1 95 84 37 0.804 2.11 6.03 Intr + 103809 104114 306 0 0 18 61 187 0.599 3.64 6.04 Intr + 104976 105241 266 0 2 119 94 110 0.989 10.93 6.05 Intr + 105945 106114 170 1 2 141 77 167 0.936 19.64 6.06 Intr + 113946 114005 60 2 0 124 60 53 0.497 4.11 6.07 Intr + 116540 116571 32 1 2 92 61 25 0.076 -3.79 6.08 Intr + 120599 120750 152 2 2 79 74 50 0.204 1.59 6.09 Intr + 130176 130368 193 1 1 97 89 78 0.325 6.43 6.10 Term + 139572 139740 169 0 1 34 47 101 0.046 -3.03 6.11 PlyA + 140499 140504 6 1.05 7.04 PlyA - 141888 141883 6 1.05 7.03 Term - 146419 146394 26 2 2 83 48 62 0.484 -0.89 7.02 Intr - 147853 147686 168 2 0 21 71 171 0.734 7.70 7.01 Init - 148180 148069 112 1 1 67 94 33 0.227 2.23 7.00 Prom - 153360 153321 40 -1.25 8.00 Prom + 157236 157275 40 -3.15 8.01 Init + 161304 161542 239 2 2 51 67 148 0.248 6.53 8.02 Intr + 163956 164085 130 0 1 92 69 12 0.103 -0.52 8.03 Intr + 175634 175823 190 1 1 71 115 75 0.962 6.74 8.04 Intr + 175935 176060 126 1 0 56 73 129 0.948 7.93 8.05 Intr + 184883 184990 108 0 0 33 105 107 0.086 6.24 8.06 Term + 198359 198468 110 0 2 108 48 30 0.195 -1.31 8.07 PlyA + 199981 199986 6 1.05 9.02 PlyA - 204963 204958 6 1.05 9.01 Sngl - 209807 209547 261 2 0 85 42 210 0.501 11.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 13360 13766 407 2 2 38 48 267 0.909 12.06 S.002 Term + 176141 176256 116 0 2 45 48 124 0.850 1.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_1|291_aa MWKRLWNWVTGRGWNSLEGSEEDRKMWDSLELPRDLLNGFAQNADNDMDNENQAEVFSDG DEELVGNLSKATPAVAERGQRRAWAVASEGASPKTWQLPCGVERASAQKSRIEVWEPPPR FQGIYGNTWMSRQKFAAGLALCSWKSRRHSTPAVKAAGGEAIPCKTTGVELPKTMGTHLL HQHDLDRFGPGVKVDRFGALRFDLPCWISDLHGDCSPFVLANFSHLEWLYLPNACHSPNT KGLKKTGGIPMTGPPAIAASVPLTPSSPPALSQEESADTEQSLAVGIGATG >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_1|876_bp atgtggaagcgactttggaactgggtaacaggcaggggttggaatagtttggagggttca gaagaagacaggaaaatgtgggacagtttggaacttcctagagacttgctgaatggcttt gcccaaaatgctgataatgatatggataatgaaaaccaggctgaggtgttctcagatgga gatgaggaacttgttgggaacttgagcaaagccactccagctgtggctgaaaggggacaa cgtagagcttgggccgtggcttcagagggtgcaagccccaagacttggcagcttccatgt ggtgttgagcgtgcaagtgcacagaagtcaagaattgaggtttgggaacctccacctaga tttcaggggatttatggaaacacctggatgtccaggcagaagtttgctgcagggcttgca ctgtgctcctggaaaagccgcagacactcaacgccagccgtgaaagcagctggtggggag gctataccctgcaaaaccacaggggtggagctgcccaagaccatgggaacccacctcttg catcagcatgacctggatcgttttggacctggcgtcaaagtagatcgttttggagcttta agatttgacctgccctgctggatttcagacttgcatggggactgtagcccctttgttttg gccaatttctcccatttggaatggctgtatttacccaatgcctgccacagtcctaacacc aaaggtctgaaaaagacaggtggaattcccatgacaggacccccagcaatcgcagcatcg gtgccactgaccccatcatcgccaccagccctttcccaggaagagtcagctgacacggaa caatccttggcagtggggattggggctactggatga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_2|206_aa MTAYLFTAWFTKYFKPTLETYCSGKKIPFKILLLIDNAPGHPRALMQMYREIHIVFMSAD TTSNLQPMDQGVILTLKSYYLRNTFCKAIVAIDSNSSDELGQSKWKTWKGFTIVDAIKNI CDSWEEGNVTTLTGVWKKSIPTPMDDFEGFKTPVEEVTADIVETARELELEVDPKDVTTL MYLMIKLQWMRSSFFWMSKESRIYSW >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_2|621_bp atgacagcatatctgtttacagcatggtttactaaatattttaagcccactcttgagacc tattgctcaggaaaaaagattcctttcaaaatattactgctcattgacaatgcacctggt cacccaagagctctgatgcagatgtacagggagattcatattgttttcatgtctgctgac acaacatccaatctgcaacccatggatcaaggagttattttgactttgaagtcttattat ttaagaaatacattttgtaaggccatagttgccatagatagtaattcctctgatgaactg gggcaaagtaaatggaaaacctggaaaggattcaccattgtagatgccattaagaatatc tgtgattcatgggaggagggcaacgtaacaacattaacaggagtttggaagaagtcgatt ccaacccccatggatgactttgaggggttcaagactcccgtggaggaagtaactgcagat atagtggaaacagcaagagaactagaattagaagtggaccctaaagatgtgaccacattg atgtatctgatgataaagcttcagtggatgaggagttccttcttctggatgagcaaagaa agtagaatttactcctggtga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_3|83_aa MAGEQVTANVSRYPGQKTMSFPEKTFLLSYRASLLAVVTHRSNNSRGRAFESQVLPDLCY ATYVEMGPCSGTDSFWVRLRRSY >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_3|252_bp atggcaggagaacaggtcaccgccaatgtcagcagataccctggacagaaaacgatgtcc tttcctgaaaaaacatttctcctttcttatagggcatcactccttgctgttgtaacacac agatccaataatagtcgtgggcgagcttttgagagtcaggttcttcccgatttgtgttac gccacttacgtagaaatgggcccctgttctggcacagacagtttttgggtacggctacgc agatcctactga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_4|110_aa MKPQTLAHLARALRQPRGSSSPNQAQQVPASCAKCGARGTHAHLEPPPARKPRAQPRLPL APLHELREPALASASPREGPPQRSGRLKGSSSAARADAEAEEAPRVNEGC >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_4|333_bp atgaagccacagaccctcgcgcacctagcccgggcactccggcagcccagagggagctca tcccccaatcaagcccagcaggtgccagccagctgtgccaagtgcggggcccgcggaacc cacgcacacctggaacccccgccagcccgcaagccccgggcgcagccccggctcccgctt gcacctctccacgagctgagggagccggctctggcctcggccagccccagagaggggccc ccacagcgcagcggcaggctgaagggctcctccagcgcggccagagcggacgccgaggcc gaggaggcaccaagagtgaacgagggctgctag >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_5|227_aa MTLKEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGGWGASVGQWPWQVSIRQGLI HVCSDTLISEEWVLTVAICFPLSPHPDFQANTSSAIAVVELPSPVSVSPVVLLICLPSSE VYLKKNTTSCWVTGWGYTGIFQYIKRSYTLKELKVPLIDLQTCGDHYQNEILLHGVELII SEAMICSKLPVGQMDQCTVRIHPSGTFHRPCLPQCASSTSHIFRDPG >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_5|684_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg ggctggggggctagtgtggggcagtggccctggcaggtcagcatccgccagggcttgatt cacgtctgctcagataccctcatctcagaggagtgggtgctgacagtggcgatctgcttc ccattatccccccaccctgatttccaagcaaacacatctagtgccatcgctgtggtagaa ctgccctccccagtttctgttagccctgttgtcctgctcatctgccttccctcatctgaa gtctacctgaagaagaatacaacctcctgctgggtgactggatggggctatactggaata ttccaatatatcaagcgttcttatacactgaaggagctgaaagtgcccctcattgatctc cagacatgcggtgaccactatcaaaatgaaatcttgctgcacggagttgagctcatcatc agtgaagctatgatctgctccaagctcccagtggggcagatggatcagtgtactgtaaga atccacccctcaggcacctttcacaggccttgccttccccagtgtgcttcctccacttct catatcttcagagaccctgggtga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_6|520_aa MGPAGCAFTLLLLLGISVCGQPVYSSRVVGGQDAAAGRWPWQVSLHFDHNFICGGSLVSE RLILTAAHCIQPSHRRICSTKIKGEKQENTGSRQKGNKTQKRIPKKVIKDKESAKMMAVQ QPRLEQESRGLQKKDSIDRMPDKFEQNERRFTLLVESLGNETGDGYKDKNKKPKTWTTFS YTVWLGSITVGDSRKRVKYYVSKIVIHPKYQDTTADVALLKLSSQVTFTSAILPICLPSV TKQLAIPPFCWVTGWGKVKESSDRDYHSALQEAEVPIIDRQACEQLYNPIGIFLPALEPV IKEDKICAGDTQNMKDSCKGDSGGPLSCHIDGVWIQTGVHFKGILLLLTEICEYVTTRSK GDFAHMIKYPEMGTSSVLSGGTNVINHMASYKSKIGGSGPVSPFLPEQHEPSLAKYPNLH LSFKSQINQDPRWRSPREPKKPLASGRQPLGSAFPEFLVLDPSGRTSLQFLKVRPTAAEE QGLVPAASKQGGQASARAFSFLPCVKLSKLPPQPLPCPMP >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_6|1563_bp atgggccctgctggctgtgccttcacgctgctccttctgctggggatctcagtgtgtggg caacctgtatactccagccgcgttgtaggtggccaggatgctgctgcagggcgctggcct tggcaggtcagcctacactttgaccacaactttatctgtggaggttccctcgtcagtgag aggttgatactgacagcagcacactgcatacaaccaagccataggagaatctgctccacc aaaataaagggggaaaaacaggaaaacacaggatccagacaaaaggggaataaaacacag aagagaatccctaagaaagtcataaaggataaagaaagtgccaagatgatggctgtgcag cagcccagattagaacaagagagcagagggctccaaaaaaaagatagtattgatagaatg cctgacaagtttgaacaaaatgagaggagatttacacttctggtggagagcctggggaat gagactggtgatgggtataaagataaaaacaaaaaaccaaagacctggactactttttca tatactgtgtggctaggatcgattacagtaggtgactcaaggaaacgtgtgaagtactac gtgtccaaaatcgtcatccatcccaagtaccaagatacaacggcagacgtcgccttgttg aaactgtcctctcaagtcaccttcacttctgccatcctgcctatttgcttgcccagtgtc acaaagcagttggcaattccacccttttgttgggtgaccggatggggaaaagttaaggaa agttcagatagagattaccattctgcccttcaggaagcagaagtacccattattgaccgc caggcttgtgaacagctctacaatcccatcggtatcttcttgccagcactggagccagtc atcaaggaagacaagatttgtgctggtgatactcaaaacatgaaggatagttgcaagggt gattctggagggcctctgtcgtgtcacattgatggtgtatggatccagacaggagtacac ttcaaaggcatcctcctgctgctcacagaaatctgtgaatatgttactacacgtagtaaa ggggattttgcacatatgattaagtatcctgagatggggacatcatctgtattatctggt gggaccaatgtaatcaatcacatggcttcttataagagtaagataggaggatcagggcca gtgtctcccttcttacctgaacaacatgagccttctttggccaaatatccaaatctacat ttaagcttcaaaagccaaattaatcaggacccaaggtggcgcagccctcgtgagcccaaa aagccactcgcgtctgggcgccaacccttaggctctgccttcccagagttcctggtgcta gacccgtcaggcaggacaagcttacagttcctgaaggtgaggccaacagctgctgaggag caaggactcgttccagctgcttctaaacaagggggccaggcctctgccagagccttctcc ttcctgccctgtgtcaagctatccaagcttcctcctcaacctttaccctgccctatgccc tga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_7|101_aa MAEGRRREDEEEELRERRELGGQRRARGRALSGHSAAVVVQLKLKFSVHTKSRSCWLGTC GLVDDLSLDESEVNGRLGGQRHLPRGLPAPPQPGCEDDVME >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_7|306_bp atggctgagggccggcggcgggaggacgaggaggaagagctacgcgagcgccgcgaactt ggtggccagcgccgcgcccggggccgtgcgctctcgggccactcggccgcagtggtcgta cagctgaaactcaaattttccgtgcacactaaatccaggagctgctggcttgggacttgt ggcctggttgatgacttatctctggatgagtcagaggtcaatggccgtctcggtggacaa cgacaccttccgaggggtttgcctgcgccgccgcagccagggtgtgaggacgatgtaatg gaatga >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_8|300_aa MTAELNRVPKATGHKKVLGANANKSISRYPAPKSAMIEKRRKALYWEIIGVRRYQTVARG AETQRCLNWTPHRALSNKASLLFNVSTWAKIAGMKRKDGLWTNSGSLPHPVILTALNCLL ISKVHLREVFLGSISTSWLALTTPFSVSTFIVYIPLLWTNHTVSGIGSSRWVLGLADFKN EAADPRGVKPQTFAVSVTALKGGASRVVCSSRWVRGLSDFRSEAADLCNSYFRFGTYCRQ DKKSDGQSEEESCAQGPSNPAGERVSCVHADVPLNSYFLTFLLLSALTFLNVTRNKGTIS >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_8|903_bp atgacagcagaactcaacagagtacccaaagcaacaggtcacaaaaaggttcttggggcc aatgcaaataagtctattagcagatatcctgccccaaaaagtgccatgatagaaaaaagg agaaaagcgctgtactgggaaataataggggtacgcaggtatcagacagtagccagagga gctgaaacacagagatgcctgaactggacaccacacagagcactgagcaataaagcaagc cttctctttaatgtctccacctgggctaagatagcaggaatgaagagaaaagatggactt tggacaaactcaggttcacttcctcatcctgtgattttaactgctctgaactgtttattg atcagtaaagtacatctccgtgaagttttcctgggtagtatttccacctcctggttagca ttaaccactcccttctctgtctccacatttattgtgtacatacctctcttgtggaccaac catactgtgtccggaattggttcctcccggtgggttcttggtctggctgacttcaagaat gaagccgcggaccctcgcggagtgaagccgcagaccttcgcagtgagtgttacagctctt aaaggtggtgcgtccagagttgtttgttcctcccggtgggttcgtggtctttctgacttc aggagcgaagctgcagacctttgcaattcctacttccggtttgggacttactgtcgacaa gataagaaaagtgatggacaaagtgaggaggagtcatgtgctcaaggaccctccaatcca gcaggggaaagagtctcttgtgttcatgctgatgtgcctctgaatagctacttccttact tttttacttctcagtgccctcacattcttgaatgtcactaggaataagggcactatttct taa >gi568815594f:151177173_151391450|GENSCAN_predicted_peptide_9|86_aa MHSALKDKPLEFFKRKKCEHKELEQLLKATPSSNVSALKASFFVANYVAKAKNPFTIGEE LILSAAKNICHELLGEASVQMCSSFG >gi568815594f:151177173_151391450|GENSCAN_predicted_CDS_9|261_bp atgcactctgcattaaaagacaagcctttggagtttttcaaaagaaaaaaatgtgaacac aaagaactggagcaattattgaaggccaccccttcatcaaatgtgtctgcactgaaagca tcattctttgtggctaactacgttgctaaagctaagaacccctttactattggtgaagag ttgatcctgtctgctgctaagaatatttgtcatgaacttctaggagaggcttcagttcaa atgtgttcctcttttggctag