GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:03:50 Sequence gi568815588f:87764470_88065469 : 301000 bp : 37.53% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 817 812 6 1.05 1.05 Term - 7604 7495 110 0 2 105 41 85 0.343 3.19 1.04 Intr - 20201 20001 201 0 0 76 57 228 0.805 16.84 1.03 Intr - 25961 25841 121 2 1 62 86 77 0.990 4.05 1.02 Intr - 28286 28188 99 2 0 72 50 90 0.830 2.99 1.01 Init - 50130 49969 162 0 0 62 80 184 0.884 14.78 1.00 Prom - 50636 50597 40 -7.35 2.00 Prom + 51410 51449 40 -4.25 2.01 Sngl + 53191 53403 213 0 0 68 47 235 0.831 12.33 2.02 PlyA + 55229 55234 6 1.05 3.00 Prom + 77424 77463 40 -2.75 3.01 Init + 79401 79628 228 2 0 100 70 364 0.537 34.22 3.02 Term + 79667 79849 183 1 0 18 37 194 0.783 3.86 3.03 PlyA + 80322 80327 6 1.05 4.00 Prom + 80884 80923 40 -6.55 4.01 Init + 80934 80964 31 2 1 40 92 36 0.240 -0.62 4.02 Intr + 84702 84833 132 1 0 106 50 107 0.226 8.40 4.03 Intr + 95179 95260 82 2 1 21 103 57 0.084 -1.72 4.04 Intr + 98495 98644 150 2 0 22 91 132 0.335 5.26 4.05 Term + 99280 100141 862 1 1 10 42 553 0.305 34.11 4.06 PlyA + 100491 100496 6 1.05 5.02 PlyA - 101730 101725 6 1.05 5.01 Sngl - 135614 134673 942 2 0 42 47 411 0.979 28.67 5.00 Prom - 137068 137029 40 -6.15 6.02 PlyA - 137164 137159 6 1.05 6.01 Sngl - 138490 137855 636 1 0 88 44 504 0.895 41.93 6.00 Prom - 140034 139995 40 -7.65 7.00 Prom + 140649 140688 40 -4.05 7.01 Init + 150875 150997 123 1 0 56 76 80 0.059 2.59 7.02 Intr + 168582 168782 201 2 0 34 115 155 0.253 11.36 7.03 Intr + 181155 181455 301 2 1 10 110 299 0.300 19.68 7.04 Intr + 193384 193550 167 2 2 45 76 76 0.777 0.86 7.05 Intr + 196425 196649 225 2 0 125 86 116 0.978 12.36 7.06 Term + 200818 201003 186 0 0 100 38 206 0.999 13.31 7.07 PlyA + 201039 201044 6 1.05 8.00 Prom + 235522 235561 40 -3.05 8.01 Init + 235576 235819 244 0 1 74 6 180 0.048 6.34 8.02 Intr + 255944 256081 138 0 0 58 95 47 0.332 1.91 8.03 Intr + 256409 256574 166 0 1 98 76 29 0.381 0.80 8.04 Intr + 256643 256715 73 2 1 76 116 -13 0.364 -1.21 8.05 Term + 257717 257839 123 1 0 73 45 104 0.483 2.00 8.06 PlyA + 259604 259609 6 1.05 9.00 Prom + 276232 276271 40 -2.85 9.01 Init + 277360 277522 163 0 1 83 94 64 0.516 6.44 9.02 Term + 281909 282102 194 0 2 57 44 184 0.613 7.50 9.03 PlyA + 282805 282810 6 1.05 10.03 PlyA - 282986 282981 6 1.05 10.02 Term - 287173 287082 92 2 2 109 38 115 0.291 5.50 10.01 Init - 300491 300374 118 2 1 38 55 112 0.160 3.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 235576 235836 261 0 0 74 42 182 0.847 7.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_1|230_aa MVHAEAFSRPLSRNEVVGLIFRLTIFGAVTYFTIKWMVDAIDPTRKQKVEAQKQAEKLMK QIGVKNVKLSEYEMSIAAHLVDPLNMHVTWSDIAGLDDVITDLKDTVILPIKKKHLFENS RLLQPPKGVLLYGPPGCGKTLIAKATAKEAGCRFINLQPSTLTDKWYGESQKLAAAVFSL AIKLQPSIIFIDEIGGKLSQLIESNWALRLLNVIQCRVEVGYKIWIISEM >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_1|693_bp atggtacatgctgaagccttttctcgtcctttgagtcggaatgaagttgttggtttaatt ttccgtttgacaatatttggtgcagtgacatactttactatcaaatggatggtagatgca attgatccaaccagaaagcaaaaagtagaagctcagaaacaggcagaaaaactaatgaag caaattggagtgaaaaatgtgaagctctcagaatatgaaatgagtattgctgctcatctt gtagaccctcttaatatgcatgttacttggagtgatatagcaggtttagatgatgtcatt acggatctgaaagacacagtcatcttacctatcaaaaagaaacatttgtttgagaattcc aggcttctgcagcctccaaaaggtgttcttctctatgggcctccaggctgtggtaaaacg ttgattgccaaggccacagccaaagaagcaggctgtcgatttattaaccttcagccttcg acactgaccgataagtggtatggagaatctcagaaattggctgctgctgtcttctccctt gccataaagctacaaccatccatcatctttatagatgaaataggtggcaaactctcacag ttaattgaaagtaattgggctctgaggctcctgaatgtcattcagtgtagagtagaagta ggatataagatttggattatttcagaaatgtaa >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_2|70_aa MKACQSSQRGQVIINITIFCESVPGPEEPPSILHKVVAAPVRQSDLVHSSRLRLGSPGSE VFRDDLKPPQ >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_2|213_bp atgaaggcctgccaatcttcacaaagagggcaggttattattaacatcaccatcttttgt gaaagcgtgcccggccctgaagaaccaccttcgattttgcacaaggtggtggccgcgcct gtgcgacaatcagacctggttcattccagccgccttcgtctaggttcgcctggttctgaa gtcttccgggacgacctcaaacccccacagtga >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_3|136_aa MGSHVAVFDDVIKVFSDMKVCKSSTLEKVKKRKKLLLFCLSEYKKYIILAEAKKILVSNV DQTIDDPYATFVKMLTDSKKKDLVFIFWASESASIKSKMIYASSKDVIKKQLGVSTVVFL EGKLCDPLQRPSWRIY >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_3|411_bp atgggctcccatgtggctgtctttgatgatgtcatcaaggtgttcagtgacatgaaagtg tgcaagtcttcaacactagaaaaggtgaagaagcgcaagaagttgctgctcttctgcctg agtgagtacaagaagtacatcatccttgcggaggccaagaagatcctggtaagtaatgtg gaccaaaccattgatgatccctatgccacttttgtcaagatgctgacagacagcaagaag aaggacctggtgtttatcttctgggcctctgagtctgcatccattaagagcaaaatgatc tatgccagctccaaagacgtcatcaagaagcagctaggggtcagcactgtcgtcttcttg gagggcaaactttgtgatcccctccagcgcccttcctggagaatctactag >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_4|418_aa MELAGLEVALGEVGKQWSYEWQQPMAWGCTLGASQDKDAMSRKEISEMTYGSEPFTLKLY IIVPLLQASTSPYGILVLNPDSGAKGKSECSPRRGNLGVEARGEGIPLAGTVPAFPSTLS SVVTWSFSPVHSACEQPRGQRPRGAGRPAAAAAAAFLASSSSFLTVQPLPRLLLKGKVEA VGSGGSRLRRGGGGGTSRSWSGGEKRRRRRPRRLQLQGGGLSRLSPFPGLGTPESWSLPF YCLQHGGGGGWHIQGPGPVLNLPCAAAAPPVARAPEAAGGGSRSEDYSSSPHSAAAAARP LAAEEKQAQSLQPSSSRRSSHYPAAVQSQAAAERGASATAKSRAISILQKKPRHQQLLPS LSSFFFSHRLPDMTAIIKEIVSRNKRRYQEDGFDLDLTCIHFCGCSSLPFCHSLRTWE >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_4|1257_bp atggagcttgcaggactggaagttgctctgggtgaggtaggaaaacagtggtcctatgag tggcagcagccaatggcctggggctgcaccctgggagctagccaagacaaagatgccatg tccagaaaagaaatttcagaaatgacttatggctcagagccattcaccctgaagctttat ataattgtccctttgttacaagcttctacttctccctatggtattctggttctgaatcca gacagcggtgcaaaaggaaagagcgaatgcagtccacgccgcggaaatctaggggtagag gcaaggggggagggtattccccttgcagggaccgtccctgcatttccctctacactgagc agcgtggtcacctggtccttttcacctgtgcacagcgcctgtgagcagccgcgggggcag cgccctcggggagccggccggcctgcggcggcggcagcggcggcgtttctcgcctcctct tcgtcttttctaaccgtgcagcctcttcctcggcttctcctgaaagggaaggtggaagcc gtgggctcgggcgggagccggctgaggcgcggcggcggcggcggcacctcccgctcctgg agcgggggggagaagcggcggcggcggcggccgcggcggctgcagctccagggagggggt ctgagtcgcctgtcaccatttccagggctgggaacgccggagagttggtctctccccttc tactgcctccaacacggcggcggcggcggctggcacatccagggacccgggccggtttta aacctcccgtgcgccgccgccgcaccccccgtggcccgggctccggaggccgccggcgga ggcagccgttcggaggattattcgtcttctccccattccgctgccgccgctgccaggcct ctggctgctgaggagaagcaggcccagtcgctgcaaccatccagcagccgccgcagcagc cattacccggctgcggtccagagccaagcggcggcagagcgaggggcatcagctaccgcc aagtccagagccatttccatcctgcagaagaagccccgccaccagcagcttctgccatct ctctcctcctttttcttcagccacaggctcccagacatgacagccatcatcaaagagatc gttagcagaaacaaaaggagatatcaagaggatggattcgacttagacttgacctgtatc catttctgcggctgctcctctttacctttctgtcactctcttagaacgtgggagtag >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_5|313_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTASIILNGQKLEAFPLK TGTRQGRPLSPLLFNTVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLESPTVS AQNLLQLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPLTTATKRIKYLGIQLT RDVKDLFKNYKPLLNEIKDDTNKWKNIPCSWVGRINIVKMATLPKVIYRNNAIPIKLPIN FFTELEKTTLKFIWSQKRARTAKTILSRKKKAGGITLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSEIT >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_5|942_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcta aaaactctcaataaattaggtattgatggaacgtatctcaaaataataagagctatttat gacaaacccacagccagtatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggacgccctctctcaccactcctattcaacacagtgttggaagtt ctggccagggcaatcaggcaagagaaagaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaagccccactgtctca gcccaaaatctccttcagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtgcaaaaatcacaagcattcttatacaccaataacagacagacagagagccaaatcatg agtgaactcccactcacaactgctacaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctgttcaagaactacaaaccactgctcaacgaaataaaagatgac acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccacactgcccaaggtaatttatagaaacaatgccatccccatcaagctaccaataaat ttcttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcccgc actgccaagacaatcctaagcagaaagaaaaaagctggaggcatcacactacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagaccaatggaacagaacagagccctcagaaataacatga >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_6|211_aa MGRKQSRKAENSKTQSASSPPKDHSSSPAMEQSWMKNDFDELTEVGFRRSVVTNFSELKE DVQTHRKEAKNLEKRLDKWLTRINSVQKTINDLMELKIMARELHDACTSFSSRFDQVEER VSVIADQMNEMKQEEKFREKTVKRNEQSLQEIWDYVKRPNLRLIGVLDSDGENGTKLENT LQDIIQENFPNIARHANIQIQGIQRPPQRYS >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_6|636_bp atggggagaaaacagagcagaaaagctgaaaattctaaaacccagagtgcctcttctcct ccaaaggatcacagctcctcaccagcaatggaacaaagctggatgaagaatgactttgac gagttgacagaagtaggcttcagaaggtcagtagtaacaaacttctccgagctaaaggag gatgttcaaacccatcgcaaggaagctaaaaatcttgaaaaaagattagacaaatggcta actagaataaacagtgtacagaagaccataaatgacctgatggagctgaaaatcatggct cgagaactacatgacgcatgcacaagcttcagcagccgattcgatcaagtggaagaaagg gtatcagtaattgcagatcaaatgaatgaaatgaagcaagaagagaagtttagagaaaaa acagtaaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaat ctacgtttgattggtgtacttgacagtgacggggagaatggaaccaagctggaaaacact cttcaggatattatccaggagaacttccccaacatagcaaggcacgccaacattcaaatt cagggaatacagagaccaccacaaagatactcctag >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_7|400_aa MKTDFLTLLMGASRASLLQVSGSVTCRRHQGFGPAWSECCPLELIKPFCEDLDQWLSEDD NHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRTRDKKLTGQNQVFSKAR YTVSAFGNRRNEKTVVCCTVRGAKADEILENDLKVQECELRKNNFSDTGNFGFGTQEHID LGIRYDPSIDVYSLDFYVVLEKPGFIIADPQFVVCQLKVKIYSSNSGPTRREDKFMYFEF PQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVNTFFIPGPEETSEKVENGSLCDQEIDSI CSIERADNDKEYLVLTLTKNDLDKANKDKANRYFSPNFKVKLYFTKTVEEPSNPEASSST SVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_7|1203_bp atgaagacagattttctcaccctgcttatgggtgcttctcgtgctagccttttgcaagtg tcgggaagtgtaacctgcaggaggcatcagggctttgggcctgcatggtctgagtgctgc cctctagaacttatcaaacccttttgtgaagatcttgaccaatggctaagtgaagatgac aatcatgttgcagcaattcactgtaaagctggaaagggacgaactggtgtaatgatatgt gcatatttattacatcggggcaaatttttaaaggcacaagaggccctagatttctatggg gaagtaaggaccagagacaaaaagcttacaggtcagaaccaggtgttttccaaagctaga tatactgttagcgcctttggcaacagaagaaatgaaaagactgttgtctgctgcacagtt cgaggggccaaggcagacgaaatcctggagaatgatctaaaggtgcaggagtgtgagtta agaaaaaataacttctcagatactggaaactttggttttgggacccaggaacacattgat ctgggtatcagatatgacccaagcattgatgtctacagcctggacttctatgtggtgctg gaaaagccaggtttcatcattgcagatcctcagtttgtggtctgccagctaaaggtgaag atatattcctccaattcaggacccacacgacgggaagacaagttcatgtactttgagttc cctcagccgttacctgtgtgtggtgatatcaaagtagagttcttccacaaacagaacaag atgctaaaaaaggacaaaatgtttcacttttgggtaaatacattcttcataccaggacca gaggaaacctcagaaaaagtagaaaatggaagtctatgtgatcaagaaatcgatagcatt tgcagtatagagcgtgcagataatgacaaggaatatctagtacttactttaacaaaaaat gatcttgacaaagcaaataaagacaaagccaaccgatacttttctccaaattttaaggtg aagctgtacttcacaaaaacagtagaggagccgtcaaatccagaggctagcagttcaact tctgtaacaccagatgttagtgacaatgaacctgatcattatagatattctgacaccact gactctgatccagagaatgaaccttttgatgaagatcagcatacacaaattacaaaagtc tga >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_8|247_aa MGKDFKVKMSKAIAMKTKIDNRDLTKLKHFCTEKETIIRVNRQPTEWEKMCAVYPSDKGP ITIIYKELKQIHTKKQTTPLKSRKNWNTERLSQLGITGEVGAGWILRDQVQVKVLREKRL RKQRTVDGNLIELECEFLGAGWECPELEDLGFTGRAEISSHSYSRGMTIFHHLKKVLTNA FKITLYMSDTHFSVHLLSKQKIKGEKKDNGTFRKVENAREGEGSEEEEEIGEENSFLNQF KKLQRLN >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_8|744_bp atgggcaaagatttcaaggtgaaaatgtcaaaagcaattgcaatgaaaacaaaaattgac aatcgggatctcactaaactgaagcacttctgcacagaaaaagaaaccatcatcagagtg aacagacaacctacagaatgggagaaaatgtgtgcagtctatccatccgacaaaggtcca ataaccataatctacaaggaacttaagcaaattcacactaaaaagcaaacaaccccctta aaatcaaggaagaattggaacactgagagactaagccagttaggtatcacaggtgaagtt ggggcaggctggatcttaagagaccaagtacaagttaaagttttgagggagaaaagactg agaaagcagaggacagtggatggaaaccttatagaattggagtgtgaatttcttggtgca ggatgggaatgcccagagcttgaggatttagggtttactggcagggctgagatctcttca cactcctactcaagaggcatgactatcttccatcacctcaagaaggttttaactaatgct ttcaagataactctttatatgagcgatactcacttctcagttcatttgctcagtaaacaa aagatcaaaggggaaaaaaaggataatggtacttttaggaaagtggagaatgccagagaa ggagaaggaagtgaggaggaggaagaaataggggaggagaactcattcttgaatcagttt aagaagcttcaaagactcaattga >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_9|118_aa MATSSFVSLSEFQEEFGAFQGRTSSWRQGVGSDKRNSLTWNPEIGLGFLGRGCTASSPGW ISSMLEKNMLRNTMMNSKDLPGERDFKRRWEGLLWWPYVAAAARVLMTAMMIVVVVGS >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_9|357_bp atggcgacaagctcgttcgtgtcgttgtctgagtttcaagaagagtttggtgcgtttcag ggaagaacttcttcatggaggcaaggggttggaagtgataagagaaactctttaacttgg aatcctgagataggacttgggtttctgggcagaggttgcacagcctcttcccctggttgg atctcctctatgctggagaagaacatgttgagaaatactatgatgaacagtaaggatctg cctggggaacgggatttcaagagacgatgggaaggcttgctctggtggccctatgttgct gctgctgctagggtgctgatgacggccatgatgattgtggtggtggtgggtagttga >gi568815588f:87764470_88065469|GENSCAN_predicted_peptide_10|69_aa MSLIRSSPTFNQNHQPIDSGTEWLDHFARKPYAIGPKETNFYEKITFDVPKDAALKHAEE YWYTSLTQL >gi568815588f:87764470_88065469|GENSCAN_predicted_CDS_10|210_bp atgtcactcatcaggtcctctcccacgtttaaccagaatcatcagcctatagattcggga actgagtggcttgaccattttgcacgtaagccatatgcaattggtccaaaggagaccaat ttttatgagaaaatcacttttgatgttccaaaggatgccgcattaaagcatgctgaggag tattggtacacctccttgactcaactgtaa