GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:03:58 Sequence gi568815596f:189823336_190024213 : 200878 bp : 36.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2610 2665 56 2 2 84 73 84 0.941 7.31 1.02 Intr + 8124 8178 55 0 1 56 42 88 0.165 -0.94 1.03 Intr + 20629 20745 117 0 0 83 74 80 0.372 5.84 1.04 Intr + 30604 30747 144 0 0 41 88 123 0.981 7.16 1.05 Intr + 30904 31793 890 0 2 99 100 549 0.964 46.38 1.06 Intr + 40408 40893 486 1 0 70 64 234 0.944 10.40 1.07 Intr + 44464 44594 131 1 2 59 80 74 0.880 3.12 1.08 Intr + 50161 50321 161 2 2 114 85 68 0.991 7.89 1.09 Term + 56059 56505 447 0 0 27 39 307 0.745 13.93 1.10 PlyA + 56957 56962 6 1.05 2.04 PlyA - 57058 57053 6 1.05 2.03 Term - 65631 65232 400 2 1 57 42 191 0.181 5.00 2.02 Intr - 69199 69044 156 0 0 48 71 115 0.272 4.00 2.01 Init - 70334 70276 59 2 2 110 60 76 0.907 7.83 2.00 Prom - 75260 75221 40 -4.95 3.00 Prom + 82996 83035 40 -2.55 3.01 Sngl + 100001 100807 807 1 0 84 43 917 0.999 82.35 3.02 PlyA + 101148 101153 6 1.05 4.00 Prom + 104417 104456 40 -3.45 4.01 Init + 115060 115119 60 0 0 72 115 23 0.676 4.70 4.02 Intr + 115734 115899 166 2 1 96 68 49 0.271 2.31 4.03 Intr + 119135 119228 94 0 1 43 80 49 0.160 -2.30 4.04 Term + 120272 120479 208 2 1 41 44 167 0.446 3.43 4.05 PlyA + 121577 121582 6 1.05 5.03 PlyA - 122471 122466 6 1.05 5.02 Term - 141344 141073 272 0 2 47 48 223 0.807 8.86 5.01 Init - 141386 141368 19 2 1 50 63 12 0.331 -4.87 5.00 Prom - 148136 148097 40 -3.45 6.02 PlyA - 148417 148412 6 1.05 6.01 Sngl - 149908 148652 1257 1 0 34 32 354 0.938 20.00 6.00 Prom - 167710 167671 40 -5.25 7.02 PlyA - 168645 168640 6 1.05 7.01 Sngl - 174963 174637 327 0 0 87 48 160 0.612 7.66 7.00 Prom - 182615 182576 40 -5.85 8.02 PlyA - 184923 184918 6 1.05 8.01 Sngl - 192422 191682 741 2 0 -1 48 451 0.969 28.15 8.00 Prom - 194427 194388 40 -6.25 9.03 PlyA - 194689 194684 6 1.05 9.02 Term - 195700 195464 237 1 0 68 49 165 0.950 5.78 9.01 Intr - 198356 198192 165 2 0 8 115 66 0.249 0.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_1|828_aa MQALNQEQRPKKLAVEIWHYYEEESMGLAVSGKGPEKAVIWQKSRVSDHKMALMSVLGTA VMNNMESFQYHSEESQLIRHHYNLKCLKESTRLYPVFFLKIDVPTADVDVNLTPDKSQVL LQNKESVLIALENLMTTCYGPLPSTNSYENNKTDVSAADIVLSKTAETDVLFNKVESSGK NYSNVDTSVIPFQNDMHNDESGKNTDDCLNHQISIGDFGYGHCSSEISNIDKNTKNAFQD ISMSNVSWENSQTEYSKTCFISSVKHTQSENGNKDHIDESGENEEEAGLENSSEISADEW SRGNILKNSVGENIEPVKILVPEKSLPCKVSNNNYPIPEQMNLNEDSCNKKSNVIDNKSG KVTAYDLLSNRVIKKPMSASALFVQDHRPQFLIENPKTSLEDATLQIEELWKTLSEEEKL KYEEKATKDLERYNSQMKRAIEQESQMSLKDGRKKIKPTSAWNLAQKHKLKTSLSNQPKL DELLQSQIEKRRSQNIKMVQIPFSMKNLKINFKKQNKVDLEEKDEPCLIHNLRFPDAWLM TSKTEVMLLNPYRVEEALLFKRLLENHKLPAEPLEKPIMLTESLFNGSHYLDVLYKMTAD DQRYSGSTYLSDPRLTANGFKIKLIPGVSITENYLEIEGMANCLPFYGVADLKEILNAIL NRNAKEVYECRPRKVISYLECWDYPPRLAGNVFLTVTGVADWREQEWASGEGLALPHGPI LVDEAKGSLATTQKPGNLPQRSHPPQVLTTAPAPPAGPNAEAGSGDREALAPRMEVTAAA AWEAYPELRARRGDPKEKESFTPSYPFASLGWVARALGLGEGLLRYEF >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_1|2487_bp atgcaggcccttaaccaggaacagagacccaaaaaactggccgtggagatttggcactat tatgaggaagaatcaatgggactagctgtgagtggcaaaggaccagaaaaggcagttatt tggcagaaaagcagagtatcagatcacaagatggctctcatgtcagttctggggactgct gttatgaacaatatggaatcctttcagtaccactctgaagaatctcagttaatccgacat cattacaatctgaaatgcctaaaggaatctactcgtttgtatcctgttttctttctgaaa atcgatgttcctacagctgatgttgatgtaaatttaacaccagataaaagccaagtatta ttacaaaataaggaatctgttttaattgctcttgaaaatctgatgacgacttgttatgga ccattacctagtacaaattcttatgaaaataataaaacagatgtttccgcagctgacatc gttcttagtaaaacagcagaaacagatgtgctttttaataaagtggaatcatctggaaag aattattcaaatgttgatacttcagtcattccattccaaaatgatatgcataatgatgaa tctggaaaaaacactgatgattgtttaaatcaccagataagtattggtgactttggttat ggtcattgtagtagtgaaatttctaacattgataaaaacactaagaatgcatttcaggac atttcaatgagtaatgtatcatgggagaactctcagacggaatatagtaaaacttgtttt ataagttccgttaagcacacccagtcagaaaatggcaataaagaccatatagatgagagt ggggaaaatgaggaagaagcaggtcttgaaaactcttcggaaatttctgcagatgagtgg agcaggggaaatatacttaaaaattcagtgggagagaatattgaacctgtgaaaatttta gtgcctgaaaaaagtttaccatgtaaagtaagtaataataattatccaatccctgaacaa atgaatcttaatgaagattcatgtaacaaaaaatcaaatgtaatagataataaatctgga aaagttacagcttatgatttacttagcaatcgagtaatcaagaaacccatgtcagcaagt gctctttttgttcaagatcatcgtcctcagtttctcatagaaaatcctaagactagttta gaggatgcaacactacaaattgaagaactgtggaagacattgagtgaagaggaaaaactg aaatatgaagagaaggctactaaagacttggaacgatacaatagtcaaatgaagagagcc attgaacaggagtcacaaatgtcactaaaagatggcagaaaaaagataaaacccaccagc gcatggaatttggcccagaagcacaagttaaaaacctcattatctaatcaaccaaaactt gatgaactccttcagtcccaaattgaaaaaagaaggagtcaaaatattaaaatggtacag atccccttttctatgaaaaacttaaaaataaattttaagaaacaaaacaaagttgactta gaagagaaggatgaaccttgcttgatccacaatctcaggtttcctgatgcatggctaatg acatccaaaacagaggtaatgttattaaatccatatagagtagaagaagccctgctattt aaaagacttcttgagaatcataaacttcctgcagagccactggaaaagccaattatgtta acagagagtctttttaatggatctcattatttagacgttttatataaaatgacagcagat gaccaaagatacagtggatcaacttacctgtctgatcctcgtcttacagcgaatggtttc aagataaaattgataccaggagtttcaattactgaaaattacttggaaatagaaggaatg gctaattgtctcccattctatggagtagcagatttaaaagaaattcttaatgctatatta aacagaaatgcaaaggaagtttatgaatgtagacctcgcaaagtgataagttatttagag tgctgggattacccaccgcgcctggccggcaacgttttcctaacggtgactggagtagca gactggcgtgaacaggaatgggcttcaggtgagggtctagcgttgccccacggaccaatc ctggttgacgaggctaagggaagcctagcaaccacgcagaagcccggcaatcttccgcag cgctcccatcctccacaagtccttacaacagccccagcaccacccgcgggaccaaatgct gaagctgggagcggcgacagggaagcgctggctcccagaatggaggtaacggctgctgcg gcctgggaggcttacccggaattaagggcgaggcgaggagatccgaaggaaaaggaaagc ttcaccccaagttacccatttgcctctttaggctgggttgccagagcccttggcctggga gaaggacttttaaggtatgagttctaa >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_2|204_aa MAEQEQLWSAAPSEINTEGRVWNGPPANTSRPASEGPVRRKTNKHKGIASTSTKRMSTQK PHPKVTNIKDQSIGSSPIRQEKERKGIQMGRQEVKLSLFADDMIVYLENPIISAQNLLKL TTNFSKVSGYKINVQKSQAFLNTNNRQTESQIMSELPFTIATKRIKYLGIQLTRDVKDLF KENYKPLLKEIREDTNKWKNIPCS >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_2|615_bp atggccgaacaggaacagctctggtctgcagctcccagtgagatcaacacagaaggcagg gtctggaatggaccaccagcaaataccagcagacctgcatcagaggggcctgttagaagg aaaactaacaaacataaaggaatagcatcaacatcaacaaaaaggatgtctacacagaaa ccgcacccgaaagtcaccaacatcaaagaccaaagtattggaagttctccaatcaggcaa gagaaagaaagaaagggtattcaaatgggaagacaggaagtcaaattgtccctgtttgca gatgacatgattgtatatttagaaaaccccatcatctcagctcaaaatctccttaagctg acaaccaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattc ttaaacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt gctacaaagagaataaaatacctaggaatacaacttacaagggatgtgaaggacctcttc aaggagaactacaaaccactgctcaaggaaataagagaggacacaaacaaatggaaaaac attccatgctcatga >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_3|268_aa MASNVTNKTDPRSMNSRVFIGNLNTLVVKKSDVEAIFSKYGKIVGCSVHKGFAFFQYVNE RNARAAVAGEDGRMIAGQVLDINLAAEPKVNRGKAGVKRSAAEMYGSSFDLDCDFQRDYY DRMYSYPARVPPPPPIARAVVPSKRQRVSGNTSQRGISGFNSKSGQRGSSKSGKLKGDDL QAIKRELTQIKQKVDSFLENLEKIEKEQSKQAVEMNNVKSEEEQSSSSVKKDETNVKMES EGGADDSAEEGDLLDDDDNEDGGMTSWS >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_3|807_bp atggccagcaacgttaccaacaagacagatcctcgttccatgaactcccgtgtattcatt gggaatctcaacactcttgtggtcaagaaatctgatgtggaggcaatcttttcgaagtat ggcaaaattgtgggctgctctgttcataagggctttgccttctttcagtatgttaatgag agaaatgcccgggctgctgtagcaggagaggatggcagaatgattgctggccaggtttta gatattaacctggctgcagagccaaaagtgaaccgaggaaaagcaggtgtgaaacgatct gcagcggagatgtacggctcctcttttgacttggattgtgactttcaacgggactattat gataggatgtacagttacccagcacgtgtacctcctcctcctcctattgctcgggctgta gtgccctcgaaacgtcagcgtgtatcaggaaacacttcacaaaggggcataagtggcttc aattctaagagtggacagcggggatcttccaagtctggaaagttgaaaggagatgacctt caggccattaagagggagctgacccagataaaacaaaaagtggattctttcctggaaaac ctggaaaaaattgaaaaggaacagagcaaacaagcagtagagatgaacaatgttaagtca gaagaggagcagagcagcagctccgtgaagaaagatgagactaatgtgaagatggagtct gaggggggtgcagatgactctgctgaggagggggacctactggatgatgatgataatgaa gatggggggatgaccagctggagttga >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_4|175_aa MKKNEIQSFSTTWMELEVIMEALGVCEGRNIPEDSQRQSREVGLSSPAMETLLCYSTKGD AKSISVAFQWHHAVGAEQPNTENWYQKWGFTIKVPQNVETTSELGNRPQNGKSTSSLHPE PGKAAGTQHQLLKAAMGAEPCKATEVELPRPLEPTPCTSLPWMWDMESLEIILEL >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_4|528_bp atgaaaaagaatgagattcagtcattttcaacaacatggatggaactggaggtcattatg gaagcattgggagtgtgtgaaggaagaaatattcctgaggacagccagagacaaagcagg gaggtgggactatcatccccagccatggaaactctgctctgttactccaccaaaggagat gccaaatcaatcagtgtggcctttcagtggcaccatgctgtaggagcagaacagcctaac acagaaaattggtaccagaagtggggctttactataaaggtacctcaaaatgtggaaaca acttcagaactaggtaacagaccccagaatggtaaatccaccagcagcttgcaccctgaa cctggaaaagctgcaggcactcaacaccagctcttgaaagcagccatgggggctgaaccc tgcaaagccacagaggtggagctcccaaggcctttggagcccaccccttgcaccagtttg ccttggatgtgggatatggagtcactggagattattttggagctttaa >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_5|96_aa MFETNIGSHIHNIKVQGEAASADVEATSYPEDLAKIIDEGGYTKQQIFNINKKYFYWKKM PSRTFKAIEAKSVLGNKAFKDRLTLFLRANAAGDLS >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_5|291_bp atgtttgagactaacataggaagccatatccataacataaaagtacaaggtgaagcagct agtgctgatgtagaagctacaagttatccagaagatctagctaagatcattgatgaaggt ggctacactaaacaacagattttcaatatcaacaaaaaatacttctattggaagaagatg ccatctaggacttttaaagctatagaagcgaagtcagtgcttggaaacaaggctttcaag gataggctgactctctttttaagggctaatgcagctggtgatttaagttga >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_6|418_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLK TSTRQGCPLSPLLFNIVLEVLARAIRQEKEIKDIQLGKEEVKLSLFADDMIVYLEKPIIS APNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQITSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRISIMKMAILPKVIYRFNAIPIKLPM TFFAELEKTTSKFIWNQKRARIAKTILSQKNRAGCIMLPDFKLYYKTTAIKTAWYWYQNR DIDQWNRTEPSEIIPHIYNHLIFDKPDKNKKRGKDSLFNKWCWENWLAICRKLKLDPFFT PYTKINSRWIKDLNVRHKTIKTLEENLGNTIQDIGMGKDFMSKHQKQWQQKPKLTNGI >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_6|1257_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagccctttatgcta aaaactctcaataaattaggtattgatgggatgtatctcaaaataataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actagcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaagagaaagaaataaaggatattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaagcccatcatctca gccccaaatctccttaagctgataagcaactttagcaaggtctcaggatacaaaatcaat gtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcacg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagag gacacaaacaaatggaagaatattccatgctcatggataggaagaatcagtatcatgaaa atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttctttgcagaattggaaaaaactacttcaaagttcatatggaaccaaaaaagagcc cgcattgccaagacaatcctaagccaaaagaacagagctggatgcatcatgctacctgac ttcaaactatactacaagactacagcaatcaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaaccgaacagaaccctcagaaataataccacacatctacaaccat ctgatctttgacaaacctgacaaaaacaagaaaaggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttctttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacataaaaccata aaaaccctagaagaaaacctaggcaatactattcaggacataggaatgggcaaggacttc atgtcaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaa >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_7|108_aa MQLPFGWTEQHVETDTMDFCSKNYHRNIPGKLREFTDSLKEVHTTGNSVRQTKNCDFPKY ERGKSYLQKHILTGELEKSRLWEDLSFPRAETDLAQNIKVKAAVGRGL >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_7|327_bp atgcagctcccatttggatggacagaacagcatgtggagactgacaccatggacttttgc tccaagaactaccacaggaacataccaggaaaactgagagaattcacagattctttgaaa gaagtgcacaccactggaaattctgtaagacagacaaaaaactgtgactttccaaagtat gaaagggggaaatcttacctccaaaaacacatactcactggggaactggaaaaatccaga ttatgggaggacttaagctttcctagagctgaaacagatttagcacaaaatataaaagta aaagcagcagtgggaagaggcttgtag >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_8|246_aa MHSQRDGLKLELIHKREAECKSLKNLQPDHAVEKKNPFSGKEFKLAAEICMSNEEPSVNS QDDGENVSRESQRLSQQPLLSQAQRPTGKQCFHGLSPGLCCCSVQPRDLAPCIPTMAKRG QCTAEAVVSEGISPKPWWLTCGVGPMSAQKPRIEVWEPPYRFQRMYGNAWMSRQKFAAGV ETSRRTSAKAVQKGNVGLEPPQRVLTGALPSGAVRRGPPSSRPQDDRSTDSLHVHVEKLQ TLNASL >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_8|741_bp atgcattcacaaagagatggtttgaaactggaacttatacataaaagggaagcagagtgt aaaagtttgaaaaatttgcagcctgaccatgcagtagaaaagaaaaacccattttccgga aaagaattcaagctggctgcagaaatttgcatgagtaatgaggagccaagtgttaatagc caagatgatggagaaaatgtctccagggaatctcagaggctttcacagcagcctctccta tcacaggcccagaggcctacagggaaacaatgttttcatgggctgagcccagggctttgc tgctgctctgtgcagcctcgggacttggcaccctgcatcccaaccatggctaaaaggggc caatgtacagctgaggctgttgtttcagagggtataagccccaaaccttggtggcttaca tgtggtgttgggcccatgagtgcacagaagccaagaattgaggtttgggaacctccatat agatttcaaaggatgtatggaaatgcctggatgtccaggcagaagtttgctgcaggggtg gaaacttcacggagaacctctgctaaggcagtgcagaagggaaatgtgggtttggagccc ccacagagagtcctcactggggccctgcctagtggagctgtaagaagagggccaccatcc tccagaccccaggatgacagatcaaccgacagcttgcatgtgcatgtggaaaagctacag acactcaatgccagcctgtga >gi568815596f:189823336_190024213|GENSCAN_predicted_peptide_9|133_aa QSLVLDRNGQDLVSAPCPVIGSGFPRKAWAQLKSLDSEGTYSWRLSANCTPCNQMNANSG TTWPENTPYDPSQSETIVVSSHQLCPIAESSRQPHLNSEQRQWPSHLENPKASSACPGFL SAGPSRITGYMKQ >gi568815596f:189823336_190024213|GENSCAN_predicted_CDS_9|402_bp caaagtctggtattggacagaaatggccaggaccttgtgtctgcaccatgcccagtcatt ggctctggcttcccaagaaaagcatgggctcagctcaaaagcttagattctgaaggcact tacagctggaggctatcagctaactgcactccttgcaatcaaatgaatgcaaacagcggt accacctggccagaaaatacaccctatgacccatcccaatcagagacaattgtggtgtcc agccatcagctctgcccgattgcagaatccagccggcagccccacctaaactcagagcaa agacagtggcccagccatctagagaacccaaaagcaagctctgcctgcccagggttctta tcagctggcccatccagaatcacaggctacatgaaacagtga