GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:00:11

Sequence gi568815581f:29149141_29354346 : 205206 bp : 44.93% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4565   4623   59  0  2  150   48    50 0.351   5.05
 1.02 PlyA +  11077  11082    6                               1.05

 2.03 PlyA -  11383  11378    6                               1.05
 2.02 Term -  14561  14439  123  2  0   84   37    62 0.229  -0.92
 2.01 Init -  17800  16802  999  1  0   76   76  1425 0.369 133.50
 2.00 Prom -  26225  26186   40                              -7.36

 3.00 Prom +  26617  26656   40                              -4.56
 3.01 Init +  27820  27880   61  0  1   96   55    36 0.798   2.45
 3.02 Term +  30918  31459  542  1  2  113   54   165 0.756  10.02
 3.03 PlyA +  34418  34423    6                               1.05

 4.05 PlyA -  34479  34474    6                               1.05
 4.04 Term -  42743  42534  210  2  0   32   48    97 0.137  -2.61
 4.03 Intr -  48020  47982   39  2  0   88   75    46 0.312   1.72
 4.02 Intr -  48659  48581   79  1  1  118   86    56 0.641   7.95
 4.01 Init -  50503  50463   41  1  2   52   63    77 0.711   1.29
 4.00 Prom -  52322  52283   40                              -5.76

 5.02 PlyA -  52439  52434    6                               1.05
 5.01 Sngl -  55334  54537  798  2  0   99   48   336 0.979  26.56
 5.00 Prom -  69383  69344   40                              -4.66

 6.00 Prom +  71072  71111   40                              -0.26
 6.01 Init +  76228  76311   84  0  0   77   80    69 0.503   5.82
 6.02 Intr +  91493  91519   27  1  0  100   94    -1 0.138   0.01
 6.03 Intr +  92457  92545   89  2  2  108   58    62 0.218   3.97
 6.04 Intr + 100002 100066   65  0  2   88  121    85 0.098  10.16
 6.05 Intr + 101013 101160  148  1  1   87   77    82 0.338   6.29
 6.06 Intr + 102924 103065  142  0  1   50   68    88 0.878   3.36
 6.07 Intr + 104500 104635  136  0  1   23    9   207 0.697   6.44
 6.08 Term + 105062 105213  152  0  2   66   38   171 0.873   7.97
 6.09 PlyA + 105329 105334    6                               1.05

 7.06 PlyA - 106388 106383    6                               1.05
 7.05 Term - 115451 115399   53  0  2   81   49    47 0.304  -2.31
 7.04 Intr - 118390 118358   33  2  0  112  116     3 0.262   3.69
 7.03 Intr - 138576 136852 1725  1  0  123   88   876 0.852  78.42
 7.02 Intr - 144847 144643  205  1  1   47   41   201 0.002  10.07
 7.01 Init - 154834 154742   93  1  0   46   80    28 0.013  -1.82
 7.00 Prom - 185465 185426   40                              -2.36

 8.00 Prom + 186134 186173   40                              -2.96
 8.01 Init + 187961 187978   18  1  0   81  116     4 0.648   2.83
 8.02 Term + 194723 194866  144  1  0   81   28   103 0.929   1.51
 8.03 PlyA + 196078 196083    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_1|19_aa
XELKAVPAAKTSPSKQLHV

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_1|60_bp
natgagctgaaagcagttcctgcagccaaaactagtccttccaagcaacttcatgtttga

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_2|373_aa
MFNLMKKDKDKDGGRKEKKEKKEKKERMSAAELRSLEEMSLRRGFFNLNRSSKRESKTRL
EISNPIPIKVASGSDLHLTDIDSDSNRGSVILDSGHLSTASSSDDLKGEEGSFRGSVLQR
AAKFGSLAKQNSQMIVKRFSFSQRSRDESASETSTPSEHSAAPSPQVEVRTLEGQLVQHP
GPGIPRPGHRSRAPELVTKKFPVDLRLPPVVPLPPPTLRELELQRRPTGDFGFSLRRTTM
LDRGPEGQACRRVVHFAEPGAGTKDLALGLVPGDRLVEINGHNVESKSRDEIVEMIRQSG
DSVRLKVQPIPELSELSRSWLRSGEGPRREPSDHRDHPGISDFPSFPFPTICCYPIPGEV
DGLALEALVDSKL

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_2|1122_bp
atgtttaacctaatgaagaaagacaaggacaaagatggcgggcggaaggagaagaaggag
aaaaaggagaaaaaggagcggatgtcagcggcagagcttcggagcctggaggagatgagc
ctgcgacgtggcttcttcaacctgaaccgctcctccaagcgtgaatccaagacgcgcctg
gaaatctccaaccccatccccatcaaggtggccagcggctctgacctgcacctgactgac
attgactccgatagtaaccggggcagcgtcatcctggactcgggccacctaagtacagcc
agctccagcgatgacctcaagggtgaggagggtagcttccgtggctcggtgctgcagcgg
gcagccaagttcggctcactggccaagcagaactcacagatgattgtcaagcgcttttcc
ttctcccagcgtagccgggatgagagcgcctcagaaacctcgacgccctcagagcactct
gccgccccctcgccacaggtggaggtgaggactctagagggacagctggtgcagcatcct
ggcccaggcatccctcgaccagggcaccgatcccgagcccctgagctagtgactaaaaag
ttcccagtcgacctgcgcctgccccccgtggtgcccctgcccccacctaccctccgggag
ctggagctgcaacgacggcccactggagactttggcttctccctgcggcgcacaaccatg
ctggatcggggccccgagggccaggcctgtcggcgtgtggtccactttgctgagcctggt
gcaggcaccaaggacctggccctggggctggtgccaggagatcgactggtggagattaat
gggcacaatgtggagagcaagtccagggatgagattgtggagatgatccggcagtcaggg
gacagcgtgcggctcaaggtgcagcccattccagagctcagcgagctcagcaggagctgg
ctgcggagcggcgagggacctcgcagggagccatccgatcacagggatcatcctggcatc
tcagattttcctagctttcccttcccaactatctgctgctatcccatccctggggaagta
gatggtttagctttggaggccctggtggattccaagttatag

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_3|200_aa
MEERERESSCGLARFPAALPGWLEGPVVPPSPPLPIGPHPRASSSTAPPGTGAGSPGRCF
QGTGEEEEEPAGPAARPPSLPAGARPAAPPSRARHRAAAAADRHSPATGARPAGSRSGDA
ASSSLGPVRDGAERSAAQLSAAPSDRAGRAERELPAGGCGDKRPACLAAPVRAARRGPGG
PGAALAASVRCGLRSPSRCV

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_3|603_bp
atggaggaacgcgagcgggagagctcctgcggcctcgccaggttccctgctgctcttcca
ggttggctggaagggccggtggttcccccttccccgccgctcccgatcggcccccacccc
cgggcgtcgagctccacggcgccgccagggacgggagccgggagtccgggccgctgcttc
caggggacgggggaggaggaggaggagccggcgggccccgccgcccgcccgccctccctc
ccggccggagcccgccccgccgccccaccaagccgggcccgccaccgagcggccgccgcc
gccgaccggcactcaccggccaccggagcccgcccggcaggcagcagaagcggagacgcg
gcatccagcagcctcggcccggtcagggatggagcagagcgcagcgcggcgcagctcagc
gccgcccccagcgaccgcgcaggccgagccgaaagggagctccctgctggcggctgtggg
gataaacgcccggcctgcctggcagcgccagtgcgcgccgccagacgtgggccaggcggg
ccgggcgctgccctggcagcctccgtccgctgtggactccgaagcccctctcgctgtgtc
tga

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_4|122_aa
MPVAILVDCVLQDQNHKMSDWSFLGWLLTRVQNDSTVVGKSHEEENLPELHVQRKLFTIK
SFTESVLTPELGHWEPLRNPKPRSDMIIFVLWKDNSGSHMEDQLEGMKRKQATTRRPLLL
SS

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_4|369_bp
atgcctgtggccatcctagtggactgcgtcctccaggaccaaaatcacaagatgagcgac
tggtcattcctgggctggctcctgacccgagtgcagaacgattccaccgtggttggcaag
agccatgaggaagaaaatcttcctgaacttcatgtgcagcgtaaactatttaccatcaag
tcctttacagaaagtgtgctgacccctgagctgggccattgggaacctttgaggaatccc
aagccaagaagtgacatgatcatatttgtattatggaaagacaactctggcagccacatg
gaggaccagttagaggggatgaagcggaagcaggcgaccacgagaaggccgttgctattg
tccagctaa

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_5|265_aa
MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPL
LEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDE
IFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAF
PISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTTNTELKDLPKRIPKDSARYHFFL
YKHSHEGDYLESIVLFIQCLDTHAV

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_5|798_bp
atgtcccaccagaccggcatccaagcaagtgaagatgttaaagagatctttgccagagcc
agaaatggaaagtacagacttctgaaaatatctattgaaaatgagcaacttgtgattgga
tcatatagtcagccttcagattcctgggataaggattatgattcctttgttttacccctg
ttggaggacaaacaaccatgctatatattattcaggttagattctcagaatgcccaggga
tatgaatggatattcattgcatggtctccagatcattctcatgttcgtcaaaaaatgttg
tatgcagcaacaagagcaactctgaagaaggaatttggaggtggccacattaaagatgaa
atatttggaacagtaaaggaagatgtatcattacatggatataaaaaatacttgctgtca
caatcttcccctgccccactgactgcagctgaggaagaattacgacagattaaaatcaat
gaggtacagactgacgtgggtgtggacactaagcatcaaacactacaaggagtagcattt
cccatttctcgagaagcctttcaggctttggaaaaattgaataacagacagctcaactat
gtgcagttggaaatagatataaaaaatgaaattataattttggccaacacaacaaataca
gaactgaaagatttgccaaagaggattcccaaggattcagctcgttaccatttctttctg
tataaacattcccatgaaggagactatttagagtccatagttttatttattcaatgcctg
gatacacatgcagtataa

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_6|280_aa
MVYSMRSSKTGIQKEKNHNSGGKAADKEDAVVGWSAWRQGQGVKAIHKIPLSIYAAWRVA
LVSRSPRNPSNHQDGSDQPYAGVPGAMEALLRHSISFQITIYDQENFQGKRMEFTSSCPN
VSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFR
PICSANHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQTGFATNILDI
VGISISWNVTIMEETINIGESGALMPRLRRSNRFAESNSS

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_6|843_bp
atggtgtatagcatgagaagctccaaaacaggaattcagaaggagaaaaatcacaacagt
ggagggaaggcagcagacaaagaggatgctgtggttggctggtcagcctggaggcaaggc
cagggagtcaaagccatccacaaaattccactgagcatctatgctgcttggcgtgtagcc
ctggtgtccagaagcccaagaaacccttccaaccaccaagatggctcagaccaaccctac
gccggggtccctggggccatggaagctctcttgcgccattcaatctcatttcagataacc
atctatgatcaggagaactttcagggcaagaggatggagttcaccagctcctgtccaaat
gtctctgagcgcagttttgataatgtccggtccctgaaggtggaaagtggcgcctggatt
ggttatgagcataccagcttctgtgggcaacagtttatcctggagagaggagaataccct
cgctgggatgcctggagtgggagtaatgcctaccacattgagcgtctcatgtccttccgc
cccatctgttcagctaatcataaggagtctaagatgaccatctttgagaaggaaaacttt
attggacgccagtgggagatctctgacgactacccctccttgcaagccatgggctggttc
aacaacgaagtcggctccatgaagatacaaactgggtttgctaccaatatcctggatatc
gtgggtatcagtatatcttggaatgtgaccatcatggaggagactataaacattggagag
agtggggctctcatgcccagacttcgcagatccaatcgattcgccgaatccaacagtagc
tga

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_7|702_aa
MEKLPVKSHEELSSSVQHLMGLKTKSYQQPWQQQQQPHHHHHYYFYNHSHNHHHHHHHQQ
PHQYLQHGAEGSPKAQPKPLKHEQKHTLQQHQETPKKKTGYGELNGNAGEREISLKNLSS
DEATNPISRVLNGNQQVVDTSLKQTVKANTFGKAGIKTKNFIQKNSMDKKNGKSYENKSG
ENQSVDKSDTIPIPNGVVTNNSGYITNGYMGKGADNDGSGSESGYTTPKKRKARRNSAKG
CENLNIVQDKIMQQETSVPTLKQGLETFKPDYSEQKGNRVDGSKPIWKYETGPGGTSRGK
PAVGDMLRKSSDSKPGVSSKKFDDRPKGKHASAVASKEDSWTLFKPPPVFPVDNSSAKIV
PKISYASKVKENLNKTIQNSSVSPTSSSSSSSSTGETQTQSSSRLSQVPMSALKSVTSAN
FSNGPVLAGTDGNVYPPGGQPLLTTAANTLTPISSGTDSVLQDMSLTSAAVEQIKTSLFI
YPSNMQTMLLSTAQVDLPSQTDQQNLGDIFQNQWGLSFINEPSAGPETVTGKSSEHKVME
VTFQGEYPATLVSQGAEIIPSGTEHPVFPKAYELEKRTSPQVLGSILKSGTTSESGALSL
EPSHIGDLQKADTSSQGALVFLSKDYEIESQNPLASPTNTLLGSAKEQRYQRGLERNDSW
GSFDLRAAIVYHTKEMESIWNLQKQDPKRIITYNEAMDSPDQ

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_7|2109_bp
atggagaaacttccagtgaaaagtcatgaggaactgagttcttcagtccaacacctcatg
gggttgaaaactaagtcctaccaacaaccatggcagcagcagcagcagccgcaccaccac
caccattattatttctacaaccacagccacaaccaccaccaccaccatcatcaccagcag
cctcaccaatacctgcagcatggagccgagggcagccccaaggcccagccaaagccgctg
aaacatgagcagaaacacaccctccagcagcaccaggaaacgccgaagaagaaaacaggc
tatggtgaactaaacggtaatgctggagaaagagaaatatctttaaagaacctgagttct
gatgaagccaccaaccctatttccagggtcctcaatggcaaccagcaagttgtagacact
agcctgaagcagactgtaaaggccaacacctttgggaaagcaggaattaaaaccaagaat
ttcattcagaaaaacagtatggacaaaaagaatgggaagtcttatgaaaataaatctgga
gagaatcagtctgtagataagtctgatactataccaattccaaatggtgtggtaacaaat
aattctggttatattactaatggttatatgggtaaaggagcagataatgatggtagtgga
tctgagagcggatatacaactcctaaaaaaaggaaagctaggcgcaatagtgccaagggt
tgtgaaaaccttaatatagtgcaggacaaaataatgcaacaagagaccagtgtcccaacc
ttaaaacagggacttgaaactttcaagcctgactatagtgaacaaaagggaaatcgagta
gatggttcgaagcccatttggaagtatgaaactgggcctggaggaacaagtcgaggaaaa
cctgctgtgggtgatatgcttcggaaaagctcagatagtaaacctggtgtgagcagcaaa
aagtttgatgatcggcccaaaggaaagcatgcttcagctgttgcctccaaagaggactcg
tggaccctatttaaaccacccccagtttttccagtggacaatagcagtgctaaaatagtt
cctaaaataagttatgcaagcaaagttaaggaaaacctcaacaaaactatacagaactct
tctgtgtcaccaacttcatcttcatcatcttcatcatctaccggggaaactcagacccaa
tcatcaagtcgcttatcccaggtccctatgtcagcgctgaaatctgttacttctgccaac
ttttctaatgggcctgttttagcagggactgatggaaatgtttatcctccagggggtcag
ccactgctaactactgctgctaatactctaacacccatctcttctgggacagattcagtt
ctccaggacatgagtctaacttcagcagctgttgaacaaattaagactagcctttttatc
tatccttcaaatatgcaaactatgctgttgagcacagcacaagtggatctgccctctcag
acagatcagcaaaacctgggggatatcttccagaatcagtggggtttatcatttataaat
gagcccagtgctggccctgagactgttactgggaagtcatcagagcataaagtgatggag
gtgacatttcaaggagaatatcctgctactttggtttcacagggtgctgaaataattccc
tcaggaactgagcatcctgtgtttcccaaggcttacgagctggagaaacggactagtcct
caagttctgggtagcattctaaaatctgggactactagtgagagtggagccttatccttg
gaacccagtcatataggtgacctgcagaaagcagacaccagtagtcaaggtgctttagtg
tttctctcaaaggactacgagatagaaagtcaaaatcctctggcctctcctacgaacact
ttgttaggctctgccaaagaacagagataccagagaggcctagaaaggaatgatagctgg
ggttcttttgacctgagggctgctattgtatatcacactaaagaaatggaatctatttgg
aatttgcagaagcaagatcccaaaaggataatcacttacaatgaagccatggatagtcca
gatcaatga

>gi568815581f:29149141_29354346|GENSCAN_predicted_peptide_8|53_aa
MNVRVKISDLISHNQGNTMMMLEKGPNPDPKRGVLGSHAERNSRQVTECSKKR

>gi568815581f:29149141_29354346|GENSCAN_predicted_CDS_8|162_bp
atgaatgtgagagtcaagatttctgatctaatcagccacaaccagggtaacacaatgatg
atgctggaaaaaggtcccaacccagaccctaaaagaggggttcttggatctcatgcagaa
aggaattcaaggcaagtcacagagtgcagcaagaagagataa