GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:52:20 Sequence gi568815597f:34657331_34858149 : 200819 bp : 46.94% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9138 9146 9 2 0 87 98 6 0.048 1.72 1.02 Intr + 29039 29125 87 1 0 46 94 119 0.680 8.47 1.03 Intr + 36146 36222 77 1 2 51 88 64 0.202 1.01 1.04 Intr + 39485 39587 103 2 1 58 44 93 0.097 2.08 1.05 Term + 48980 49120 141 1 0 84 49 104 0.825 4.03 1.06 PlyA + 50417 50422 6 1.05 2.05 PlyA - 50573 50568 6 1.05 2.04 Term - 59162 58995 168 2 0 44 43 158 0.213 4.78 2.03 Intr - 76589 76477 113 0 2 86 75 67 0.127 5.30 2.02 Intr - 85279 85096 184 1 1 41 75 174 0.359 10.76 2.01 Init - 90896 90843 54 2 0 89 72 44 0.331 4.21 2.00 Prom - 94014 93975 40 -2.26 3.00 Prom + 96768 96807 40 -5.46 3.01 Init + 100001 100818 818 1 2 94 42 1026 0.346 92.79 3.02 Intr + 101053 101187 135 1 0 107 49 45 0.273 2.18 3.03 Intr + 103843 104711 869 1 2 42 94 1297 0.326 116.69 3.04 Intr + 107899 108119 221 2 2 41 94 85 0.467 2.42 3.05 Intr + 116863 116986 124 0 1 30 38 127 0.188 2.06 3.06 Intr + 117072 117284 213 1 0 89 53 70 0.514 2.29 3.07 Term + 127428 128245 818 1 2 33 47 1476 0.013 131.30 3.08 PlyA + 129015 129020 6 1.05 4.00 Prom + 129995 130034 40 -6.36 4.01 Init + 130666 130722 57 0 0 100 55 26 0.141 1.60 4.02 Intr + 135585 135738 154 2 1 98 101 35 0.421 5.45 4.03 Term + 136867 137885 1019 2 2 128 44 1482 0.900 140.08 4.04 PlyA + 138396 138401 6 1.05 5.06 PlyA - 138473 138468 6 -3.24 5.05 Term - 139030 138938 93 1 0 83 48 63 0.095 -0.37 5.04 Intr - 140454 140341 114 0 0 95 80 33 0.092 3.84 5.03 Intr - 141116 141041 76 1 1 49 78 68 0.019 1.42 5.02 Intr - 166574 166487 88 0 1 114 77 1 0.171 0.63 5.01 Init - 168525 168426 100 0 1 76 27 139 0.491 5.02 5.00 Prom - 172116 172077 40 -2.76 6.06 PlyA - 174238 174233 6 1.05 6.05 Term - 174826 174650 177 1 0 96 44 75 0.364 1.59 6.04 Intr - 185816 185687 130 1 1 48 86 29 0.201 -0.60 6.03 Intr - 186437 186260 178 0 1 58 95 127 0.912 9.38 6.02 Intr - 194356 194272 85 1 1 97 53 47 0.064 1.49 6.01 Intr - 198652 198441 212 2 2 121 77 252 0.051 26.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 42844 42912 69 0 0 86 81 85 0.908 6.66 S.002 Sngl + 127433 128245 813 1 0 111 47 1456 0.982 139.58 S.003 Term - 198652 198369 284 2 2 121 39 342 0.939 28.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_1|138_aa MKQTLEVRSNYRRPTDEEIEAQRVELLKVTGKLQKQYLTATCGSERGTEKQGCPLRKSSS TGSNGSYQPPSDMPFFLATATNSKAHLMWSEPEVAEWTKEVLNENSQVPEQNCAERLATG LSQDVRGKKWMLEIFQRS >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_1|417_bp atgaagcagaccctagaagttaggtctaattatcgccgtcctacagatgaggaaatagag gcccagagagttgaactgctcaaggtcaccgggaagctacagaagcaatacctgacagcc acatgtggaagtgagcgtgggaccgagaagcaaggctgccctctgagaaagagttccagc accgggtccaatgggagctaccagccaccttctgacatgcccttcttcttggccactgcg accaactcaaaggcccacctgatgtggtctgaaccagaggtggcagaatggaccaaagaa gttctaaatgagaattcacaggtgccagagcagaactgtgcagaacgtctggccacaggt ctttcacaagacgttcggggcaaaaagtggatgttggagatcttccaaaggagttaa >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_2|172_aa MTLSDQAIASVKGPQGVQKMKAERWRWNGGLELVDFNEGVGLYPKSHGKFTEVPDLCLQK DTSGYREAESPDKKLLQGSEPLWVSDGVYLGSGGGEDFKENGKKVWLGDLGEKRVEKQPS LERIPGEEQTQAIRIFCMGDLADRCLGCFQVSPNYEVNYWVFRMSQVQRLAG >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_2|519_bp atgacgttgtcagaccaggccattgccagtgtgaagggacctcagggtgtccagaagatg aaagcagagaggtggcggtggaatgggggcctggaacttgtcgacttcaatgaaggagtt ggactttaccctaagagccatggaaagttcactgaagtccctgatttgtgtttgcagaaa gatacctctggctatcgggaagcagaaagcccagacaagaagctgcttcaaggctcagag ccactttgggtctctgatggagtctacctgggatcaggtggtggagaggacttcaaagaa aatgggaagaaggtttggttaggggacttaggagagaagagagtggaaaagcagccatcc ctggagaggatccctggagaggagcaaacccaggccatccggattttctgcatgggagac ctggctgaccgctgcctgggctgctttcaagtctcaccgaattatgaagtcaactactgg gtgtttagaatgtctcaagttcaaagactagcaggctag >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_3|1065_aa MNWSIFEGLLSGVNKYSTAFGRIWLSLVFIFRVLVYLVTAERVWSDDHKDFDCNTRQPGC SNVCFDEFFPVSHVRLWALQLILVTCPSLLVVMHVAYREVQEKRHREAHGENSGRLYLNP GKKRGGLWWTYVCSLVFKASVDIAFLYVFHSFYPKYILPPVVKCHADPCPNIVDCFISKP SEKNIFTLFMVATAAICILLNLVELIYLVSKRCHECLAARKAQAMCTGHHPHGTTSSCKQ DDLLSGDLIFLGSDSHPPLLPDRPRDHVKKTILMEIVRANAQGWREEGVHRRTHMRAPSS CVAHCQNLIKVNSFAGSRPQGSQGLSGQVAPRYRPSTCSTQDTASMNWAFLQGLLSGVNK YSTVLSRIWLSVVFIFRVLVYVVAAEEVWDDEQKDFVCNTKQPGCPNVCYDEFFPVSHVR LWALQLILVTCPSLLVVMHVAYREERERKHHLKHGPNAPSLYDNLSKKRGGLWWTYLLSL IFKAAVDAGFLYIFHRLYKDYDMPRVVACSVEPCPHTVDCYISRPTEKKVFTYFMVTTAA ICILLNLSEVFYLVGKRCMEIFGPRHRRPRCRECLPDTCPPYVLSQGGHPEDGNSVLMKA GSAPVDAEKRTATKLMAHWGSWGEREEDLTSRHPEKGSLRHLEKEDFAKQGGGIPDRGDS LCKGRVYLVTKMERTQELLGQRQRGWEQYGPTAQIQSKLYALKSIAITSSAQDQGTKSSH MAAQTTAKQRASSHPEGGKKLDEDRQTTASHATDLTPEHQGFLPGGRKEGGKMRLSQKRL PGGGQLVREELLEGAMDWKTLQALLSGVNKYSTAFGRIWLSVVFVFRVLVYVVAAERVWG DEQKDFDCNTKQPGCTNVCYDNYFPISNIRLWALQLIFVTCPSLLVILHVAYREERERRH RQKHGDQCAKLYDNAGKKHGGLWWTYLFSLIFKLIIEFLFLYLLHTLWHGFNMPRLVQCA NVAPCPNIVDCYIARPTEKKIFTYFMVGASAVCIVLTICELCYLICHRVLRGLHKDKPRG GCSPSSSASRASTCRCHHKLVEAGEVDPDPGNNKLQASAPNLTPI >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_3|3198_bp atgaactggagtatctttgagggactcctgagtggggtcaacaagtactccacagccttt gggcgcatctggctgtctctggtcttcatcttccgcgtgctggtgtacctggtgacggcc gagcgtgtgtggagtgatgaccacaaggacttcgactgcaatactcgccagcccggctgc tccaacgtctgctttgatgagttcttccctgtgtcccatgtgcgcctctgggccctgcag cttatcctggtgacatgcccctcactgctcgtggtcatgcacgtggcctaccgggaggtt caggagaagaggcaccgagaagcccatggggagaacagtgggcgcctctacctgaacccc ggcaagaagcggggtgggctctggtggacatatgtctgcagcctagtgttcaaggcgagc gtggacatcgcctttctctatgtgttccactcattctaccccaaatatatcctccctcct gtggtcaagtgccacgcagatccatgtcccaatatagtggactgcttcatctccaagccc tcagagaagaacattttcaccctcttcatggtggccacagctgccatctgcatcctgctc aacctcgtggagctcatctacctggtgagcaagagatgccacgagtgcctggcagcaagg aaagctcaagccatgtgcacaggtcatcacccccacggtaccacctcttcctgcaaacaa gacgacctcctttcgggtgacctcatctttctgggctcagacagtcatcctcctctctta ccagaccgcccccgagaccatgtgaagaaaaccatcttaatggaaatagtgagggccaat gcccagggttggagggaggagggcgttcatagaagaacacacatgcgggcaccttcatcg tgtgtggcccactgtcagaacttaataaaagtcaactcatttgctggttccaggcctcaa ggctcccaaggcctgagtgggcaggtagcacccaggtatagaccttccacgtgcagcacc caggacacagccagcatgaactgggcatttctgcagggcctgctgagtggcgtgaacaag tactccacagtgctgagccgcatctggctgtctgtggtgttcatctttcgtgtgctggtg tacgtggtggcagcggaggaggtgtgggacgatgagcagaaggactttgtctgcaacacc aagcagcccggctgccccaacgtctgctatgacgagttcttccccgtgtcccacgtgcgc ctctgggccctacagctcatcctggtcacgtgcccctcactgctcgtggtcatgcacgtg gcctaccgcgaggaacgcgagcgcaagcaccacctgaaacacgggcccaatgccccgtcc ctgtacgacaacctgagcaagaagcggggcggactgtggtggacgtacttgctgagcctc atcttcaaggccgccgtggatgctggcttcctctatatcttccaccgcctctacaaggat tatgacatgccccgcgtggtggcctgctccgtggagccttgcccccacactgtggactgt tacatctcccggcccacggagaagaaggtcttcacctacttcatggtgaccacagctgcc atctgcatcctgctcaacctcagtgaagtcttctacctggtgggcaagaggtgcatggag atcttcggccccaggcaccggcggcctcggtgccgggaatgcctacccgatacgtgccca ccatatgtcctctcccagggagggcaccctgaggatgggaactctgtcctaatgaaggct gggtcggccccagtggatgcagagaagaggacagctaccaaactcatggcgcactgggga agctggggagagagggaagaggatctaacatccaggcatccagaaaaaggctccctgaga catctagagaaggaggattttgccaagcaaggaggcggaattccagacagaggggatagc ttatgcaaagggagagtatacctggtgacaaagatggaaagaacccaggagctcttgggc cagagacagcgaggctgggaacaatatggccctacagctcagatacaatctaagctctac gccctgaaatccattgccatcacctcttcagcccaggatcaaggcaccaaaagctcccac atggcagctcaaaccacagccaagcaaagagcctcctcccatccagaaggagggaagaag ctggatgaggacagacagactactgccagccacgcaactgacttgacccctgagcaccag ggattcttacctggtggcaggaaggagggaggaaaaatgaggctcagccaaaaaaggctt cctggaggagggcagctggtcagggaggaacttctggagggcgccatggactggaagaca ctccaggccctactgagcggtgtgaacaagtactccacagcgttcgggcgcatctggctg tccgtggtgttcgtcttccgggtgctggtatacgtggtggctgcagagcgcgtgtggggg gatgagcagaaggactttgactgcaacaccaagcagcccggctgcaccaacgtctgctac gacaactacttccccatctccaacatccgcctctgggccctgcagctcatcttcgtcaca tgcccctcgctgctggtcatcctgcacgtggcctaccgtgaggagcgggagcgccggcac cgccagaaacacggggaccagtgcgccaagctgtacgacaacgcaggcaagaagcacgga ggcctgtggtggacctacctgttcagcctcatcttcaagctcatcattgagttcctcttc ctctacctgctgcacactctctggcatggcttcaatatgccgcgcctggtgcagtgtgcc aacgtggccccctgccccaacatcgtggactgctacattgcccgacctaccgagaagaaa atcttcacctacttcatggtgggcgcctccgccgtctgcatcgtactcaccatctgtgag ctctgctacctcatctgccacagggtcctgcgaggcctgcacaaggacaagcctcgaggg ggttgcagcccctcgtcctccgccagccgagcttccacctgccgctgccaccacaagctg gtggaggctggggaggtggatccagacccaggcaataacaagctgcaggcttcagcaccc aacctgacccccatctga >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_4|409_aa MHSSIFLGGSGCLAKGGLERPPPSPSRFLPPPRPSALFKAPPPLVRSSRAPAGVTPAIVP TSTWAARQAGDGGPGAMGDWGFLEKLLDQVQEHSTVVGKIWLTVLFIFRILILGLAGESV WGDEQSDFECNTAQPGCTNVCYDQAFPISHIRYWVLQFLFVSTPTLVYLGHVIYLSRREE RLRQKEGELRALPAKDPQVERALAAVERQMAKISVAEDGRLRIRGALMGTYVASVLCKSV LEAGFLYGQWRLYGWTMEPVFVCQRAPCPYLVDCFVSRPTEKTIFIIFMLVVGLISLVLN LLELVHLLCRCLSRGMRARQGQDAPPTQGTSSDPYTDQVFFYLPVGQGPSSPPCPTYNGL SSSEQNWANLTTEERLASSRPPLFLDPPPQNGQKPPSRPSSSASKKQYV >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_4|1230_bp atgcattccagcatcttcctgggtggttctggctgcctggccaagggtggcttggagcgc ccgccgccctccccgtcgcgtttcctgcccccaccccgcccctctgcgctatttaaggcg cccccgccgctcgtgcggtccagcagggctcccgcgggcgtcactccggccatcgtcccc acctccacctgggccgcccggcaggcaggcgacggaggcccgggagccatgggtgactgg ggcttcctggagaagttgctggaccaggtccaggagcactcgaccgtggtgggtaagatc tggctgacggtgctcttcatcttccgcatcctcatcctgggcctggccggcgagtcagtg tggggtgacgagcaatcagatttcgagtgtaacacggcccagccaggctgcaccaacgtc tgctatgaccaggccttccccatctcccacatccgctactgggtgctgcagttcctcttc gtcagcacacccaccctggtctacctgggccatgtcatttacctgtctcggcgagaagag cggctgcggcagaaggagggggagctgcgggcactgccggccaaggacccacaggtggag cgggcgctggcggccgtagagcgtcagatggccaagatctcggtggcagaagatggtcgc ctgcgcatccgcggagcactgatgggcacctatgtcgccagtgtgctctgcaagagtgtg ctagaggcaggcttcctctatggccagtggcgcctgtacggctggaccatggagcccgtg tttgtgtgccagcgagcaccctgcccctacctcgtggactgctttgtctctcgccccacg gagaagaccatcttcatcatcttcatgttggtggttggactcatctccctggtgcttaac ctgctggagttggtgcacctgctgtgtcgctgcctcagccgggggatgagggcacggcaa ggccaagacgcacccccgacccagggcacctcctcagacccttacacggaccaggtcttc ttctacctccccgtgggccaggggccctcatccccaccatgccccacctacaatgggctc tcatccagtgagcagaactgggccaacctgaccacagaggagaggctggcgtcttccagg ccccctctcttcctggacccaccccctcagaatggccaaaaacccccaagtcgtcccagc agctctgcttctaagaagcagtatgtatag >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_5|156_aa MTSGASLVLALPLQPHLAARPVHAYQCLCLLSMGSESSIASSLSSSKQRAWRRHHGLNYL VKSDITKEMPASNKPSILQSPVPDQCLEPICACVCVAASLGPNLSCQRSKFLSPSLTPAL TPSSEWPLWLIGKIQSLPLLYIAIVLTEVWVHGEQI >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_5|471_bp atgacctccggggccagcctggtcctggccctgcctctgcagccccacctggcagccagg cctgtccatgcttatcagtgcctgtgtttgctcagcatggggtctgagtcctccattgca tcctctttatcctccagcaaacagagagcatggagaaggcaccacggccttaactacctt gtgaagagtgacatcaccaaggagatgcctgcgtccaacaaacctagcattctccagagc cccgtgcctgaccagtgcttggagcccatatgtgcttgtgtttgtgtggccgccagtcta ggacccaacttgtcttgccaaaggtccaagttcctgagccccagcctgaccccggctctg acaccaagctcagagtggcctttgtggctcattggcaagattcagtctcttccacttctg tacattgccattgtcctcactgaggtctgggtgcatggagagcagatatga >gi568815597f:34657331_34858149|GENSCAN_predicted_peptide_6|260_aa XVMWPVFWTVVRTYAPYVTFPVAFVVGAVGYHLEWFIRGKDPQPVEEEKSISERREDRKL DELLGKDHTQVTLSCACCCVSTSRLGSCQNKPAREPFDSAVLLRFAFCNTEEAEGIVIIG SGKINLKGDLMSICKPIEGYHSGQSDRRTRGNGFSMQQRPFGLCHCLEGLQKTRGCQQRL LDTEYTIVLNWQHPMVWGGKWWLKKLRLRELTIVSQGHTAMWQIRAWTPASLLHNQALAP ADYGVSCPCPLAVEAAGEQG >gi568815597f:34657331_34858149|GENSCAN_predicted_CDS_6|783_bp natgtcatgtggcctgtgttttggaccgtggttcgtacctatgctccttatgtcacattc cctgttgccttcgtggtcggggctgtgggttaccacctggaatggttcatcaggggaaag gacccccagcccgtggaggaggaaaagagcatctcagagcgccgggaggatcgcaagctg gatgagcttctaggcaaggaccacacgcaggtgactctttcctgtgcctgttgctgtgtt agcacaagtcggctgggcagctgccagaacaagcctgccagggagccttttgactctgct gtcttgcttcgctttgctttttgtaacacggaggaagctgaaggcattgtgattatcggc tccgggaagataaacttgaagggtgacttaatgagcatttgtaaacctatcgaaggttat cattcagggcagagtgaccggcggacacgcggaaatgggtttagtatgcagcaaaggcct tttgggctctgccattgtctggaggggctgcaaaagactaggggctgccagcagaggctt ttggacacagaatacactattgtgctgaactggcagcaccccatggtgtggggaggaaag tggtggctgaagaaactgaggctcagagaattaactattgtttctcaaggtcacacagcc atgtggcagatccgggcctggactccggcatccctgcttcacaaccaggccctggcacct gctgattacggagtgtcctgcccctgccctctggctgtggaggcagctggggagcagggc tga