GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:15:57 Sequence gi568815588r:80055856_80272861 : 217006 bp : 46.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 22845 22931 87 2 0 110 70 122 0.963 12.15 1.02 Intr + 23228 23329 102 1 0 42 77 81 0.853 2.77 1.03 Intr + 25986 26089 104 2 2 98 82 77 0.636 7.07 1.04 Term + 34942 35062 121 1 1 109 45 23 0.071 -1.95 1.05 PlyA + 35988 35993 6 1.05 2.00 Prom + 42473 42512 40 -3.26 2.01 Sngl + 50051 50455 405 1 0 82 38 203 0.660 10.98 2.02 PlyA + 51410 51415 6 1.05 3.00 Prom + 67401 67440 40 -4.66 3.01 Init + 76908 76971 64 2 1 105 117 216 0.999 25.51 3.02 Intr + 86227 86324 98 2 2 83 77 50 0.924 3.13 3.03 Intr + 88368 88488 121 2 1 93 83 137 0.980 13.77 3.04 Term + 89045 89055 11 0 2 98 52 10 0.864 -3.24 3.05 PlyA + 89157 89162 6 1.05 4.16 PlyA - 90488 90483 6 1.05 4.15 Term - 92833 92615 219 1 0 32 46 160 0.248 3.24 4.14 Intr - 101908 101786 123 1 0 55 72 287 0.987 24.58 4.13 Intr - 102170 102112 59 0 2 65 75 61 0.959 1.10 4.12 Intr - 103340 103245 96 0 0 73 106 192 0.965 19.48 4.11 Intr - 106344 106080 265 0 1 134 110 147 0.805 18.59 4.10 Intr - 107550 107494 57 0 0 91 109 80 0.762 9.48 4.09 Intr - 107758 107679 80 2 2 110 80 87 0.963 9.37 4.08 Intr - 108288 108198 91 0 1 72 94 141 0.984 12.67 4.07 Intr - 110342 110229 114 2 0 30 93 117 0.959 7.04 4.06 Intr - 111129 111023 107 1 2 128 23 208 0.843 18.23 4.05 Intr - 111458 111371 88 2 1 2 103 143 0.997 6.74 4.04 Intr - 113503 113114 390 1 0 122 109 241 0.992 24.52 4.03 Intr - 115060 114945 116 2 2 72 101 103 0.793 10.17 4.02 Intr - 117014 116952 63 0 0 111 110 38 0.512 6.99 4.01 Init - 122246 122093 154 2 1 62 65 92 0.286 2.75 4.00 Prom - 128715 128676 40 -5.66 5.00 Prom + 131596 131635 40 -4.76 5.01 Init + 139445 139626 182 1 2 67 77 114 0.716 6.86 5.02 Intr + 142964 143080 117 2 0 46 84 55 0.399 0.48 5.03 Intr + 145480 145508 29 1 2 102 105 0 0.586 0.86 5.04 Intr + 148401 148518 118 1 1 75 84 57 0.551 3.52 5.05 Term + 151854 151917 64 0 1 101 39 70 0.116 0.76 5.06 PlyA + 151932 151937 6 1.05 6.04 PlyA - 153487 153482 6 1.05 6.03 Term - 169640 169411 230 0 2 -39 40 370 0.761 16.29 6.02 Intr - 178005 177783 223 0 1 59 101 40 0.001 0.10 6.01 Init - 184600 184544 57 1 0 76 100 29 0.019 4.21 6.00 Prom - 184765 184726 40 -6.26 7.05 PlyA - 184941 184936 6 1.05 7.04 Term - 191765 191364 402 2 0 47 36 568 0.061 42.65 7.03 Intr - 196744 196538 207 1 0 -50 80 247 0.723 9.37 7.02 Intr - 197203 196949 255 1 0 -44 53 465 0.520 27.44 7.01 Init - 197528 197460 69 2 0 56 75 104 0.870 6.95 7.00 Prom - 198239 198200 40 -1.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100057 99998 60 1 0 92 47 81 0.840 2.20 S.002 Init - 170830 170821 10 1 1 71 82 9 0.854 -1.16 S.003 Intr + 177809 177928 120 1 0 129 113 97 0.987 16.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_1|137_aa MATAAGATYFQRGSLFWFTVITLSFGYYTRMVKSEASNAATAYFRPARPLPSLVTVLGLG YFAWVVFWPQSIPYQNLGPLGPFTQYLVDHHHTLLCNGHKGITSGRAQLLWFLQTFFFGI ASLTILIAYKRKRQKQT >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_1|414_bp atggctacggcagccggcgcgacctactttcagcgaggcagtctgttctggttcacagtc atcaccctcagctttggctactacacaaggatggtgaagtcggaggccagcaatgcggct accgcctatttccgtccagcaagacccttgccctctctggtcacagtcctggggctgggc tacttcgcgtgggttgtcttctggcctcagagtatcccttatcagaaccttgggcccctg ggccccttcactcagtacttggtggaccaccatcacaccctcctgtgcaatgggcataaa ggcatcacaagtggtcgggctcagctactctggttcctacagactttcttctttgggata gcgtctctcaccatcttgattgcttacaaacggaagcgccaaaaacaaacttga >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_2|134_aa MELDEWERIGRDFKKAYKDGAEIPVSIWSMWALIKAALEPFQTEDEADSDEEEEDECKKL TSDSECEEQLLEEIKEKKGKLRKACFTSPSAPSAELSEWPPPLSPLNGRENELAEKLTAP VVATLKPGATGGVV >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_2|405_bp atggagttggatgagtgggagagaattggaagagattttaaaaaggcgtataaagatgga gcagaaattccagtttccatttggtcaatgtgggcactaataaaggcagctcttgagcca tttcaaacagaagatgaggcagattcagatgaggaagaggaggacgagtgtaaaaaacta acttcagattctgagtgtgaggaacagctactggaggagattaaagaaaagaaaggaaaa cttagaaaagcatgttttactagcccgtcagctccatctgctgaattaagtgaatggcca cctcctctctctccccttaatgggagagaaaatgaattagctgaaaaacttactgctcct gtagttgcaacattaaaacctggagcaactggtggtgttgtataa >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_3|97_aa MRPLLCALTGLALLRAAGSLAAAEPFSPPRGDSAQSTACDRHMAVQRRLDVMEEMVEKTV DHLGTEVKGLLGLLEELAWNLPPGPFSPAPDLLGDGF >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_3|294_bp atgcggcccctgctctgcgcgctgaccggactggccctgctccgcgccgcgggctctttg gccgctgccgaacccttcagccctccgcgaggagactcagctcagagcacagcgtgtgac agacacatggctgtgcaacgccgtctagatgtcatggaggagatggtagagaagaccgtg gatcacctggggacagaggtgaaaggcctgctgggcctgctggaggagctggcctggaac ctgcccccgggacccttcagccccgctcccgaccttctcggagatggcttctga >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_4|673_aa MAALGCQERLLQGQPLAVASFEKMQRSQLPEDWTMSRFLNFQRKGGEECCFYLTMSYPGY PPPPGGYPPAAPGGGPWGGAAYPPPPSMPPIGLDNVATYAGQFNQDYLSGMAANMSGTFG GANMPNLYPGAPGAGYPPVPPGGFGQPPSAQQPVPPYGMYPPPGGNPPSRMPSYPPYPGA PVPGQPMPPPGQQPPGAYPGQPPVTYPGQPPVPLPGQQQPVPSYPGYPGSGTVTPAVPPT QFGSRGTITDAPGFDPLRDAEVLRKAMKGFGTDEQAIIDCLGSRSNKQRQQILLSFKTAY GKASCGDLIKDLKSELSGNFEKTILALMKTPVLFDIYEIKEAIKGVGTDEACLIEILASR SNEHIRELNRAYKAEFKKTLEEAIRSDTSGHFQRLLISLSQGNRDESTNVDMSLAQRDAQ GCYLRPSRHFRGHLVQRLQIGKYFRLKPSEVYQALLGDLGLGSGCGLCHMCVFLSLQELY AAGENRLGTDESKFNAVLCSRSRAHLVAVFNEYQRMTGRDIEKSICREMSGDLEEGMLAV VKCLKNTPAFFAERLNKAMRGAGTKDRTLIRIMVSRSETDLLDIRSEYKRMYGKSLYHDI SDCTGSDLKASTALALTQGLQLPLPGYCLCLLKALGLYNRLVAEPARLVSFPSEQQGPPG PSWAQRCHLGASN >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_4|2022_bp atggcagccctggggtgccaggaacgacttctccaaggacagccccttgcggtagcctct tttgaaaaaatgcagaggtcacagttaccagaagactggaccatgagcagatttcttaat tttcaaagaaaaggaggagaagaatgctgcttctatctaaccatgagctaccctggctat cccccgcccccaggtggctacccaccagctgcaccaggtggtggtccctggggaggtgct gcctaccctcctccgcccagcatgccccccatcgggctggataacgtggccacctatgcg gggcagttcaaccaggactatctctcgggaatggcggccaacatgtctgggacatttgga ggagccaacatgcccaacctgtaccctggggcccctggggctggctacccaccagtgccc cctggcggctttgggcagcccccctctgcccagcagcctgttcctccctatgggatgtat ccacccccaggaggaaacccaccctccaggatgccctcatatccgccatacccaggggcc cctgtgccgggccagcccatgccaccccccggacagcagcccccaggggcctaccctggg cagccaccagtgacctaccctggtcagcctccagtgccactccctgggcagcagcagcca gtgccgagctacccaggatacccggggtctgggactgtcacccccgctgtgcccccaacc cagtttggaagccgaggcaccatcactgatgctcccggctttgaccccctgcgagatgcc gaggtcctgcggaaggccatgaaaggcttcgggacggatgagcaggccatcattgactgc ctggggagtcgctccaacaagcagcggcagcagatcctactttccttcaagacggcttac ggcaaggcgagctgcggggatttgatcaaagatctgaaatctgaactgtcaggaaacttt gagaagacaatcttggctctgatgaagaccccagtcctctttgacatttatgagataaag gaagccatcaagggggttggcactgatgaagcctgcctgattgagatcctcgcttcccgc agcaatgagcacatccgagaattaaacagagcctacaaagcagaattcaaaaagaccctg gaagaggccattcgaagcgacacatcagggcacttccagcggctcctcatctctctctct cagggaaaccgtgatgaaagcacaaacgtggacatgtcactcgcccagagagatgcccag ggctgctacctgaggcctagcaggcactttagaggccatctagttcagaggttgcaaatt ggcaaatactttaggctcaaaccttcagaagtttaccaggctctcctgggtgacctgggc ctggggtctgggtgtggcctgtgccacatgtgcgtcttcctctctctccaggagctgtat gcggccggggagaaccgcctgggaacagacgagtccaagttcaatgcggttctgtgctcc cggagccgggcccacctggtagcagttttcaatgagtaccagagaatgacaggccgggac attgagaagagcatctgccgggagatgtccggggacctggaggagggcatgctggccgtg gtgaaatgtctcaagaataccccagccttctttgcggagaggctcaacaaggccatgagg ggggcaggaacaaaggaccggaccctgattcgcatcatggtgtctcgcagcgagaccgac ctcctggacatcagatcagagtataagcggatgtacggcaagtcgctgtaccacgacatc tcggactgcactgggtcagacctgaaagccagcacagcactggctctcacccaaggcctg cagttaccactccctggctactgcctgtgtctgctcaaggccctggggctctacaatcgg ctggtggcagagccagccaggcttgtgtccttcccttcagaacagcaaggtcctccaggc cccagctgggcccaaaggtgccatctgggagccagcaactag >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_5|169_aa MSPVSRMQLEVQGPAAHSGGENTFTWEWADPSAAGILSCVKQSGAKSHSHWATEEQHENR RWRGEDDDEKLNPTAKIRKGMHASIISNKQDAKNKIKYIWLIFLYSAVRGNTLQHKECLE ERGPVNPLESPTIDCRIKGMRKAMRASEGDREGQLLRKAHFSPWLSECS >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_5|510_bp atgtccccagtcagccgcatgcaactggaagttcaaggacctgcagctcactctggagga gaaaacaccttcacttgggaatgggctgacccttctgcagctggcatcctaagctgtgtc aagcagtctggagccaagagtcactcacactgggccactgaggaacagcatgaaaacaga aggtggcgtggggaagatgacgatgagaaacttaacccaacagcaaaaatcaggaaagga atgcatgcaagtataataagtaataagcaagatgcaaagaataaaatcaagtatatctgg ctcatcttcctttactcagctgttcgagggaacaccttgcagcacaaagaatgccttgag gaaaggggccctgttaatcccctggaatccccaacaattgactgccgcataaaaggaatg aggaaagcaatgagagccagtgaaggcgaccgcgaaggccagcttcttcgaaaggcccat ttttctccgtggctgagcgaatgctcctag >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_6|169_aa MSKGESRKCNEENVSKSSKPPYETWGRSTGSYSGAPDPFHLPSLTFLGDLHETVTDFTCC CSSLVSWSLSRDRNCGVRTINTFTTCRTRAEAYERGRRRRRRRRRRRRRRRRKEEKKKKK KKKEKEKEKEKEKEKKRKEEEEEEEEEEEEEEEEEEKMYAYGCPSQKNK >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_6|510_bp atgagcaagggtgaaagtagaaaatgcaatgaggaaaatgtatccaagtcttcaaagcca ccctatgagacatggggcagaagcactgggagctattcaggggcaccagaccccttccat ctgccctcactcaccttccttggggacttgcatgagactgtcactgactttacatgctgc tgcagctccttggtcagctggtccttgtcacgggacagaaactgtggggtcaggacaatc aacaccttcacgacctgcagaacaagggcagaggcttatgaaagaggaagaagaagaagg agaaggagaaggagaaggagaaggagaagaagaagaaaagaagaaaagaagaagaagaag aagaagaaggagaaggagaaggagaaggagaaggagaaggagaagaaaagaaaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaaaatgtatgca tatggatgtccaagccagaaaaacaaataa >gi568815588r:80055856_80272861|GENSCAN_predicted_peptide_7|310_aa MEIIEVTEEIHVKVVIEGPDATQEVDEVPEDQQLQEVDEVPEDLQLQGVDEVPEDQQLQE VNEVPEDQQLQKVDEVPEDHQLQEVDEVPEDHQLREVDEVPEDRQLQEELDKAPENNRVE EVVKFSGDSLVQEVAEFPEDSRVEVVEFPEDSPVEEFVEVPENLQMEGVFEFPDNTQCSA LRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVGIDIFTGKKYEDICPSTHNMDVPNI RRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLGKETEQKYDCGEEILITVLSAMTEE AAVAIKAMAK >gi568815588r:80055856_80272861|GENSCAN_predicted_CDS_7|933_bp atggagattattgaagtaaccgaggagattcatgtgaaggtggttattgaggggccagac gccacccaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgaggtc ccagaagacctacagctgcagggtgtggatgaagtcccagaggaccaacagctgcaggag gtgaatgaggtcccggaagaccaacagctgcaaaaggtggatgaggtcccggaggaccac cagctgcaggaggtggatgaggttccggaggaccaccagcttcgagaggtggatgaggtc ccggaggaccgacagctgcaggaggagctggataaggccccagagaacaatcgagtggag gaggtggttaagttttcaggggactctctagtgcaggaggtggctgagttcccagaggac agtcgagtggaggtggttgaattcccagaggactctccagtggaggagtttgttgaggtc ccagaaaaccttcagatggagggagtgtttgagttcccagacaacacccagtgctcagca ttacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatcgtggagatgtct gcttcgaagactggcaagcacggccacgccaaggtccatctggttggtattgacatcttt actgggaagaaatatgaagatatctgcccgtcaactcataatatggatgtccccaacatc agaaggaatgacttccagctgattggcatccaggatgggtacctatcactgctccaggac agcggggaggtaccagaggaccttcgtctccctgagggagaccttggcaaggagactgag cagaagtacgactgtggagaagagatcctgatcacggtgctgtctgccatgacagaggag gcagctgttgcaatcaaggccatggcaaaataa