GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:06:02 Sequence gi568815587r:122957737_123161324 : 203588 bp : 44.30% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 9788 10120 333 1 0 55 47 140 0.805 2.62 1.02 PlyA + 10492 10497 6 1.05 2.05 PlyA - 10611 10606 6 1.05 2.04 Term - 20155 19913 243 1 0 68 52 361 0.981 26.40 2.03 Intr - 21721 21525 197 2 2 118 80 296 0.999 30.93 2.02 Intr - 23949 23674 276 1 0 99 114 237 0.420 24.89 2.01 Init - 32082 31872 211 0 1 102 -7 509 0.111 41.75 2.00 Prom - 32540 32501 40 -2.86 3.00 Prom + 37858 37897 40 -1.56 3.01 Init + 56008 56085 78 0 0 59 78 55 0.366 2.66 3.02 Intr + 60018 60287 270 2 0 48 15 235 0.301 9.94 3.03 Term + 60326 60928 603 1 0 -11 46 482 0.464 28.62 3.04 PlyA + 61854 61859 6 1.05 4.00 Prom + 86434 86473 40 -3.96 4.01 Sngl + 93475 93741 267 0 0 21 53 192 0.654 4.24 4.02 PlyA + 93762 93767 6 1.05 5.10 PlyA - 94276 94271 6 1.05 5.09 Term - 95761 95655 107 2 2 130 47 82 0.836 6.87 5.08 Intr - 100183 100057 127 1 1 96 -6 126 0.055 4.25 5.07 Intr - 100748 100516 233 0 2 69 72 389 0.999 32.89 5.06 Intr - 101094 100896 199 0 1 47 86 247 0.991 19.32 5.05 Intr - 101525 101323 203 0 2 87 91 194 0.998 18.60 5.04 Intr - 102292 101737 556 1 1 57 92 774 0.967 67.12 5.03 Intr - 102532 102380 153 1 0 86 95 131 0.999 13.87 5.02 Intr - 103062 102857 206 1 2 56 100 185 0.998 15.42 5.01 Init - 103588 103384 205 1 1 94 97 197 0.848 20.21 5.00 Prom - 104436 104397 40 -4.16 6.00 Prom + 108588 108627 40 -1.76 6.01 Init + 111729 111736 8 2 2 72 89 7 0.771 -0.81 6.02 Term + 115858 116065 208 1 1 67 41 210 0.884 11.01 6.03 PlyA + 116789 116794 6 1.05 7.10 PlyA - 118590 118585 6 1.05 7.09 Term - 122205 122134 72 0 0 48 32 104 0.197 -1.29 7.08 Intr - 125471 125368 104 0 2 67 42 93 0.197 2.49 7.07 Intr - 126111 125944 168 1 0 65 111 147 0.978 14.62 7.06 Intr - 126977 126776 202 2 1 112 82 191 0.980 19.76 7.05 Intr - 140216 140059 158 0 2 48 103 115 0.842 8.73 7.04 Intr - 178390 178225 166 1 1 46 33 184 0.097 8.23 7.03 Intr - 182517 182366 152 1 2 69 -12 128 0.014 0.78 7.02 Intr - 195194 195067 128 1 2 111 25 63 0.002 2.62 7.01 Intr - 201092 200993 100 0 1 53 62 67 0.010 -0.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 32082 31858 225 0 0 102 53 503 0.887 43.34 S.002 Init - 144398 144365 34 2 1 101 58 10 0.870 -0.63 S.003 Term - 178390 178184 207 1 0 46 49 176 0.821 6.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_1|110_aa MNGDVGQGELMKGNCMKVSFRAVTPVNPFLNSHAAKHSPCTATVKDTALVSLESGTIGSI EDAQYVLKKESIVNMCTVNTETSGLPFASQNVGTLSYKQEFGRKVLLWGI >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_1|333_bp atgaatggagacgtggggcagggggagctaatgaagggtaactgcatgaaagtcagtttc agagcagttacacctgtaaatcctttcctcaattcccatgcagccaagcacagcccttgc actgccacagtaaaagacactgccctagtgtctttggagtcagggacaattggttctatt gaggatgcacagtacgtgctgaaaaaagagtcaatagtgaacatgtgtactgtaaacaca gaaacctctggtcttccatttgcctcccagaacgttgggactttatcctacaagcaggaa tttggaaggaaagttcttctctggggaatctga >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_2|308_aa MAKTITTTIITTTTITITTTNTTITTTTITTTTTITITTTTIITTTTTTTTTTIITTTII TSSSSSNNSSGAPLKMNLNFTSPLHPASSQRPTSFFIEDILLHKPKPLREVAPDHFASSL ASRVPLLDYGYPLMPTPTLLAPHAHHPLHKGDHHHPYFLTTSGMPVPALFPHPQHAELPG KHCRRRKARTVFSDSQLSGLEKRFEIQRYLSTPERVELATALSLSETQVKTWFQNRRMKH KKQLRKSQDEPKAPDGPESPEGSPRGSEAATAAEARLSLPAGPFVLTEPEDEVDIGDEGE LGSGPHVL >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_2|927_bp atggccaaaaccatcaccactactatcatcaccacaaccaccatcaccatcaccaccacc aacaccaccatcaccaccaccaccatcaccaccaccaccaccatcaccatcaccaccacc accatcatcaccaccaccaccactaccaccactactaccatcatcaccaccactatcatc accagcagcagcagcagcaacaacagcagtggtgctccgctcaagatgaatctcaacttc acctctcctctacacccggcgtcttctcagaggcccacatccttcttcatcgaggacatc ctgctgcacaagcccaagccgctgagagaggtggccccagaccatttcgccagctctctg gcctctcgggtgcctctgctagactatggctaccccctcatgcccacacccaccctcttg gctcctcacgcccatcaccctctgcataagggagaccaccatcatccttatttcctcacc acctcggggatgccagtcccagcgctgttcccgcacccgcagcacgcggagctgccgggg aagcactgccgccgccgcaaagcccgcacggttttctctgactcgcagctctcgggcttg gagaagaggttcgagatccagcgctacctgtccacgccagaacgagtggagctggccacg gccctcagcctgtccgagacgcaggtgaaaacgtggttccagaaccggcggatgaagcat aaaaagcaactgcggaaaagccaagacgaacccaaagcaccagacgggccagaaagcccc gagggcagcccccgcggttcagaggccgccaccgccgccgaggctcggctgagcctgccc gccggtcccttcgtgctgaccgagccagaggacgaggtggacattggagacgagggggag ctgggctcagggccgcacgtgctctga >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_3|316_aa MTYQRLCQEDDWFKAILPQKLPPNLEVLLAGMKGLGAEIAKNLILAGVKGLTMLDHKQIS PEEPGAQFLIRIGSVGRNRAEASLERAQNLNPMVDVKLDTEDIEKKPESFFTQFDAVDQI CHKNSIKFFAGDVFSYHGYTFANLGEHEFVEEKTKVAKVSQGVEDGPDTKRVKLDSSETT MVKKKVVFCPVKEALEVDWSSKKAKAALKRTTSDHFLLQVLLKFRTDKGRDPSSDTHGED SELLLQIRNDVLDSLGIIPDPRFITYFFSEMAPVCAVVGGILAQEIVKALSQQDPPHNFF FNGMKGNGILEWLGPK >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_3|951_bp atgacttaccagaggctttgccaagaagatgactggtttaaagcaatcctaccacagaag ctgccacccaacctggaggtgcttcttgcaggcatgaaaggactcggggctgaaattgcc aagaatctcatcctggcaggagtgaaaggactgaccatgctggatcacaaacagatatct ccagaagaacccggagctcagttcttgattcgtattgggtctgttggccgaaatagggct gaagcctctttggagcgagctcagaatcttaaccccatggtggatgtgaagttggacact gaggatatagagaagaaaccagagtcatttttcactcaatttgatgctgttgaccagatc tgtcacaaaaatagcatcaagttctttgcaggagatgtttttagctaccatggatacaca tttgccaatctaggagaacatgagtttgtagaggagaaaactaaagttgctaaagttagc caaggagtagaagatgggcctgataccaagagagtaaaacttgattcttctgagacaacg atggtcaagaagaaggtggtcttctgccccgttaaagaagcgctggaggtggactggagc agtaagaaagcaaaggctgctctgaagcgcacgacctccgaccactttctccttcaagtg ctcctaaagttccgcacagataaaggaagagatcccagttctgatacacacggggaagat tccgagttgttgctccagatacgaaacgatgtgcttgactcactgggtattattcctgac ccgcgctttatcacgtacttcttctctgagatggccccagtgtgtgcggtggttggaggg attttggcacaggaaattgtgaaggccctgtctcagcaggaccctcctcacaacttcttc ttcaatggcatgaaggggaatgggattctggagtggcttggccccaagtga >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_4|88_aa MRTREVGFKKGAPWALREIRKLAMEMRTPDVHIDIRLNKTVWAKGIRNVPYYIHIHVHRK HEDEDSPKTALYLCTCYHFQKCIDRQHG >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_4|267_bp atgcgtacccgtgaagtgggcttcaaaaagggtgccccttgggcactcagagagatccgg aaattggccatggagatgcggactccagatgtacacattgatatcaggcttaacaaaact gtctgggccaaaggaataaggaatgtcccatactatatccatatccatgtgcacagaaaa catgaggatgaagattcaccaaaaacagcgctgtacctatgtacctgttaccactttcaa aaatgtatagaccgtcaacatggatga >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_5|662_aa MSKGPAVGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVA MNPTNTVFDAKRLIGRRFDDAVVQSDMKHWPFMVVNDAGRPKVQVEYKGETKSFYPEEVS SMVLTKMKEIAEAYLGKTVTNAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAA IAYGLDKKVGAERNVLIFDLGGGTFDVSILTIEDGIFEVKSTAGDTHLGGEDFDNRMVNH FIAEFKRKHKKDISENKRAVRRLRTACERAKRTLSSSTQASIEIDSLYEGIDFYTSITRA RFEELNADLFRGTLDPVEKALRDAKLDKSQIHDIVLVGGSTRIPKIQKLLQDFFNGKELN KSINPDEAVAYGAAVQAAILSGDKSENVQDLLLLDVTPLSLGIETAGGVMTVLIKRNTTI PTKQTQTFTTYSDNQPGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDI DANGILNVSAVDKSTGKENKITITNDKGRLSKEDIERMVQEAEKYKAEDEKQRDKVSSKN SLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIINWLDKNQTAEKEEFEHQQKELE KVCNPIITKLYQSAGGMPGGMPGGFPGGWGANIRVACDEKKVIKPGADLGKGKLAITELP RS >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_5|1989_bp atgtccaagggacctgcagttggtattgatcttggcaccacctactcttgtgtgggtgtt ttccagcacggaaaagtcgagataattgccaatgatcagggaaaccgaaccactccaagc tatgtcgcctttacggacactgaacggttgatcggtgatgccgcaaagaatcaagttgca atgaaccccaccaacacagtttttgatgccaaacgtctgattggacgcagatttgatgat gctgttgtccagtctgatatgaaacattggccctttatggtggtgaatgatgctggcagg cccaaggtccaagtagaatacaagggagagaccaaaagcttctatccagaggaggtgtct tctatggttctgacaaagatgaaggaaattgcagaagcctaccttgggaagactgttacc aatgctgtggtcacagtgccagcttactttaatgactctcagcgtcaggctaccaaagat gctggaactattgctggtctcaatgtacttagaattattaatgagccaactgctgctgct attgcttacggcttagacaaaaaggttggagcagaaagaaacgtgctcatctttgacctg ggaggtggcacttttgatgtgtcaatcctcactattgaggatggaatctttgaggtcaag tctacagctggagacacccacttgggtggagaagattttgacaaccgaatggtcaaccat tttattgctgagtttaagcgcaagcataagaaggacatcagtgagaacaagagagctgta agacgcctccgtactgcttgtgaacgtgctaagcgtaccctctcttccagcacccaggcc agtattgagatcgattctctctatgaaggaatcgacttctatacctccattacccgtgcc cgatttgaagaactgaatgctgacctgttccgtggcaccctggacccagtagagaaagcc cttcgagatgccaaactagacaagtcacagattcatgatattgtcctggttggtggttct actcgtatccccaagattcagaagcttctccaagacttcttcaatggaaaagaactgaat aagagcatcaaccctgatgaagctgttgcttatggtgcagctgtccaggcagccatcttg tctggagacaagtctgagaatgttcaagatttgctgctcttggatgtcactcctctttcc cttggtattgaaactgctggtggagtcatgactgtcctcatcaagcgtaataccaccatt cctaccaagcagacacagaccttcactacctattctgacaaccagcctggtgtgcttatt caggtttatgaaggcgagcgtgccatgacaaaggataacaacctgcttggcaagtttgaa ctcacaggcatacctcctgcaccccgaggtgttcctcagattgaagtcacttttgacatt gatgccaatggtatactcaatgtctctgctgtggacaagagtacgggaaaagagaacaag attactatcactaatgacaagggccgtttgagcaaggaagacattgaacgtatggtccag gaagctgagaagtacaaagctgaagatgagaagcagagggacaaggtgtcatccaagaat tcacttgagtcctatgccttcaacatgaaagcaactgttgaagatgagaaacttcaaggc aagattaacgatgaggacaaacagaagattctggacaagtgtaatgaaattatcaactgg cttgataagaatcagactgctgagaaggaagaatttgaacatcaacagaaagagctggag aaagtttgcaaccccatcatcaccaagctgtaccagagtgcaggaggcatgccaggagga atgcctgggggatttcctggtgggtggggagccaacattagagttgcatgtgatgagaag aaagtgatcaaacctggagcagatttgggtaaaggaaagttggcaataacagaattgcca agatcttga >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_6|71_aa MAQLYAWVASPGWGAASVDSVRWLREALFAVERVEEEPERELREPEEEELGFTRRAFGAS ASSLKRKNSKD >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_6|216_bp atggctcagctgtatgcctgggtggccagccctggctggggtgctgcgtcagttgacagt gtccgctggctgcgtgaggcactatttgctgtggagcgagtggaggaagaaccagagcgt gagctccgagagcctgaggaagaggagctgggtttcacaagacgggcttttggagcttca gcatcttctctgaagagaaaaaacagcaaagattaa >gi568815587r:122957737_123161324|GENSCAN_predicted_peptide_7|416_aa XPPFVNENNLGLTGVKKLAQVQQPGSDKSDFKPRYYAHNLYAQDPSPGPPEYVWDIYDIN GKGYRGQCGHKGKKWQVTLVLFSPDLLSDPDNYALANPINTASHIKPEWYFLFAYATLRS IPNKLGGIASHSGYVQINWKRVEKDSNKAKRQMKKRANKAAPEINNLIEEAIEFIKQHIV IPVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKVVITYS SRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHVILK VLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRIDY NHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCVIAGFDIELRSRRLIPGVICLTGG >gi568815587r:122957737_123161324|GENSCAN_predicted_CDS_7|1251_bp ncaccaccctttgtaaatgaaaacaacctggggctcacaggggtgaagaaacttgcccaa gttcaacagccaggaagtgacaaatcagatttcaaacccagatattacgcacacaacctc tatgctcaagatccttcaccaggccctcctgagtatgtttgggacatatatgatataaat gggaaaggatacagggggcaatgtggacacaaagggaaaaagtggcaagtaaccctagta ctattttcacctgacctcctgagcgacccagataactacgctttagccaaccccatcaac accgcatcccacattaagccagagtggtacttcctgtttgcctatgcaactttacgatcc atccctaacaaactaggaggcattgctagtcatagtggctacgtgcagattaactggaag agagttgaaaaagattcaaacaaagcaaaaaggcagatgaagaaacgagcgaacaaagca gcaccggaaatcaacaatttaattgaagaagcaatagaatttatcaagcagcacattgtg atacccgtttcctactatgttggaaccttggggactcacactgagatcaagagagtggca gaggaaaaggtcactttgccctgccaccatcaactggggcttccagaaaaagacactctg gatattgaatggctgctcaccgataatgaagggaaccaaaaagtggtgatcacttactcc agtcgtcatgtctacaataacttgactgaggaacagaagggccgagtggcctttgcttcc aatttcctggcaggagatgcctccttgcagattgaacctctgaagcccagtgatgagggc cggtacacctgtaaggttaagaattcagggcgctacgtgtggagccatgtcatcttaaaa gtcttagtgagaccatccaagcccaagtgtgagttggaaggagagctgacagaaggaagt gacctgactttgcagtgtgagtcatcctctggcacagagcccattgtgtattactggcag cgaatccgagagaaagagggagaggatgaacgtctgcctcccaaatctaggattgactac aaccaccctggacgagttctgctgcagaatcttaccatgtcctactctggactgtaccag tgcacagcaggcaacgaagctgggaaggaaagctgtgtgattgctggatttgacatcgag ctacgaagtcgcagactaattcctggggtcatctgtttaactggtggataa