GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:22:37 Sequence gi568815585f:99672825_99991410 : 318586 bp : 44.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 2529 2524 6 1.05 1.03 Term - 11904 11792 113 1 2 83 42 81 0.951 1.72 1.02 Intr - 20584 20553 32 0 2 70 85 0 0.029 -4.33 1.01 Init - 21736 21669 68 1 2 90 106 76 0.838 10.14 1.00 Prom - 22423 22384 40 -3.96 2.08 PlyA - 24256 24251 6 1.05 2.07 Term - 31585 31431 155 2 2 39 36 181 0.618 6.08 2.06 Intr - 33949 33867 83 0 2 73 57 56 0.218 0.28 2.05 Intr - 36491 36432 60 1 0 91 99 27 0.047 1.95 2.04 Intr - 57324 57192 133 1 1 -13 42 163 0.057 1.30 2.03 Intr - 64798 64710 89 0 2 47 94 69 0.163 3.01 2.02 Intr - 105978 105814 165 2 0 81 105 21 0.022 2.18 2.01 Init - 114977 114757 221 2 2 86 39 160 0.057 9.01 2.00 Prom - 117082 117043 40 -4.96 3.00 Prom + 118529 118568 40 -6.86 3.01 Init + 123341 123426 86 1 2 109 75 45 0.530 5.67 3.02 Intr + 144068 144185 118 2 1 27 82 136 0.228 7.37 3.03 Intr + 186037 186225 189 0 0 62 92 127 0.951 10.28 3.04 Intr + 191994 192087 94 2 1 107 6 50 0.588 -1.76 3.05 Intr + 193416 193583 168 1 0 95 64 200 0.986 18.22 3.06 Intr + 198114 198238 125 1 2 59 94 92 0.957 7.20 3.07 Term + 202279 202302 24 0 0 88 55 27 0.601 -2.38 3.08 PlyA + 202505 202510 6 1.05 4.00 Prom + 205550 205589 40 -4.26 4.01 Init + 213926 213992 67 1 1 103 69 76 0.286 6.59 4.02 Intr + 222168 222229 62 1 2 115 27 88 0.952 3.85 4.03 Term + 223557 223688 132 2 0 89 41 125 0.945 5.99 4.04 PlyA + 225215 225220 6 1.05 5.00 Prom + 229401 229440 40 -1.96 5.01 Init + 233787 233835 49 2 1 80 58 51 0.259 0.41 5.02 Intr + 238751 238884 134 0 2 104 68 107 0.490 10.66 5.03 Intr + 255608 255655 48 1 0 60 82 66 0.330 2.08 5.04 Intr + 256588 256640 53 0 2 111 81 38 0.416 3.11 5.05 Intr + 257611 257885 275 1 2 91 52 60 0.090 -0.12 5.06 Intr + 283584 283734 151 1 1 106 69 82 0.810 7.32 5.07 Intr + 284429 284542 114 2 0 106 51 53 0.269 2.96 5.08 Term + 287279 287399 121 2 1 48 42 125 0.428 1.85 5.09 PlyA + 287506 287511 6 1.05 6.05 PlyA - 289859 289854 6 1.05 6.04 Term - 292995 292553 443 1 2 134 39 389 0.889 34.02 6.03 Intr - 296162 296019 144 0 0 3 56 126 0.593 1.15 6.02 Intr - 296539 296450 90 2 0 60 85 41 0.558 0.97 6.01 Init - 298695 297303 1393 0 1 41 94 2023 0.978 187.02 6.00 Prom - 304350 304311 40 -4.76 7.00 Prom + 304457 304496 40 -7.36 7.01 Init + 309241 310315 1075 0 1 70 91 1566 0.990 148.84 7.02 Intr + 312122 312285 164 0 2 91 76 344 0.705 33.19 7.03 Term + 312499 312858 360 0 0 127 49 419 0.672 36.44 7.04 PlyA + 313097 313102 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 158055 158255 201 2 0 80 49 174 0.954 7.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_1|70_aa MEMEPAGTKCEKQPFIEVALMDGEERKRKPVFPGDHYVHCIIGLVDFAEIHEAVEHNQSA LVNIHPQNPL >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_1|213_bp atggagatggaacctgctggaaccaaatgtgaaaaacagccatttatagaggtggccctg atggatggggaagagagaaagagaaagcctgtgttcccaggggatcattatgttcactgc atcatcggactggtagattttgcagaaatacatgaagctgttgaacataaccaatctgcc ctggtgaacattcatcctcagaatcccttgtga >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_2|301_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRIRIAKSILSQKNKAGGITLPD FKLYYKATVTKTACVMELHQKSVHYLAYFYCGPASSGLDSPTPPHAFNNEKEEIRGSGGS LFYHPCLGSLCFKEKQPVMSPQTRQLQKSCHFPGKEVRGLGKISSSNSGQPSSSQQLGRS TGHADVGPTVVEAPATLMWDQRCCGGWKSETEVSAGLAPSEDCLTLNAAIFSFWSSAEEA EMHSSAGGKIELDFKCSEDIRITGSLDFQKEKVTIKVIYSKKISISGSLDFHKENETIKV I >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_2|906_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagaatc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggtatcacgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatgtgtcatggagttacatcag aaatcggtgcattacttggcatatttttactgtggcccagcaagcagtggcttagacagc cccacacctcctcatgcatttaacaatgaaaaggaggagataagagggagtggaggcagt cttttttatcatccgtgcttgggaagtctatgcttcaaggagaaacaaccagtcatgagt ccccagactcgccagctccaaaaatcttgccacttccctggaaaggaggtgagaggcctt ggaaagatcagcagctccaacagtggccagccatcctccagccagcaactgggtcgcagc actggccacgctgacgtggggccaacggtggtggaggcaccggccacgctgatgtgggac caacggtgctgtggaggctggaagtctgaaactgaggtgtcagcagggctggctccttct gaggactgtctgactctaaatgcagccatcttctccttctggagctcggctgaagaagca gagatgcactccagtgctggagggaaaatagaactagacttcaaatgttctgaggacatc cgtataactgggtcactggacttccaaaaagagaaagtaaccatcaaagtaatatattcc aagaagatctctatatctgggtcactggacttccacaaagaaaatgaaaccatcaaagta atataa >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_3|267_aa MAPCAGSHDWAASLETCCMPSAFNFEGHRLGLHGSGGEPERALRIGFYFLLPINHGIMPD KDANDPGNNEARLRIVKTLEDIDLGPTEKCVRVNSVSSGLAEEDLETLLQSRVLPSSLML PKVESPEEIQWAVCEETLKVGPQVGLFLDAVVFGGEDFRASIGATSSKETLDILYARQKI VVIAKAFGLQAIDLVYIDFRDGAGLLRQSREGAAMGFTGKQVIHPNQIAVVQEQFSPSPE KIKWAEELIAAFKEHQQLGKLHKLAKD >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_3|804_bp atggctccctgtgctggcagccatgattgggctgcttcactggagacatgctgcatgcca tcagccttcaactttgagggccacaggctggggcttcatggaagtggtggggagcctgaa cgtgcgctgcggatagggttttacttcctgctacccatcaaccatgggataatgcctgac aaagatgcgaatgaccccggaaataatgaagctcgactgagaattgtaaaaactcttgaa gacattgatctgggccctactgaaaaatgtgtgagagtcaactcagtttccagtggtctg gcggaagaagacctagagacccttttgcaatcccgggtccttccttccagcctgatgcta ccaaaggtggaaagtcctgaagaaatccagtgggcagtgtgtgaagaaaccctgaaggtc gggcctcaagtaggtctctttctagatgcagtcgtttttggaggagaagactttcgagcc agcataggtgcaacaagtagtaaagaaaccctggatattctctacgcccggcaaaagatt gttgtcatagcgaaagcctttggtctccaagccatagatctggtgtacattgactttcga gatggagctgggctgcttagacagtcacgagaaggagccgccatgggcttcactggtaag caggtgattcaccctaaccaaattgccgtggtccaggagcagttttctccttcccctgaa aaaattaagtgggctgaagaactgattgctgcctttaaagaacatcaacaattaggaaag cttcacaagttggctaaagactga >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_4|86_aa MAGPQIVPVLTTLAFSPILVASGTAGKTVLRDEDFTASDLNLKAPVPPRTLRAGRTPAQG PCARPGVPPALGSLQPRIRTRATRND >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_4|261_bp atggcaggccctcagatcgtccctgtgctcaccaccctggccttttctccgattcttgtt gcatcagggacggctgggaaaacagtcctgcgggatgaagacttcaccgcctccgatttg aatttgaaagccccagtacccccgcgcaccctgcgcgcaggccggacacctgcgcagggc ccttgcgcccgccctggggtcccgccggccctggggtccctgcagccccgaatccgcacc cgagccacgcggaacgactag >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_5|314_aa MGFLHVGQAGLELLTSEELALHSSNPGNAQAQRRHLSVILGTVIPGTVIPRTAARASSPP HTLTKPPMTVATCIRIQDREKWLIHSSHVNQYINQHRKYQAEKAGITLWSLENFHASAQV PTLCHKQKNQKEKTKVLRLRGALEGKGLLRQGCVQELQQEWELNRKLARKAVAQGRRRRP SWERSQGPELLLLVFSFLSFIASSFATFLLDPPGWLLPLCPGGALAYSSREEPGPQRNSN QCATQGARELLARCLDPERVPSALAGAGLGGLNCAKGEQIADKGERFRFSGVVKATPKYS SYNKFHFKGFLIQR >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_5|945_bp atggggtttctccatgttggccaggctggtctcgagctcctgacctcagaggaattggcc ctgcactcttcaaacccaggcaatgctcaagcacagcgcaggcatctctcggtcatcctg gggacggtcatccccgggacggtcatccccaggacagccgcgagagcctcctctcctccc cacactctcaccaagccccccatgactgtggccacctgcatcagaatacaggacagggag aaatggcttattcactccagccatgtaaaccagtacatcaaccagcacaggaagtaccaa gcagagaaggcaggcatcactttgtggtcactggaaaactttcatgcgtctgcccaggtc ccaactttatgccacaaacagaagaaccaaaaagagaaaacaaaagttctgaggcttaga ggtgcactggagggaaagggcctgctgagacaggggtgcgtgcaggagctgcagcaggaa tgggagctaaaccggaagcttgctaggaaggcggtggcccagggaaggaggaggaggcct tcttgggaacgaagccaaggcccagagctcctgctgctggttttctcctttctctccttc atcgcctcttccttcgccactttccttctggaccctccgggctggctcctgcccctctgc cccggtggcgctctcgcctattcctcccgggaggagccggggccgcagaggaactccaac cagtgtgccacgcagggtgctcgcgagctcttggcaaggtgcctggatcccgagcgtgtg cccagtgcactggctggggctgggctgggggggttgaattgcgccaaaggagaacaaatc gctgacaaaggcgagcgctttcgattcagcggcgtcgttaaggcaaccccaaagtattcc tcctataataagttccacttcaaagggtttctcattcagcggtga >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_6|689_aa MTGFPALAGPPAHSQLRAAVAHLRLRDLGADPGVATTPLGPEHMAQASTLGLSPPSQAFP AHPEAPAAAARAAALVAHPGAGSYPCGGGSSGAQPSAPPPPAPPLPPTPSPPPPPPPPPP PALSGYTTTNSGGGGSSGKGHSRDFVLRRDLSATAPAAAMHGAPLGGEQRSGTGSPQHPA PPPHSAGMFISASGTYAGPDGSGGPALFPALHDTPGAPGGHPHPLNGQMRLGLAAAAAAA AAELYGRAEPPFAPRSGDAHYGAVAAAAAAALHGYGAVNLNLNLAAAAAAAAAGPGPHLQ HHAPPPAPPPPPAPAQHPHQHHPHLPGAAGAFLRYMRQPIKQELICKWIDPDELAGLPPP PPPPPPPPPPPPAGGAKPCSKTFGTMHELVNHVTVEHVGGPEQSSHVCFWEDCPREGKPF KAKYKLINHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTVSHSPWRVLELTSAFL EKVLVVGGPEPPTIDLSALLRPGGSRRVPRSGRGRSGLCGGSSVGVARGGSLGLAPAGRL MAGEKPFKCEFDGCDRKFANSSDRKKHSHVHTSDKPYYCKIRGCDKSYTHPSSLRKHMKI HCKSPPPSPGPLGYSSVGTPVGAPLSPVLDPARSHSSTLSPQVTNLNEWYVCQASGAPSH LHTPSSNGTTSETEDEEIYGNPEVVRTIH >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_6|2070_bp atgacaggcttcccggcgctggccggcccgcccgcccactcccaactccgggccgccgtc gcgcacctccgcctgcgggacctgggcgctgaccccggcgtggccaccactccgctcgga cccgagcacatggcccaggcgagcacgctgggcctcagccctccctcccaggcgttcccg gcacacccggaggctccggcagccgccgcccgtgctgcagccttggtcgcgcaccccggc gcgggcagctacccctgcggcgggggcagcagtggcgcgcagccctccgcgcccccgccc ccagcccctcctcttcctcccaccccttcaccccctccccctcccccgcctcctcctcct cctgccctctcgggctacaccaccaccaacagtggcggcggcggcagcagcggcaaaggc cacagcagggacttcgtcctccggagggacctttccgccacggcccccgcggcggccatg cacggggccccgctcggaggggagcagcggtccggcaccggctccccccagcacccggcc ccgcctccccactcggccggcatgttcatctccgccagcggcacctacgcgggcccggac ggcagcggcggcccggcgctcttccccgcgctgcacgacacgccgggggccccaggcggc cacccgcacccgctcaacggccagatgcgcctggggctggcggcggcagcggcagccgcg gcggctgagctgtacggccgcgccgaaccgcccttcgcgccgcgctctggggacgcgcac tacggggcggttgcggccgcagcggcggccgccctgcacggctacggagccgtgaactta aacctgaacctggcggctgcggcggccgcagcagcggccgggcccgggccccacctgcag caccacgcgccgcccccggcgccgccgccgccgccggcgcccgcgcagcacccgcaccag caccacccccacctcccaggggcggctggggccttcctgcgctacatgcggcagccaatc aagcaggagctcatctgcaagtggatcgaccccgacgagctggccgggctgccgccgccg ccgccgccgccgccgccgccgccgccaccgcccccggccggcggcgccaagccctgctcc aaaactttcggcaccatgcacgagctggtgaatcacgtcacggtggagcacgtgggaggc cccgagcagagcagccacgtctgcttctgggaggactgtccgcgcgagggcaagcccttc aaggccaaatacaagctcatcaaccacatccgcgtgcacaccggcgagaagccctttccc tgccctttccccggctgcggcaaggtcttcgcgcgctccgagaacctcaagatccacaag cgtactcatacagtatcacactcgccctggagggtgctggagttgacatctgcctttcta gagaaggttctagttgtaggaggacccgagcctcccaccatcgatttatcggcgctgctg cggccaggaggtagccgccgggtcccacgctcgggccgggggcgctcggggctctgcggg ggcagcagcgtcggggtggcccgaggcgggagccttggcttggctcctgcgggccgcctg atggccggggaaaagcctttcaaatgtgaatttgatggctgtgacaggaagtttgccaat agcagtgatcggaagaaacattcccatgtccacaccagtgacaagccctactactgcaag attcgaggctgtgacaaatcctacactcacccaagctccctgaggaagcacatgaagatt cactgcaagtccccgccaccttctccaggaccccttggttactcatcagtggggactcca gtgggcgcccccttgtcccctgtgctggacccagccaggagtcactccagcactctgtcc cctcaggtgaccaacctcaatgagtggtacgtttgccaggccagtggggcccccagccac ctccacaccccttccagcaacggaaccacctctgagactgaagatgaggaaatttacggg aaccctgaagttgtgcggacgatacattag >gi568815585f:99672825_99991410|GENSCAN_predicted_peptide_7|532_aa MLLDAGPQFPAIGVGSFARHHHHSAAAAAAAAAEMQDRELSLAAAQNGFVDSAAAHMGAF KLNPGAHELSPGQSSAFTSQGPGAYPGSAAAAAAAAALGPHAAHVGSYSGPPFNSTRDFL FRSRGFGDSAPGGGQHGLFGPGAGGLHHAHSDAQGHLLFPGLPEQHGPHGSQNVLNGQMR LGLPGEVFGRSEQYRQVASPRTDPYSAAQLHNQYGPMNMNMGMNMAAAAAHHHHHHHHHP GAFFRYMRQQCIKQELICKWIDPEQLSNPKKSCNKTFSTMHELVTHVSVEHVGGPEQSNH VCFWEECPREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGE KPFQCEFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSLRKHMKVHESSPQ GSESSPAASSGYESSTPPGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSAVHRGGGS GSGGAGGGSGGGSGSGGGGGGAGGGGGGSSGGGSGTAGGHSGLSSNFNEWYV >gi568815585f:99672825_99991410|GENSCAN_predicted_CDS_7|1599_bp atgctcctggacgcgggtccgcagttcccggccatcggggtgggcagcttcgcgcgccac catcaccactccgccgcggcggcggcggcggctgccgccgagatgcaggaccgtgaactg agcctggcggcggcgcagaacggcttcgttgactccgccgccgcgcacatgggagccttc aagctcaacccgggcgcgcacgagctgtccccgggccagagctcggcgttcacgtcgcag ggccccggcgcctaccccggctccgctgcggctgccgctgcggccgcagcgctcgggccc cacgccgcgcacgttggctcctactctgggccgcccttcaactccacccgggacttcctg ttccgcagccgcggcttcggggactcggcgccgggcggcgggcagcacgggctgttcggg ccgggcgcgggcggcctgcaccacgcgcactcggacgcgcagggccacctcctcttcccg ggcctgccagagcagcacgggccgcacggctcgcagaatgtgctcaacgggcagatgcgc ctcgggctgcccggcgaggtgttcgggcgctcggagcaataccgccaggtggccagcccg cggaccgacccctactcggcggcgcaactccacaaccagtacggccccatgaatatgaac atgggtatgaacatggcagcagccgcggcccaccaccaccaccaccaccaccaccacccc ggtgcctttttccgctatatgcggcagcagtgcatcaagcaggagctaatctgcaagtgg atcgaccccgagcaactgagcaatcccaagaagagctgcaacaaaactttcagcaccatg cacgagctggtgacacacgtctcggtggagcacgtcggcggcccggagcagagcaaccac gtctgcttctgggaggagtgtccgcgcgagggcaagcccttcaaggccaaatacaaactg gtcaaccacatccgcgtgcacacaggcgagaaacccttcccctgccccttcccgggctgt ggcaaagtcttcgcgcgctccgagaacctcaagatccacaaaaggacccacacaggggag aagccgttccagtgtgagtttgagggctgcgaccggcgcttcgccaacagcagcgacagg aagaagcacatgcacgtccacacctccgataagccctatctctgcaagatgtgcgacaag tcctacacgcaccccagctcgctgcggaagcacatgaaggtccatgagtcctccccgcag ggctctgaatcctccccggccgccagctccggctatgagtcgtccacgcccccggggctg gtgtcccccagcgccgagccccagagcagctccaacctgtccccagcggcggcggcagcg gcggcggcggctgcggcggcggcggccgcggtgtccgcggtgcaccggggcggaggctcg ggcagtggcggcgcgggaggcggctcaggcggcggcagcggcagtggcgggggcggcggc ggggcgggcggcgggggcggcggcagctctggcgggggcagcgggacagccgggggtcac agcggcctctcctccaacttcaatgaatggtacgtgtga