GENSCAN 1.0 Date run: 27-Jul-117 Time: 10:34:40 Sequence gi568815589r:33013457_33267169 : 253713 bp : 45.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13029 13160 132 2 0 64 63 144 0.994 9.64 1.02 Intr + 13357 13534 178 0 1 78 94 100 0.990 9.09 1.03 Intr + 16429 16533 105 2 0 52 102 63 0.940 4.29 1.04 Intr + 16984 17211 228 2 0 71 113 90 0.990 7.64 1.05 Intr + 20760 20874 115 1 1 65 28 80 0.962 -0.79 1.06 Intr + 23118 23233 116 0 2 52 80 141 0.982 9.79 1.07 Intr + 23559 23659 101 1 2 45 121 108 0.999 9.63 1.08 Intr + 25229 25421 193 1 1 9 26 231 0.199 8.07 1.09 Term + 30937 31136 200 2 2 10 48 156 0.169 1.36 1.10 PlyA + 31646 31651 6 1.05 2.10 PlyA - 33118 33113 6 1.05 2.09 Term - 33935 33837 99 2 0 90 31 114 0.985 4.03 2.08 Intr - 39834 39667 168 0 0 43 109 141 0.964 11.84 2.07 Intr - 44258 44142 117 2 0 93 86 109 0.980 11.86 2.06 Intr - 47128 47009 120 1 0 39 98 101 0.974 6.89 2.05 Intr - 48721 48593 129 1 0 96 97 94 0.999 11.99 2.04 Intr - 55478 55368 111 2 0 110 91 24 0.958 5.48 2.03 Intr - 60328 60140 189 1 0 45 110 168 0.650 14.48 2.02 Intr - 63273 63127 147 0 0 77 64 133 0.546 10.23 2.01 Init - 69701 69696 6 2 0 64 97 10 0.160 -0.19 2.00 Prom - 70589 70550 40 -9.26 3.00 Prom + 74421 74460 40 -4.96 3.01 Init + 75866 76346 481 1 1 42 -22 242 0.330 4.12 3.02 Term + 76376 76848 473 0 2 12 55 304 0.394 14.59 3.03 PlyA + 76867 76872 6 1.05 4.09 PlyA - 77664 77659 6 1.05 4.08 Term - 100130 99998 133 1 1 121 40 109 0.945 6.96 4.07 Intr - 102657 102535 123 2 0 90 95 37 0.943 4.30 4.06 Intr - 107150 106963 188 2 2 80 110 173 0.990 17.19 4.05 Intr - 114658 114580 79 0 1 99 55 81 0.816 5.45 4.04 Intr - 120813 120674 140 0 2 52 56 154 0.912 7.86 4.03 Intr - 121968 121733 236 1 2 109 94 397 0.970 39.81 4.02 Intr - 146423 146307 117 0 0 90 81 83 0.417 8.24 4.01 Init - 153713 153302 412 2 1 84 87 454 0.202 39.58 4.00 Prom - 158125 158086 40 -8.86 5.00 Prom + 163588 163627 40 -3.36 5.01 Init + 163984 164013 30 0 0 59 111 -8 0.653 -1.80 5.02 Term + 165869 165961 93 1 0 87 55 140 0.923 8.43 5.03 PlyA + 166507 166512 6 1.05 6.03 PlyA - 169382 169377 6 1.05 6.02 Term - 173803 173651 153 1 0 109 36 47 0.323 -0.48 6.01 Init - 176135 175881 255 2 0 67 92 85 0.447 4.06 6.00 Prom - 178774 178735 40 -1.86 7.04 PlyA - 181626 181621 6 1.05 7.03 Term - 189028 188939 90 1 0 103 44 63 0.307 1.12 7.02 Intr - 205248 205156 93 0 0 23 67 106 0.188 2.16 7.01 Init - 208186 208127 60 1 0 62 111 9 0.349 1.85 7.00 Prom - 213450 213411 40 -3.86 8.00 Prom + 215431 215470 40 -4.86 8.01 Init + 226753 226813 61 0 1 90 84 137 0.716 13.05 8.02 Intr + 231656 231696 41 0 2 113 91 -10 0.799 -0.26 8.03 Intr + 233160 233272 113 2 2 131 95 57 0.927 9.88 8.04 Term + 234970 235015 46 1 1 77 42 81 0.852 -0.92 8.05 PlyA + 235089 235094 6 1.05 9.10 PlyA - 239035 239030 6 1.05 9.09 Term - 241852 241763 90 1 0 96 51 123 0.979 7.12 9.08 Intr - 242471 242409 63 2 0 85 109 -1 0.583 0.61 9.07 Intr - 243452 243345 108 2 0 95 119 131 0.998 17.38 9.06 Intr - 245577 245464 114 0 0 64 93 117 0.999 10.44 9.05 Intr - 247713 247631 83 1 2 105 94 51 0.992 6.76 9.04 Intr - 249374 249246 129 0 0 105 98 74 0.999 10.77 9.03 Intr - 249930 249781 150 1 0 53 86 44 0.615 0.83 9.02 Intr - 251139 250768 372 1 0 44 79 354 0.496 25.13 9.01 Intr - 252607 252492 116 0 2 64 80 39 0.242 0.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_1|455_aa MVKETTYYDVLGVKPNATQEELKKAYRKLALKYHPDKNPNEGEKFKQISQAYEVLSDAKK RELYDKGGEQAIKEGGAGGGFGSPMDIFDMFFGGGGRMQRERRGKNVVHQLSVTLEDLYN GATRKLALQKNVICDKCEGRGGKKGAVECCPNCRGTGMQIRIHQIGPGMVQQIQSVCMEC QGHGERISPKDRCKSCNGRKIVREKKILEVHIDKGMKDGQKITFHGEGDQEPGLEPGDII IVLDQKDHAVFTRRGEDLFMCMDIQLVEALCGFQKPISTLDNRTIVITSHPGQIVKHGDI KCVLNEGMPIYRRPYEKGRLIIEFKVNFPENGFLSPDKLSLLEKLLPERKEVEETDEMDQ VELVDFDPNQERRRHYNGEAYEDDEHHPRGALPALDPRWDSYLRPPGAAAPQIRPSALRV RGPTPPRSGCKGAGGELGCPSLQCWRDGRVGLQKG >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_1|1368_bp atggtgaaagaaacaacttactacgatgttttgggggtcaaacccaatgctactcaggaa gaattgaaaaaggcttataggaaactggctttgaagtaccatcctgataagaacccaaat gaaggagagaagtttaaacagatttctcaagcttacgaagttctctctgatgcaaagaaa agggaattatatgacaaaggaggagaacaggcaattaaagagggtggagcaggtggcggt tttggctcccccatggacatctttgatatgttttttggaggaggaggaaggatgcagaga gaaaggagaggtaaaaatgttgtacatcagctctcagtaaccctagaagacttatataat ggtgcaacaagaaaactggctctgcaaaagaatgtgatttgtgacaaatgtgaaggtaga ggaggtaagaaaggagcagtagagtgctgtcccaattgccgaggtactggaatgcaaata agaattcatcagataggacctggaatggttcagcaaattcagtctgtgtgcatggagtgc cagggccatggggagcggatcagtcctaaagatagatgtaaaagctgcaacggaaggaag atagttcgagagaagaaaattttagaagttcatattgacaaaggcatgaaagatggccag aagataacattccatggtgaaggagaccaagaaccaggactggagccaggcgatattatc attgtgttagatcagaaggaccatgctgtttttactcgacgaggagaagaccttttcatg tgtatggacatacagctcgttgaagcactgtgtggcttccagaagccaatatctactctt gacaaccgaaccatcgtcatcacctctcatccaggtcagattgtcaagcatggagatatc aagtgtgtactaaatgaaggcatgccaatttatcgtagaccatatgaaaagggtcgccta atcatcgaatttaaggtaaactttcctgagaatggctttctctctcctgataaactgtct ttgctggaaaaactcctacccgagaggaaggaagtggaagagactgatgagatggaccaa gtagaactggtggactttgatccaaatcaggaaagacggcgccactacaatggagaagca tatgaggatgatgaacatcatcccagaggggctctaccggccctggacccaagatgggac tcctacctccgaccacccggagcagcggcgccccagatccggccgtccgccctgcgcgtg cgcggcccgaccccgccgcggtctggctgtaagggcgctggaggggagctgggctgcccc agccttcaatgctggagggacggccgcgtggggcttcagaaaggctag >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_2|361_aa MKSGSQTPTQAEVTLGVRRRCRWCVARLALRESWGLLPERYGYVDRNRIFGYLKENSLHR ALATLQEETTVSLNTVDSIESFVADINSGHWDTVLQAIQSLKLPDKTLIDLYEQAYPDGS SKEKRRAAIAQALAGEVSVVPPSRLMALLGQALKWQQHQGLLPPGMTIDLFRGKAAVKDV EEEKFPTQLSRHIKFGQKSHVECARFSPDGQYLVTGSVDGFIEVWNFTTGKIRKDLKYQA QDNFMMMDDAVLCMCFSRDTEMLATGAQDGKIKIWNMKTTECSNTFKSLGSTAGTDITVN SVILLPKNPEHFVVCNRSNTVVIMNMQGQVHEKDVIGIAHHPHQNLIATYSEDGLLKLWK P >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_2|1086_bp atgaagagcgggagccagacgccgacccaagcggaagtgacgttaggtgtccgccggagg tgtcgttggtgtgttgcgcgactggccttgagggagagctggggcctgctcccggagaga tacggctatgtcgatcgaaatcgaatcttcggatacttgaaggagaacagtttacatcgg gcgttagccaccttgcaggaggagactactgtgtctctgaatactgtggacagcattgag agttttgtggctgacattaacagtggccattgggatactgtgttgcaggctatacagtct ctgaaattgccagacaaaaccctcattgacctctatgaacaggcatacccagatggaagt agcaaagaaaagagaagagcagcaattgcccaggccttagctggcgaagtcagtgtggtg cctccatctcgtctcatggcattgctgggacaggcactgaagtggcagcagcatcaggga ttgcttcctcctggtatgaccatagatttgtttcgaggcaaggcagctgtcaaagatgtg gaagaagaaaagtttcctacacaactgagcaggcatattaagtttggtcagaaatcacat gtggagtgtgctcgattttctccagatggtcagtatttggtcactgggtctgttgatgga ttcattgaagtatggaactttactactggaaaaatcagaaaggatcttaagtaccaggcc caagataactttatgatgatggatgatgctgtcctctgcatgtgtttcagcagagataca gaaatgttagcaactggggcccaagatggaaaaatcaagatctggaatatgaagaccaca gaatgttcaaatacctttaaatccctgggcagcaccgcagggacagatattaccgtcaac agtgtgattctacttcctaaaaaccctgagcactttgtggtgtgcaacagatcaaacacg gtggtcatcatgaacatgcaggggcaggtgcacgagaaggatgtgattggtattgcacat caccctcatcagaacctgattgctacctacagtgaagatggactcctaaagctctggaaa ccataa >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_3|317_aa MIISIDAEKAFNKIQQPFMLKTLNKLGIDGTYLKLIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNLVLEVLARTITQEKEIKGIRLGKGEVKFSLFADDMIVYLENPIVS APNLLKLISNFSKVSGYKINVQKPQAFLYTNNRQTESQIMKNKRPRNPTYKGCEGPLQGE LQTTAQRNKRGHKQMEEHSMLINIVKMAILPKVIYRFNAIPIKLPMTFHTELEKTTLKFM WNQKRACIAKSNLSQKNKAGDITLPDFKLSYKATVTKTARYWYQNRDIDQWNRTEPSEIM LHIYNYLIFGKPDKNKK >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_3|954_bp atgattatttccatagatgcagaaaaggccttcaacaaaattcaacagcccttcatgcta aaaactctcaataaattaggtattgatgggacgtatctcaagttaataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcactagacagggatgccctctctcaccactcctattcaacctagtgttggaagtt ctggccaggacaatcacgcaggagaaagaaataaagggtattcgattaggaaaaggggaa gtcaaattttccctgtttgcagatgacatgattgtatatttagaaaaccccatcgtctca gccccaaatctccttaagctgatcagcaacttcagcaaagtctcaggatacaaaatcaat gtgcaaaaaccacaagcattcctatacaccaataacagacaaacagagagccaaatcatg aagaataaaagacctaggaatccaacttataagggatgtgaaggacctcttcaaggagaa ctacaaaccactgctcaacggaataaaagaggacacaaacaaatggaagaacattccatg ctcatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc cccatcaagctaccaatgactttccacacagaattggaaaaaactactttaaagttcatg tggaaccaaaaaagagcctgcattgccaagtcaaacctaagccaaaagaacaaagctgga gacatcacgctacctgacttcaaactatcctacaaggctacagtaaccaaaacagcacgg tactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaaataatg ctgcatatctacaactatctgatctttggcaaacctgacaaaaacaagaaatga >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_4|475_aa MRLREPLLSGSAAMPGASLQRACRLLVAVCALHLGVTLVYYLAGRDLSRLPQLVGVSTPL QGGSNSAAAIGQSSGELRTGGARPPPPLGASSQPRPGGDSSPVVDSGPGPASNLTSVPVP HTTALSLPACPEESPLLAKSLCWTSVDYCGSGAVVVGEQQLAKPVLLTSLRCDAQEVGPM LIEFNMPVDLELVAKQNPNVKMGGRYAPRDCVSPHKVAIIIPFRNRQEHLKYWLYYLHPV LQRQQLDYGIYVINQVGGLLAVYGFHQGQGTLCTFPMTPGEALEDCCGIVLHHLLQGDSE EVERERERNADLTTAATAAATTAATTTLAGDTIFNRAKLLNVGFQEALKDYDYTCFVFSD VDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGGVSALSKQQFLTINGFPNNYW GWGGEDDDIFNRFDRIAHTKETMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPS >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_4|1428_bp atgaggcttcgggagccgctcctgagcggcagcgccgcgatgccaggcgcgtccctacag cgggcctgccgcctgctcgtggccgtctgcgctctgcaccttggcgtcaccctcgtttac tacctggctggccgcgacctgagccgcctgccccaactggtcggagtctccacaccgctg cagggcggctcgaacagtgccgccgccatcgggcagtcctccggggagctccggaccgga ggggcccggccgccgcctcctctaggcgcctcctcccagccgcgcccgggtggcgactcc agcccagtcgtggattctggccctggccccgctagcaacttgacctcggtcccagtgccc cacaccaccgcactgtcgctgcccgcctgccctgaggagtccccgctgcttgctaagtcc ctgtgctggacttctgtggactactgtggctctggggctgtggttgtgggtgaacaacag ctagctaaaccagtgctgttgacatcattgagatgtgacgcacaggaagtgggccccatg ctgattgagtttaacatgcctgtggacctggagctcgtggcaaagcagaacccaaatgtg aagatgggcggccgctatgcccccagggactgcgtctctcctcacaaggtggccatcatc attccattccgcaaccggcaggagcacctcaagtactggctatattatttgcacccagtc ctgcagcgccagcagctggactatggcatctatgttatcaaccaggtaggaggattgttg gcagtttacggcttccatcaaggtcaaggaactctgtgcaccttccctatgaccccaggg gaagcactcgaggactgctgtggcattgtgctgcatcacttgctgcagggagattctgaa gaagtagagagagagagagagaggaatgccgacctaactaccgctgccactgctgctgcc accaccgctgccaccaccaccctggcgggagacactatattcaatcgtgctaagctcctc aatgttggctttcaagaagccttgaaggactatgactacacctgctttgtgtttagtgac gtggacctcattccaatgaatgaccataatgcgtacaggtgtttttcacagccacggcac atttccgttgcaatggataagtttggattcagcctaccttatgttcagtattttggaggt gtctctgctctaagtaaacaacagtttctaaccatcaatggatttcctaataattattgg ggctggggaggagaagatgatgacatttttaacaggtttgaccgaattgcacacacaaag gagacaatgctctctgatggtttgaactcactcacctaccaggtgctggatgtacagaga tacccattgtatacccaaatcacagtggacatcgggacaccgagctag >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_5|40_aa MAKPSPPSVLFPVLPNLRYGLHQISASERIWKDIDPNFIQ >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_5|123_bp atggccaaacctagcccaccgtctgttttgtttcctgttcttcccaacctgcgctatgga cttcatcagatttcagcatcagagagaatatggaaggacatcgaccctaacttcatccag tga >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_6|135_aa MSGLGQQTAPAVKEPAGLWGRHAIKRQLQERQWDECQTDVSAKYGRSSEEASSLVAWDRR LAGLPTSHPSPPCALLPECLPEHSYGSQGTVPPELELLADRCKNMGTRDAGSSARIRVLA KQLKSFDSPFPARPF >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_6|408_bp atgtcaggcctggggcaacaaacagcccctgctgtcaaggaacccgcaggcttgtggggc cgtcatgctatcaaaaggcagctccaagagagacagtgggatgagtgtcaaacagatgtt tcagccaagtatggcaggagctcagaggaggcctcatcacttgttgcctgggacagacgt ctggctggtctgcccacctcccaccccagccctccctgtgcgctgctgccagaatgtctt cctgaacacagctatggttcacaggggacagtccctcctgagctggaacttcttgctgac aggtgtaagaacatggggaccagagacgcagggtcctctgctcgtatcagagtgctggcc aagcaactgaagtcctttgacagcccctttcctgctaggcccttctaa >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_7|80_aa MREGCFPLPSSRDTTSAVDQPGSRQKTDGLLSVVIEGFNEDIVYESASKAKVPTIPLFLR QHQHFQGYEGHSHQMGCNTE >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_7|243_bp atgagggaaggatgcttcccattaccctcatctagagacaccacctctgctgtggaccag ccagggtctcggcagaaaacagatgggttactcagtgtggtcattgaagggtttaatgaa gacattgtttatgaaagtgccagcaaggctaaggttcccactatccccttgtttctgaga cagcatcagcatttccagggctatgaaggccacagccaccagatgggctgcaacactgaa tga >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_8|86_aa MAVRQWVIALALAALLVVDREVPVAAGKLPFSRMPICEHMVESPTCSQMSNLVCGTDGLT YTNECQLCLARIKTKQDIQIMKDGKC >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_8|261_bp atggccgtccgccagtgggtaatcgccctggccttggctgccctccttgttgtggacagg gaagtgccagtggcagcaggaaagctccctttctcaagaatgcccatctgtgaacacatg gtagagtctccaacctgttcccagatgtccaacctggtctgcggcactgatgggctcaca tatacgaatgaatgccagctctgcttggcccggataaaaaccaaacaggacatccagatc atgaaagatggcaaatgctga >gi568815589r:33013457_33267169|GENSCAN_predicted_peptide_9|408_aa XSASNREIFLSMDSALLSTLGENIKCTGSYCNTLIPEGAGREPRQSEPPAQRGPPPSGRP PARSTASGHDRPTRGAAAGARRPRMKKKTRRRSTRSEELTRSEELTLSEEATWSEEATQS EEATQGEEMNRSQEVTRDEESTRSEEVTREEMAAAGLTVTVTHTQIMDALKCVQWERLGG KQSLAHYSKRDETNTIQGTGQSSHRKQMACDEKGNEKHDLHVTSQQGSSEPVVQDLAQVV EEVIGVPQSFQKLIFKGKSLKEMETPLSALGIQDGCRVMLIGKKNSPQEEVELKKLKHLE KSVEKIADQLEELNKELTGIQQGFLPKDLQAEALCKLDRRVKATIEQFMKILEEIDTLIL PENFKDSRLKRKGLVKKVQAFLAECDTVEQNICQETERLQSTNFALAE >gi568815589r:33013457_33267169|GENSCAN_predicted_CDS_9|1227_bp nnctcagcatccaatcgagaaatcttcttgtcaatggattctgctctactgtccacctta ggggaaaatatcaaatgcactggtagttactgcaatacacttattccagaaggggcaggc cgggagccgcgccagtcggagcccccggcccagcgtggtccgcctccctctgggcgtcca cctgcccggagtactgccagcgggcatgaccgacccaccaggggcgccgccgccggcgct cgcaggccgcggatgaagaagaaaacccggcgccgctcgacccggagcgaggagttgacc cggagcgaggagttgaccctgagtgaggaagcgacctggagtgaagaggcgacccagagt gaggaggcgacccagggcgaagagatgaatcggagccaggaggtgacccgggacgaggag tcgacccggagcgaggaggtgaccagggaggaaatggcggcagctgggctcaccgtgact gtcacccacacacaaatcatggatgcactcaagtgtgttcagtgggagagacttggagga aaacagtcacttgcccattacagcaagcgagacgagaccaatacaatacagggtacaggg cagtcaagccacaggaagcagatggcttgtgatgaaaaaggcaatgagaagcacgacctt catgttacctcccagcagggcagcagtgaaccagttgtccaagacctggcccaggttgtt gaagaggtcataggggttccacagtcttttcagaaactcatatttaagggaaaatctctg aaggaaatggaaacaccgttgtcagcacttggaatacaagatggttgccgggtcatgtta attgggaaaaagaacagtccacaggaagaggttgaactaaagaagttgaaacatttggag aagtctgtggagaagatagctgaccagctggaagagttgaataaagagcttactggaatc cagcagggttttctgcccaaggatttgcaagctgaagctctctgcaaacttgataggaga gtaaaagccacaatagagcagtttatgaagatcttggaggagattgacacactgatcctg ccagaaaatttcaaagacagtagattgaaaaggaaaggcttggtaaaaaaggttcaggca ttcctagccgagtgtgacacagtggagcagaacatctgccaggagactgagcggctgcag tctacaaactttgccctggccgagtga