GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:00:33 Sequence gi568815593f:6637544_6854942 : 217399 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 2104 2099 6 1.05 1.01 Sngl - 9806 9312 495 2 0 22 42 194 0.458 4.25 1.00 Prom - 14522 14483 40 -2.26 2.00 Prom + 27185 27224 40 -4.26 2.01 Sngl + 29209 29541 333 0 0 33 41 256 0.501 9.42 2.02 PlyA + 31067 31072 6 1.05 3.00 Prom + 33333 33372 40 -6.56 3.01 Init + 35861 35936 76 1 1 74 44 137 0.242 9.15 3.02 Intr + 66666 66832 167 1 2 70 19 159 0.342 6.88 3.03 Intr + 75525 75732 208 2 1 17 33 129 0.006 -1.05 3.04 Intr + 76025 77156 1132 0 1 9 84 1098 0.136 90.00 3.05 Intr + 99967 100090 124 1 1 75 76 84 0.892 6.49 3.06 Intr + 101140 101186 47 0 2 82 95 6 0.880 -2.09 3.07 Intr + 102189 102309 121 0 1 57 113 191 0.993 19.00 3.08 Intr + 104947 105054 108 0 0 43 79 75 0.712 2.58 3.09 Intr + 108671 108884 214 1 1 43 110 247 0.941 20.69 3.10 Intr + 110921 111047 127 0 1 50 90 182 0.981 14.34 3.11 Intr + 112014 112114 101 0 2 85 72 96 0.999 7.45 3.12 Intr + 112788 112960 173 1 2 105 81 158 0.997 16.46 3.13 Term + 113496 113735 240 2 0 40 45 176 0.912 4.53 3.14 PlyA + 114145 114150 6 -3.44 4.00 Prom + 114224 114263 40 -2.56 4.01 Init + 114795 115009 215 2 2 39 67 174 0.563 8.62 4.02 Intr + 115383 115494 112 0 1 96 75 74 0.982 7.28 4.03 Intr + 117208 117289 82 0 1 100 53 120 0.325 8.91 4.04 Intr + 121337 121516 180 0 0 66 89 54 0.473 3.14 4.05 Term + 123677 123735 59 0 2 118 42 67 0.526 2.95 4.06 PlyA + 126441 126446 6 1.05 5.07 PlyA - 126678 126673 6 1.05 5.06 Term - 130834 130618 217 0 1 32 55 96 0.365 -2.68 5.05 Intr - 131254 131153 102 0 0 106 35 133 0.629 9.09 5.04 Intr - 132630 132447 184 1 1 82 91 59 0.691 4.45 5.03 Intr - 136175 136011 165 0 0 34 59 124 0.703 4.03 5.02 Intr - 137620 137380 241 1 1 104 61 131 0.980 9.12 5.01 Init - 138730 138644 87 1 0 60 13 140 0.899 2.78 5.00 Prom - 139939 139900 40 -5.86 6.00 Prom + 144062 144101 40 -6.36 6.01 Init + 151373 151535 163 1 1 67 81 104 0.516 7.50 6.02 Intr + 153533 153661 129 0 0 58 41 73 0.269 0.27 6.03 Intr + 153925 153956 32 2 2 137 94 1 0.162 3.45 6.04 Intr + 164069 164127 59 1 2 115 101 -7 0.082 0.98 6.05 Intr + 165409 165523 115 1 1 36 92 76 0.113 3.25 6.06 Intr + 173207 173275 69 1 0 45 71 78 0.030 1.18 6.07 Intr + 175508 175589 82 1 1 74 68 84 0.553 4.21 6.08 Term + 177131 177204 74 0 2 122 38 47 0.622 1.17 6.09 PlyA + 180562 180567 6 1.05 7.06 PlyA - 180713 180708 6 1.05 7.05 Term - 184798 184519 280 0 1 64 40 126 0.700 0.32 7.04 Intr - 186727 186537 191 1 2 12 99 115 0.223 3.38 7.03 Intr - 199791 199666 126 0 0 37 105 33 0.010 0.78 7.02 Intr - 206208 206075 134 1 2 110 83 -6 0.020 1.46 7.01 Intr - 210792 210654 139 0 1 73 45 75 0.045 1.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_1|164_aa MRQKINKDIQDLNSALDQVDLVDIYRTLHPKSTEYTLFSAPHHTYSKIDHMVGSKTLLNK CKRMEIITNSLLYHSAIKLELKIKKLTQNHTTTWKLNNLLLNDYWINNKMKAEIKMFFKT NKNKDTTYQNLWDTFKAVCRGKFIALNARKRKQERSKIDTLTHN >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_1|495_bp atgagacagaaaattaacaaggatatccaggacttgaactcagctctggatcaagtggac ctagtagacatctacagaactctccaccccaaatcaacagaatatacactcttctcagca ccacatcacacttattctaaaattgaccacatggttggaagtaaaacactcctcaacaaa tgcaaaagaatggaaatcataacaaacagtctcttatatcacagtgcaatcaaattagaa ctcaagattaagaaactcactcaaaaccacacaactacatggaaactgaacaacctgctc ctgaatgactactggataaataacaaaatgaaagcagaaataaagatgttctttaaaacc aataagaacaaagatacaacataccagaatctttgggacacatttaaagcagtgtgtaga gggaaatttatagcactaaatgcccgcaagagaaagcaggaaagatctaaaatcgacacc ctaacccacaattaa >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_2|110_aa MFFLMSRSPALLPSLVGGRVFSASCGSPVFPISHLPSRVCPFQALLLSCTLPLPFRARVP IFPRAVSGREALTCYVTVRRPVPLAHGEALQHLQVLDGLDPSRPVNTFSL >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_2|333_bp atgttctttctgatgtcacggagcccagccttgctgcccagcctggtgggggggcgcgtt ttctcggcatcctgcggcagtcctgtgtttcccatctcccacctgccctctcgtgtttgc ccatttcaggctctcctgctgtcatgtactctgcccttgccttttcgggccagggtgccc atcttcccacgggcagtttctgggagagaggcgctcacctgctatgtcactgtgcggcgg ccagtgccgctggcccatggggaagctcttcagcacttgcaagtgctggatggcctggat cccagcaggcctgtgaacaccttcagcctctaa >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_3|945_aa MAKIQKTDITNASENVEQQELSFTAAIIRVMAKIQKKTAKIQKESKITSSIFLNKQPTGD LVRAPAIAYACPSGGTEFTGKRACLHAGVCAPRRAEGGVRAFQPLPIPPNAGLVSNGSTP PPLPPAPPSRVASDAVLTPPAADGAPTRRRAARRLGPPPRPRRPPFSDGLSPRPEWNAAA AAAPAPAPRRVKREPGDRSRPLGRAKRRSRPPRPAGAHHRRRRPTPEPDGSAARAGGRGS ARPPPPPPPPPPAQARPSVRARAAGPRGAAGAGPRRGGRARGPRGGGAWMDPRVAWIQPE QKGPANALWMQIWETSQGVGRGGSGFASYFCLNSPALDTAAAAGAAGRGSGGLGPALPAA SPPPPGPTAPAALPPALLTALGPAAEGARRLHKSPSLSSSSSSSSSNAESGTESPGCSSS SSSSASLGRPGGGRGGAFFNFADGAPSAPGTANGHPGPRGPAPAGSPSQHQFHPGRRKRE NKASTYGLNYLLSGSRAAALSGGGGPGAQAPRPGTPWKSRAYSPGIQGLHEEIIDFYNFM SPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPL QLLEQALRKHNVAEPCSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNYM KLHPRIDARRADENLGMLLVEFFELYGRNFNYLKTGIRIKEGGAYIAKEEIMKAMTSGYR PSMLCIEDPLLPGNDVGRSSYGAMQVKQVFDYAYIVLSHAVSPLARSYPNRDAESTLGRI IKVTQEVIDYRRWIKEKWGSKAHPSPGMDSRIKIKERIATCNGEQTQNREPESPYGQRLT LSLSSPQLLSSGSSASSVSSLSGSDVDSDTPPCTTPSVYQFSLQAPAPLMAGLPTALPMP SGKPQPTTSRTLIMTTNNQVRGPLAPFPLVAPGNSIRAVICTRGD >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_3|2838_bp atggccaaaatccagaagactgacatcaccaatgctagtgagaatgtggagcaacaggaa ctctcattcacggctgccatcatccgtgtaatggctaaaattcagaagaaaacggctaaa attcagaaggaaagtaaaattacaagcagcatttttctcaacaaacagcccactggtgac ctcgtgagggcaccagccatcgcctatgcctgtccttctggtggcacagaattcactgga aagcgcgcatgtctgcacgccggggtctgcgcgccgcggcgggccgagggcggcgtgcgg gccttccagccgctgcctattccacccaacgccggcctagtcagtaatggctcaacgcca ccgcctctccctccagcccctcccagtcgcgtagcttctgacgccgtcctcaccccgccc gcggccgacggggccccaacgcgcaggcgcgccgctcggcgcctgggcccgccccctcgg ccccgccgcccgcccttctccgatggcctctccccgcggcccgagtggaacgccgccgcc gccgcggcccccgcgcccgccccgcgccgcgtgaagcgggagcccggagaccgcagccgc ccgctgggacgcgccaagcgccggagccgcccgccgcggcctgccggggcccatcaccgc cgccgccgccccacgccggagcccgacgggagcgcggctagagcaggaggccggggctcg gcccgcccgccgccgccgccgccgccgccgccaccggcccaggcccgtccgtccgtccgt gcgcgcgcggccgggcctcggggcgcggcgggggcggggccgcgtcggggcgggcgggcg cgcgggccccgcgggggcggcgcgtggatggatccgcgcgtggcctggatccagcccgag cagaaggggccggccaatgccctgtggatgcagatctgggagacctcgcagggcgtgggc cgcggcggctcgggcttcgcgtcctatttctgcctcaactcgccggcgctggacacggcg gccgcggcgggggcggccgggcggggcagtggcggcctgggccccgcgctgcccgccgcg tcgcccccgccgcccggccccaccgcgcccgccgcgctgccccccgcgctgctgacggcg ctggggcccgcggccgagggcgcgcggcgcttgcacaagtcgccgtcgctgtcgtcctcg tcgtcgtcctcctcgtccaacgcggagtcgggcaccgagagccccggctgctcgtcgtcg tcctccagcagcgcctcgctgggccggccgggcggcggccgcggcggcgccttcttcaac ttcgccgacggcgcgcccagcgcccctggcacagccaacgggcaccccgggccgcgcggc cccgcgcccgccggctccccgtcgcagcaccagttccacccgggtcgccggaaacgcgag aacaaggccagcacctacggcctcaactacctgctgtccggcagccgcgcggccgctctc agcggagggggcggccccggggcccaggcgccgcggcccggcaccccgtggaagagccgc gcgtacagcccgggcatccagggactacatgaggaaataattgacttttataacttcatg tccccttgtcctgaagaagcagctatgagaagagaggtggtgaaacggatcgaaactgtg gtgaaagacctttggccgacggctgatgtacagatatttggcagctttagtacaggtctt tatcttccaactagcgacatagacctggtggtcttcgggaaatgggagcgtcctccttta cagctgctggagcaagccctgcggaagcacaacgtggctgagccgtgttccatcaaagtc cttgacaaggctacggtaccaataataaagctcacagatcaggagactgaagtgaaagtt gacatcagctttaacatggagacgggcgtccgggcagcggagttcatcaagaattacatg aagttgcatccaagaattgatgcccggagagctgatgaaaaccttggaatgcttcttgta gaattttttgaactctatgggagaaattttaattacttgaaaaccggtattagaatcaaa gaaggaggtgcctatatcgccaaagaggagatcatgaaagccatgaccagcgggtacaga ccgtcgatgctgtgcattgaggaccccctgctgccagggaatgacgttggccggagctcc tatggcgccatgcaggtgaagcaggtcttcgattatgcctacatagtgctcagccatgct gtgtcaccgctggccaggtcctatccaaacagagacgccgaaagtactttaggaagaatc atcaaagtaactcaggaggtgattgactaccggaggtggatcaaagagaagtggggcagc aaagcccacccgtcgccaggcatggacagcaggatcaagatcaaagagcgaatagccaca tgcaatggggagcagacgcagaaccgagagcccgagtctccctatggccagcgcttgact ttgtcgctgtccagcccccagctcctgtcttcaggctcctcggcctcttctgtgtcttca ctttctgggagtgacgttgattcagacacaccgccctgcacaacgcccagtgtttaccag ttcagtctgcaagcgccagctcctctcatggccggcttacccaccgccttgccaatgccc agtggcaaacctcagcccaccacttccagaacactgatcatgacaaccaacaatcaggta cgtggccctctggcacccttcccgctggtggcccctgggaacagcatccgagctgtgata tgcactagaggagattga >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_4|215_aa MHGMPRCNQVSIASLRKGVLVSALLLLTVTGDADIEAAWDLDSRQGRNCQPSHGLSGHQL GQPWAVTPEFSVQAGVEGTASLKAVHHMSSPAIPSASPNPLSSPHLYHKQHNGMKLSMKG SHGHTQGGGYSSVGSGGSRVPPLYPVRYAEDPVNAVPRMPRPGREPYAGEVVTFVIQTSF RCDWVTCDALSPENFSGLPDLENENAKLQVKFEFK >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_4|648_bp atgcacgggatgccaagatgcaaccaggtcagtattgcaagtctgagaaaaggggttctc gttagcgcacttctgctgctgacagtaacgggtgatgctgacatagaagcagcctgggac ctggacagcaggcaaggaaggaactgccagccgtcccacggcctctcaggccaccagttg ggccagccttgggctgtgacccctgagttcagcgtacaagctggtgtagaaggaactgcg tctttgaaagccgtccaccacatgtcttccccggccattccctcagcgtcccccaacccg ctctcgagccctcatctgtatcataagcagcacaacggcatgaaactgtccatgaagggc tctcacggccacacccaaggcggcggctacagctctgtgggtagcggagggtctcgagtg ccacctctctaccctgtgagatacgcagaggaccctgtaaatgccgtgccccgcatgcct cggcctgggcgtgagccgtacgcgggggaggtggtgacgtttgtaatccagacgtcgttt cgttgcgactgggtcacctgtgatgccctaagccctgagaacttttcaggattgccagat ttagaaaatgaaaacgcaaaactccaagttaaatttgaatttaagtag >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_5|331_aa MTRQHVQATIQILLNGALALVVGAATLCLLVITFFHNITARDRLSFSNSFLQENLILISA RRHILLWGAAGKATDVAGAVGRDQLHHQIPTTLLTPVSRRKLPECGGRQGEIQTYVHTKN LRVNVPSGILQKSHKVEATGNPISGCEDRGRWRSRTAGIIRAETDPASSSQKSKEAKTQG GNQPAPGHRGHPAYGQQPKPHAVAPAKLASHVLSPFLCQNRTESPRVPDELNTKCSEALA TGISVHQQPGPFRVDALTNGSAGCTGSMAPTFASGEDLGSFHSWQEAKGEQVPHLAREGA REREEVPGSLNNQNLRELVVRTHYHEDSSNH >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_5|996_bp atgacacggcagcacgtccaggcgaccatccagatccttctgaacggagccctcgcactg gttgttggtgctgccactctttgtctgcttgtcatcactttctttcacaacatcactgca cgtgaccggctttccttcagcaactctttcttacaagaaaacctgattctcatctctgct aggcggcacatccttttgtggggagctgcgggaaaagccactgatgtggcgggggctgtg ggcagggaccagctccatcaccagattccaaccacgctgctgaccccagtgtcacgcagg aaactccccgagtgcggtggacgacagggagaaatacaaacatacgtccacaccaaaaac ctgcgtgtgaatgttcctagcggcattcttcaaaagagccacaaagtggaagcgaccgga aatcccatcagcggatgcgaggacagaggaaggtggcgcagcaggacagctggcatcatt cgagcggaaacggatccagcttcctcctcccagaagagcaaagaggcaaaaacacaaggc ggaaatcagccagcccctggccacagaggacatcctgcctatgggcagcagcccaagcca catgcagtggcccccgccaagcttgctagccatgtcctttcccccttcctgtgccagaac aggacagagtctcccagagtccccgatgagctgaacaccaaatgctcggaggccctggcc acagggatcagtgtccaccagcaacctggtccattcagggtggacgctttgaccaatggt tctgcaggctgtacaggaagcatggcaccaacatttgcatctggtgaggacttgggaagc ttccactcatggcaggaggcaaagggggagcaggtgcctcacctggcgagagaaggagcc cgagagagggaggaggtgccaggctctttaaacaaccagaacttgcgtgaactcgtagtg agaactcattaccacgaggacagcagcaaccactga >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_6|240_aa MNPTFVVIQDEVLQKITGQCSVNIERGWWALDIQPNSGFPNLLLRVMRSEPIRKAFSLDL CNHNPISSPCSDNSAAKRGAAIKIKSLQLQGQTHQPPALATLSHHLLQMGPSSCRKTSSG FPLNSTLWKVVSEEKRKRVEASFRMEEKKTVGVGQVPFDGVWSLQQLRVKKLDMEVVRNS SVVAAAKWQMVPPEDLDPSTTLGKMVTFWQQGQLVHCTGVLEPAVTSVPNFTLANIKSVT >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_6|723_bp atgaacccgacatttgtagttatccaagatgaggtgctccaaaaaatcacgggccagtgt tcagtgaacatcgaacgtggttggtgggctctagatattcagccgaattcaggatttcct aacctcctgctgcgggtgatgcgttcagagcccatcaggaaagccttctccctggacctc tgcaaccacaaccccatctcctctccctgctctgataactctgctgcaaagagaggagca gcaatcaaaataaaatccctgcagctccagggccaaacgcaccagccgccggccctggcc acgctttcccatcacctcctccagatgggaccatctagttgcagaaaaacaagctcaggg ttcccactgaattctacattatggaaggtggtcagtgaagagaagaggaagagagtagaa gcttcttttaggatggaggagaagaagacagttggagtgggacaagtcccatttgatggt gtgtggtcgctacagcagttacgggtgaagaagctggacatggaggtagtaaggaactcg tcagtggtggctgctgctaagtggcagatggttcctcctgaggacctggatcctagcacc acccttggaaaaatggtgactttctggcagcaaggccagctggtgcattgcacaggtgtg ctggagccagctgtgacctctgttcctaacttcacgctcgcgaacattaagtcagtcact taa >gi568815593f:6637544_6854942|GENSCAN_predicted_peptide_7|289_aa LCARSKRTTIPDRGGLVFSQGYTSSCAPLLEPPKKLSKEDLQTANKSSLHTDCSQAPLQP LSCPQLALLALVQVHNSSRPSYTQRPPHPHKEQLLCLATAMKVHCCLLHSDFGSSCVCST PAPIAAANLGEMEKYTRNCIGAQEHQLAGPSEAAEAGDVQVSSMKQKIKEEPGRDQSKKK SSVYAEKSQHRVIPPRSQGNCERLDCAPPRCEAATLLFQKPWVPSPSSWLCSHLTDVGWS ITVAIASSRATDRMHDFPPEPVSVCLPVAPSHQAGKSAFYVLKQMQLCL >gi568815593f:6637544_6854942|GENSCAN_predicted_CDS_7|870_bp ctgtgtgctcggtcgaagaggacgaccatacccgatagaggaggactggtcttcagtcaa gggtatacgagtagctgcgctcccctgctagaacctccaaagaagctctcaaaagaagac ctacaaacagccaacaaatcctctctccacactgactgttctcaagcccccttgcagccc ttgagctgcccacagctagccctgctagccctggtccaggttcacaatagctctagacct tcctatacacagagaccgccccatccccacaaggagcagttgctctgtctggccacagcc atgaaagtgcactgttgtcttttacactcagactttgggtccagctgtgtttgcagcacc ccagcccccatagcagctgccaatctgggggagatggagaaatatacccggaattgtata ggagctcaggaacatcaacttgcagggccatcggaagctgctgaggctggagacgttcag gtgagctccatgaaacagaagatcaaagaagagccggggcgggatcagagcaagaaaaag agctctgtttatgccgagaagtcgcagcacagagtgataccacccaggagccaaggcaac tgtgagaggctggactgtgcacctccacgctgtgaagcagcaactctcctgtttcagaaa ccatgggttccatcgcccagctcctggctctgctcccacttaacagacgtgggttggagc atcactgtggccatcgcatcctcgcgtgcaacagacaggatgcacgatttccctccagag cctgtgtcagtctgcttgcctgtggcaccttcacaccaagcagggaagtctgcattttat gtcttgaagcaaatgcagctgtgcttatag