GENSCAN 1.0 Date run: 6-Nov-116 Time: 02:03:12 Sequence gi568815584f:100627069_100834893 : 207825 bp : 52.64% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3539 3597 59 1 2 83 66 63 0.051 4.33 1.02 Intr + 7114 7330 217 1 1 41 99 93 0.173 4.73 1.03 Intr + 9248 9290 43 1 1 79 74 27 0.369 -1.30 1.04 Intr + 12884 13080 197 0 2 83 84 83 0.509 7.05 1.05 Term + 18841 18963 123 0 0 83 44 56 0.159 -0.61 1.06 PlyA + 19256 19261 6 1.05 2.00 Prom + 22676 22715 40 0.19 2.01 Init + 24884 24900 17 1 2 78 106 5 0.045 1.06 2.02 Intr + 26596 26689 94 1 1 117 98 7 0.053 5.07 2.03 Term + 34431 34610 180 2 0 76 55 209 0.998 14.33 2.04 PlyA + 35145 35150 6 1.05 3.03 PlyA - 35556 35551 6 1.05 3.02 Term - 36467 36369 99 2 0 100 47 76 0.647 3.13 3.01 Init - 44285 44232 54 2 0 81 103 42 0.663 6.26 3.00 Prom - 44631 44592 40 -3.01 4.00 Prom + 44710 44749 40 -12.47 4.01 Init + 44859 45067 209 2 2 75 90 150 0.645 12.17 4.02 Term + 45141 45207 67 0 1 96 46 50 0.539 -0.70 4.03 PlyA + 45685 45690 6 1.05 5.09 PlyA - 47267 47262 6 1.05 5.08 Term - 49068 48937 132 0 0 81 43 65 0.483 -0.30 5.07 Intr - 54194 53923 272 0 2 107 44 155 0.699 10.80 5.06 Intr - 54729 54661 69 1 0 109 30 46 0.456 0.55 5.05 Intr - 61774 61588 187 1 1 73 103 121 0.634 11.88 5.04 Intr - 64253 64191 63 2 0 75 90 46 0.298 2.91 5.03 Intr - 67952 67795 158 0 2 79 113 70 0.848 8.84 5.02 Intr - 69992 69684 309 0 0 57 16 147 0.158 1.23 5.01 Init - 75132 75084 49 0 1 88 77 24 0.339 2.35 5.00 Prom - 78617 78578 40 -4.31 6.00 Prom + 80932 80971 40 -2.71 6.01 Init + 87196 87335 140 0 2 80 -1 183 0.434 8.23 6.02 Intr + 90162 90239 78 0 0 107 75 17 0.527 1.46 6.03 Intr + 92623 92714 92 1 2 92 49 58 0.420 2.44 6.04 Intr + 93766 93818 53 2 2 57 92 27 0.300 -0.88 6.05 Term + 95037 95198 162 2 0 68 37 90 0.444 0.15 6.06 PlyA + 96047 96052 6 -0.45 7.00 Prom + 99788 99827 40 -6.70 7.01 Init + 100001 100067 67 1 1 81 101 157 0.942 15.51 7.02 Intr + 100533 100634 102 1 0 87 53 40 0.671 1.05 7.03 Intr + 101328 101391 64 1 1 70 90 65 0.985 3.17 7.04 Intr + 101868 101998 131 0 2 121 36 49 0.969 3.84 7.05 Intr + 103083 103226 144 1 0 66 20 105 0.811 2.26 7.06 Intr + 104974 105115 142 2 1 84 66 217 0.977 19.02 7.07 Term + 107081 107828 748 2 1 93 47 1636 0.986 153.37 7.08 PlyA + 110207 110212 6 1.05 8.00 Prom + 118627 118666 40 -1.91 8.01 Init + 122707 122764 58 0 1 79 100 59 0.470 7.81 8.02 Intr + 145293 145452 160 1 1 69 48 93 0.529 2.96 8.03 Intr + 145763 145830 68 2 2 88 50 70 0.172 2.24 8.04 Intr + 148521 148639 119 1 2 103 93 38 0.904 6.49 8.05 Intr + 150339 150414 76 2 1 112 41 13 0.457 -1.42 8.06 Intr + 152587 152712 126 2 0 54 119 63 0.531 7.16 8.07 Term + 161727 161839 113 1 2 120 48 26 0.035 0.83 8.08 PlyA + 162685 162690 6 1.05 9.00 Prom + 169413 169452 40 -1.41 9.01 Init + 176196 176310 115 2 1 87 74 87 0.574 7.53 9.02 Term + 177318 177442 125 1 2 66 50 116 0.977 4.45 9.03 PlyA + 179292 179297 6 1.05 10.08 PlyA - 186297 186292 6 1.05 10.07 Term - 187212 187127 86 1 2 68 39 127 0.938 3.91 10.06 Intr - 188587 188478 110 0 2 76 78 88 0.872 7.03 10.05 Intr - 190939 190761 179 1 2 112 24 37 0.043 -1.07 10.04 Intr - 193370 192829 542 0 2 11 96 216 0.519 7.92 10.03 Intr - 197631 197120 512 2 2 58 -4 222 0.253 3.21 10.02 Intr - 198539 198365 175 0 1 51 32 83 0.063 -1.49 10.01 Intr - 205538 205386 153 0 0 112 86 91 0.811 11.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_1|212_aa MEKQLKTKKMEVATRRPDHSALILPAHWQLLLISWSPQLDAAESKRCCKSYLGRPTLAQG WHLSSQVAALSLAEAFWKLVLRSADYSTRTKQAGSGPVGSVDPTVAVVPNPHQSKNPQGW AIWQMGHCEKGSGRENALGAVLEGRSWLLPACPRAHAEPDMEEPLNQLVNVRRPLLTKLN VVSTGKENHKGTSSFFNTTANAGQIRSWETIN >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_1|639_bp atggaaaagcagctgaagacaaagaaaatggaagtggccaccaggaggcctgatcatagt gctttgatcctgcccgcccactggcagctgctcctcattagctggtccccacagctggac gcggccgagtcaaagcgttgttgcaagagctacctgggcaggccaactcttgcccagggc tggcatctcagcagccaggtggctgctctgagcctggcagaggctttctggaagctggtg ctcaggagtgcagattatagcaccagaaccaaacaggcaggaagtggccccgttggctct gtggatcctacagtggccgtggtacccaacccccaccagagcaagaatcctcaaggctgg gccatatggcagatggggcactgcgagaaagggagcgggagggagaacgcgttgggggca gtcttggaaggcaggtcgtggcttctacctgcatgtccccgagcccatgcagagcctgac atggaggagcccctcaaccaacttgtgaacgtcaggcgccctctgctgacaaagcttaat gtggtgtcgacaggcaaagaaaatcacaaaggcaccagctcattttttaatacaacagcc aatgcagggcagattcggagctgggagacaataaattga >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_2|96_aa MQALIKSNLGNSYTFNPYCALSSEDGRSMSPSLQPCKSFILGDTRFAMYGEPIAWSPSWQ DVKPEVLGVLELLPGVRNPRKLKCGKRGSSNVNEDD >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_2|291_bp atgcaggctttgataaagagcaacttggggaattcctacacatttaatccctactgtgcg ctaagtagcgaagatggtcgttccatgtcaccctccctgcaaccttgcaagtctttcatc ttgggagacacgcggttcgcaatgtacggggagcccattgcttggagcccaagctggcag gatgtcaaacctgaagtactaggcgtcttggagcttctgcctggagtacgaaatccgagg aaactgaagtgtggaaaacggggcagtagcaatgtgaatgaggatgattga >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_3|50_aa MALPLLPPCFNVSEVDSQKCGLRSVSVGQRWSRVQAVPTAPLVAEVGKFI >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_3|153_bp atggccctgcctttgctgccgccgtgcttcaatgtttcagaggtggattctcagaaatgt ggcttacgttcggtttccgttggacagcgctggtctcgagttcaagctgtgccaacagcg cccctggtggccgaagtgggtaaattcatctga >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_4|91_aa METSKIILEKMQSDDVLDGNRERSNEREGRDSLSEKLKSKQNLKDEEKLRYIKTGKSIQV EGTVRAKALRAQKAKAMFPRGPVAKVLHGPG >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_4|276_bp atggaaacaagcaagatcattctggagaaaatgcaatcagatgatgtgctagatggtaac agagagaggtcgaatgagagagagggccgtgacagcctctctgagaagctgaaatccaag cagaacctgaaagacgaggagaaactcagatacataaagactgggaaaagcattcaagta gagggaacagtgcgtgcaaaggccctgagagctcagaaagctaaagcaatgttccccaga ggtcctgtggccaaggtcctgcatgggcctgggtga >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_5|412_aa MHRTGPHKESSGPKGQVSTKSNTLPVSMITTTPKLSSSTRVWQATEPDKKTLSRRCKKPA PGSNIYMLVMRVPKEQHQRVLLLSGERAWLGSVLWKAQSVHNHTGRSKTSCLPTISGEDG IIAVPSSEGPWEGLGVYIRANVLDLRQTHSKHRLELTTASVSTTQAQLGDTKCVLIHSFI TSIFKAGYCERCRPLPASSNRPINPPNLEQPGPQQPTLAHADRREATQAEALLQTPKATG LLFTRKWRAGEGRCYAAHTQCHCPPPASTPTPKDAKDPGTLHISPSGPQRAAQGRVCIMD PSAWSLLFPAAGVPPCYPGPRSDEKPDADSVLLPLRSQSEPVISLSNHMPCPPGALWDTE VSVISVSLMASVSRGEGSSPLGAPGINPVCRASSRSVLEGRVDIAGFGGMSR >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_5|1239_bp atgcacaggacaggcccccacaaagaatcctccggccccaaaggccaagtgtccactaag tccaacactctccctgtctccatgattaccacaacacccaaactctccagctccacaagg gtttggcaggccactgaaccagacaagaagactctgagccgccgctgcaagaaaccagcc cctggcagcaatatttacatgctggtcatgcgtgttccaaaggagcagcaccagagggtt ttgctactgagtggagagcgcgcctggctcggatcagtcctgtggaaagcacagtctgtg cacaatcacacgggcagatcaaagaccagctgccttcccacaatctcgggggaagatgga ataatagcagtgccttcctctgagggaccctgggagggcttgggagtttacatccgtgca aatgtgctggatctacgccagacacatagtaagcacaggctggagctgaccactgctagc gtttccacgacccaggcccaactgggggacacaaagtgtgtactaatccattcattcatc acgtccatctttaaagcgggctactgcgagcgctgcaggcctctgcctgcctccagcaac aggccaattaatccccccaacctggagcagccggggccacagcagcccacgctggcccac gctgacaggagagaagcgacccaagctgaggccttgctgcagacgcccaaggccacaggt ctgcttttcaccaggaagtggagggctggtgaaggtcggtgttacgctgctcacacccag tgccactgccccccgccagcttctacccccactcccaaggatgccaaggacccaggcact ctccacatcagcccctccggtccccagcgggctgctcagggacgtgtttgcatcatggac ccttctgcctggagcctgctgtttcctgcagccggggtgcccccttgctaccctggccca cgttcggacgagaagccagatgcggactccgtgttgctccctctcaggtcccagtcagag cctgtgatatcactgtccaatcacatgccgtgtcccccgggggccctctgggatacagaa gtgtcagttatctctgtgtccctaatggccagtgtgagccggggtgagggctccagcccc cttggagcgcccggcatcaaccccgtgtgccgggcaagctccagaagtgtgttggaaggc agggtcgatattgccggctttggggggatgagtcgataa >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_6|174_aa MQAIGAAETGALNHEEAVVAPPQGIPAHVLSVAQLADNPSAMWRETGTTLCLPPLSVHGV ACLDGLYQLATASWILAADSQLGPLEVQCCHRNRLMESVSEEPGCVACASEGLHELGPTG EHPAALENPGVGLELSYASSGAKHKARLSMCPSAAVLPARLSTMPLHTEDLFSD >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_6|525_bp atgcaggccatcggagccgccgaaacaggcgccctgaaccacgaggaagccgtggtggct cctccccagggaatccctgcccacgtcctgtcggtggcccagctggcagacaacccgtca gccatgtggagggagacaggaaccaccctctgtcttccacccctgtctgtgcatggggtg gcctgcctagatggtctgtatcagctggctacagccagctggatcctggctgcagacagc cagcttgggcctctggaggtgcagtgttgccataggaacaggctcatggagagcgtctca gaggaaccagggtgtgttgcctgtgcctctgaaggtctccacgagctgggccctactggg gagcaccctgccgccctggagaaccctggagtgggcctggaactgtcatacgcctcttca ggcgccaagcacaaggccaggctcagcatgtgtccctctgcagccgtactcccagctcgg ctgagcacgatgccacttcacactgaagaccttttctctgattaa >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_7|465_aa MTATEALLRVLLLLLAFGHSTYGEGLRHFCLQRPSVSDSERVAPFLQRPPTHCTSGWAEC FPACNPQNGFCEDDNVCRCQPGWQGPLCDQCVTSPGCLHGLCGEPGQCICTDGWDGELCD RAPAIWCKVYPQLTHALASISAAPHRSSGSSSESFEDLGDSSKATIWDQDVRACSSAPCA NNRTCVSLDDGLYECSCAPGYSGKDCQKKDGPCVINGSPCQHGGTCVDDEGRASHASCLC PPGFSGNFCEIVANSCTPNPCENDGVCTDIGGDFRCRCPAGFIDKTCSRPVTNCASSPCQ NGGTCLQHTQVSYECLCKPEFTGLTCVKKRALSPQQVTRLPSGYGLAYRLTPGVHELPVQ QPEHRILKVSMKELNKKTPLLTEGQAICFTILGVLTSLVVLGTVGIVFLNKCETWVSNLR YNHMLRKKKNLLLQYNSGEDLAVNIIFPEKIDMTTFSKEAGDEEI >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_7|1398_bp atgaccgcgaccgaagccctcctgcgcgtcctcttgctcctgctggctttcggccacagc acctatggtgaggggctgcgacacttctgtctgcagcggccatctgtctctgacagcgag agagttgcccccttcctgcagcgcccccccactcattgcaccagtggttgggctgaatgc ttcccggcctgcaacccccaaaatggattctgcgaggatgacaatgtttgcaggtgccag cctggctggcagggtcccctttgtgaccagtgcgtgacctctcccggctgccttcacgga ctctgtggagaacccgggcagtgcatttgcaccgacggctgggacggggagctctgtgat agagcccctgcaatatggtgcaaggtttacccgcagctgactcatgctctggccagcatc tctgctgcccctcacagaagcagcggcagcagctctgagtcgtttgaggatctgggggat tccagcaaagccaccatttgggatcaggatgttcgggcctgctcctcggccccctgtgcc aacaacaggacctgcgtgagcctggacgatggcctctatgaatgctcctgtgcccccggg tactcgggaaaggactgccagaaaaaggacgggccctgtgtgatcaacggctccccctgc cagcacggaggcacctgcgtggatgatgagggccgggcctcccatgcctcctgcctgtgc ccccctggcttctcaggcaatttctgcgagatcgtggccaacagctgcacccccaaccca tgcgagaacgacggcgtctgcactgacattgggggcgacttccgctgccggtgcccagcc ggcttcatcgacaagacctgcagccgcccggtgaccaactgcgccagcagcccgtgccag aacgggggcacctgcctgcagcacacccaggtgagctacgagtgtctgtgcaagcccgag ttcacaggtctcacctgtgtcaagaagcgcgcgctgagcccccagcaggtcacccgtctg cccagcggctatgggctggcctaccgcctgacccctggggtgcacgagctgccggtgcag cagccggagcaccgcatcctgaaggtgtccatgaaagagctcaacaagaaaacccctctc ctcaccgagggccaggccatctgcttcaccatcctgggcgtgctcaccagcctggtggtg ctgggcactgtgggtatcgtcttcctcaacaagtgcgagacctgggtgtccaacctgcgc tacaaccacatgctgcggaagaagaagaacctgctgcttcagtacaacagcggggaggac ctggccgtcaacatcatcttccccgagaagatcgacatgaccaccttcagcaaggaggcc ggcgacgaggagatctaa >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_8|239_aa MVLAPASDEGLMVLPLLVEALFTVPLRNEELLLGQKPTRCSLLGESLTLGEMFTEPQGVF PDGPMDLHAAGTRPLSNPTTVLLDEAVDAERGAVAESNLKNQRPELQTKLPSHLLGFEEP AFPAHSSPAVPTAGLSAKWAIFLFPSKQGLEGVELRQGCAGVSGKEPVLTTPHMEGMRLD HTRTSLPPEPGAAGLDAERFPSGPPRPCSEALPILEQADLAKFLYWKINHIEILFKTHR >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_8|720_bp atggtgctggcacctgcatctgatgagggcctcatggttcttccactcctggtggaagcg ctcttcacagtgcctctccgaaatgaagagcttctcttggggcagaaaccaacacgctgc tcccttctaggagaaagtctcactctgggagaaatgttcactgagcctcaaggtgtgttc ccagacggtcccatggatttacacgccgccggcaccaggcccctgagcaatccaactact gtccttttggatgaggctgtggatgctgagagaggggctgtggctgagtccaatctcaag aaccaaagacctgaactgcagacgaagcttccttcccacctcctgggctttgaggagcct gcattccctgcccactcttctcctgctgtgcccacagcagggctgagtgccaaatgggcc attttcctcttccccagcaagcagggcctggaaggggtggaactaaggcaaggctgtgca ggggtctcaggcaaggagcccgtcctcactacaccacacatggagggcatgaggctggac cacaccagaacgtccctccccccggagcctggagctgcaggtcttgatgcggagaggttt ccatctgggccacccagaccttgctctgaggctctcccgatcctggaacaagcagatctt gccaagttcctgtattggaagataaatcatattgaaattctctttaaaacccatagatga >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_9|79_aa MAGYRAEGYALEPPTNEKTAEFAGPRSPFPSHIPLVCTSLPGKRLPGPHRKAGAARRSPG PDGEAVLLVSVDHPPTVEP >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_9|240_bp atggctggctacagggctgaaggctatgccctggaaccaccaacaaatgagaaaacagca gagtttgctggacccagaagcccattcccctcccacatccctctggtgtgcaccagtctc cctggtaaacggcttcctggtcctcatcggaaggcaggggccgcccggcgctcccctgga cctgacggggaggctgtcttgctcgtgtccgtggaccatccacccacagtggagccctag >gi568815584f:100627069_100834893|GENSCAN_predicted_peptide_10|585_aa XKEGSLCDEYWNPAANLINVCSLFLRQGPRLALCSLREKGGKWNKTEKHRQAHPCVSDWF SLTWETELRVSESENPGCGVPGSAAYNSCSRNPLCPFLFHWVRFLHLNQGRTVGEIETWD VGVLMGRCFNMSGNTKEKENTPFTACGACQEPQGLEPRKQALHSRNARTSHRRQERFKKS WLRAPASTGERLLATVLIARKQPELAGANQQPGAPVVRPPRPIRRRQARLLASRGQRRGA RIDKGLAFTRRAWELWWPPESIRSHAPGGPGGCRRHSASPLYHHCHRCLPPLPTTTAYHH CCHHCHCCHHCHCCLPPLPATAAYHCCLAPLPTTTAYHCCLPPLPGTTAYHRCPPLLPAT AAYHCLPPLPPLPITAACHHCCHRCLPPLPATAALHCCHHCCLPLPATNATAAYHRCLPP LPTASVYYRCLPPLPATAACHRCLTLKGDENQTEGSTWQCRRAPAAPRRAGLTVPPTPQE CRAQNLYIGLLFAGTAPGLADKGCEPLWPPPPAFPGNPRPGGFRRPAILTKGEEHVHLEW EGSLLPIPGPQKCGPGREILAQRKETQSDKDIRHSVISLQTEYSI >gi568815584f:100627069_100834893|GENSCAN_predicted_CDS_10|1758_bp nacaaagaaggatccttatgcgacgaatactggaatccagcagctaacctcattaacgtc tgcagcctcttccttcgacaaggcccgagattagccctgtgttcattaagggaaaagggc ggaaagtggaataaaactgaaaaacaccgccaggcccacccgtgcgtttctgactggttc tctttgacatgggagacagagctccgcgtctccgagtcagaaaaccctggctgtggagtc ccgggctctgccgcttacaactcatgttcgcggaacccgctatgccccttcctcttccac tgggtccgcttcctccaccttaaccaggggcgcactgttggggagatagagacgtgggac gttggggtcctcatggggcgctgctttaatatgtcaggtaacacgaaggagaaagaaaac accccctttacagcctgtggagcttgccaggagccccagggactagagccccggaagcag gccttgcacagcagaaacgcccgcaccagccacagacgtcaagaacgcttcaaaaaatca tggctccgagcacccgcgtcaacaggagaacgtttattagccacagtattaatagctagg aaacagccagagctggccggggccaatcagcaaccaggagcccctgtcgtcaggccccca cggccaatcagacgccggcaggcccggttgctagcctctcgggggcagaggcgaggggca cgcattgacaaggggctcgcattcacccgccgcgcgtgggaactgtggtggccgccagag agcatccgcagccacgcacccgggggcccgggaggctgtcgtagacacagtgccagcccg ctctaccaccactgccaccgctgcctgccaccgctgcctaccaccactgcctaccaccac tgctgccaccactgccactgctgccaccactgccactgctgcctaccaccactgcctgcc accgctgcctaccactgctgcctggcaccgctgcctaccaccactgcctaccactgctgc cttccaccgctgcctggcaccactgcctaccaccgctgccctccactgctgcctgccacc gctgcctaccactgcctgccaccactgccaccgctgcctatcaccgctgcctgccaccac tgctgccaccgctgcctgccaccgctgcctgccactgctgccctccactgctgccaccac tgctgcctaccactgcctgccaccaatgccaccgctgcctatcaccgctgcctgccacca ctgcctaccgcatcagtctactaccgctgcctgccaccgctgcctgccaccgctgcctgc caccgctgtctgacgctgaaaggagatgagaatcaaacagagggcagcacatggcagtgc aggagagcccccgctgccccaaggagagcaggcctcacagtgccacccactccacaagaa tgcagagctcagaacctttacatcgggctcctgtttgcaggaacggctccaggcctggct gacaagggctgtgagcctctctggcctccaccaccagcctttcctgggaatcccaggccg gggggtttccgcaggcctgccatcctgaccaaaggggaagagcacgtccacctggaatgg gaaggctccctcctgcccatcccagggccacagaaatgtggacctggaagggaaatactt gcacagcgcaaagagacacagagcgacaaggacatccggcacagcgttatcagtctgcaa actgagtacagcatttag