GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:38:59 Sequence gi568815593f:31432393_31652899 : 220507 bp : 41.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.27 Intr - 3472 3373 100 0 1 102 75 86 0.277 7.89 1.26 Intr - 4906 4847 60 0 0 81 91 63 0.546 2.83 1.25 Intr - 16215 16155 61 1 1 53 88 5 0.095 -6.13 1.24 Intr - 17027 16889 139 2 1 47 95 120 0.237 7.72 1.23 Intr - 19248 19141 108 0 0 85 110 8 0.120 2.26 1.22 Intr - 31951 31844 108 1 0 86 75 105 0.979 8.56 1.21 Intr - 33889 33790 100 0 1 109 58 39 0.828 2.19 1.20 Intr - 35671 35547 125 1 2 77 61 118 0.997 6.46 1.19 Intr - 39788 39671 118 1 1 42 92 167 0.108 12.05 1.18 Intr - 41606 41505 102 1 0 56 41 135 0.029 3.97 1.17 Intr - 51236 51134 103 0 1 99 85 14 0.294 0.51 1.16 Intr - 52570 52489 82 1 1 94 115 -8 0.558 0.79 1.15 Intr - 54170 54099 72 2 0 95 94 18 0.473 1.78 1.14 Intr - 56748 56590 159 0 0 60 64 72 0.282 1.26 1.13 Intr - 60901 60815 87 1 0 88 101 35 0.886 4.05 1.12 Intr - 62980 62894 87 1 0 142 83 82 0.962 12.35 1.11 Intr - 68304 68226 79 2 1 53 62 81 0.043 0.63 1.10 Intr - 70575 70478 98 0 2 118 86 10 0.048 1.69 1.09 Intr - 72243 72163 81 0 0 69 78 96 0.075 5.62 1.08 Intr - 76383 76229 155 1 2 30 96 285 0.992 22.37 1.07 Intr - 78784 78643 142 1 1 51 95 119 0.998 7.91 1.06 Intr - 80616 80440 177 0 0 58 64 66 0.532 0.39 1.05 Intr - 80955 80791 165 0 0 38 80 99 0.843 3.34 1.04 Intr - 82827 82596 232 2 1 62 66 327 0.726 24.75 1.03 Intr - 83172 83062 111 2 0 73 89 150 0.999 12.18 1.02 Intr - 88823 88731 93 1 0 50 121 61 0.887 3.76 1.01 Init - 94513 93687 827 1 2 52 97 257 0.352 17.23 1.00 Prom - 98687 98648 40 -10.55 2.00 Prom + 99593 99632 40 -2.85 2.01 Init + 100001 100081 81 1 0 67 83 38 0.582 2.23 2.02 Intr + 101880 102025 146 2 2 120 25 93 0.948 4.36 2.03 Intr + 105868 106297 430 1 1 76 99 343 0.951 26.99 2.04 Intr + 108889 109010 122 0 2 41 86 128 0.866 6.27 2.05 Intr + 111804 111873 70 0 1 102 31 58 0.031 -0.23 2.06 Intr + 119668 119768 101 0 2 86 111 94 0.967 9.49 2.07 Term + 120381 120510 130 0 1 106 47 131 0.999 7.47 2.08 PlyA + 122394 122399 6 1.05 3.04 PlyA - 123360 123355 6 1.05 3.03 Term - 133154 133021 134 0 2 7 48 154 0.634 0.57 3.02 Intr - 133970 133563 408 0 0 -3 71 353 0.956 17.81 3.01 Init - 137453 137399 55 2 1 84 92 43 0.713 5.80 3.00 Prom - 138405 138366 40 -8.65 4.00 Prom + 139447 139486 40 -6.95 4.01 Init + 139644 139730 87 2 0 84 16 120 0.677 5.09 4.02 Intr + 145981 146292 312 0 0 38 81 288 0.803 18.56 4.03 Intr + 154692 154757 66 2 0 101 79 53 0.031 3.88 4.04 Term + 160032 160073 42 2 0 97 53 40 0.017 -2.42 4.05 PlyA + 160302 160307 6 1.05 5.00 Prom + 163167 163206 40 -7.15 5.01 Init + 170981 171098 118 1 1 57 85 136 0.701 10.61 5.02 Intr + 179742 179928 187 1 1 39 63 132 0.023 3.63 5.03 Intr + 205460 205676 217 2 1 36 60 198 0.932 9.38 5.04 Intr + 206419 207087 669 0 0 62 51 460 0.793 30.67 5.05 Intr + 207425 207455 31 1 1 102 95 -25 0.598 -3.61 5.06 Intr + 207850 208061 212 2 2 28 97 131 0.603 5.71 5.07 Intr + 209878 209965 88 0 1 110 55 76 0.932 5.12 5.08 Term + 210981 211177 197 1 2 75 40 188 0.924 9.29 5.09 PlyA + 212777 212782 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 111804 111897 94 0 1 102 42 63 0.837 -0.58 S.002 Intr + 118901 119040 140 0 2 87 115 14 0.916 3.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:31432393_31652899|GENSCAN_predicted_peptide_1|1257_aa MSFHPGRGCPRGRGGHGARPSAPSFRPQNLRLLHPQQPPVQYQYEPPSAPSTTFSNSPAP NFLPPRPDFVPFPPPMPPSAQGPLPPCPIRPPFPNHQMRHPFPVPPCFPPMPPPMPCPNN PPVPGAPPGQGTFPFMMPPPSMPHPPPPPVMPQQVNYQYPPGYSHHNFPPPSFNSFQNNP SSFLPSANNSSSPHFRHLPPYPLPKAPSERRSPERLKHYDDHRHRDHSHGRGERHRSLDR RERGRSPDRRRQDSRYRSDYDRGRTPSRHRSYERSRERERERHRHRDNRRSPSLERSYKK EYKRSGRSYGLSVVPEPAGCTPELPGEIIKNTDSWAPPLEIVNHRSPSREKKRARWEEEK DRWSDNQSSGKDKNYTSIKEKEPEETMPDKNEEEEEELLKPVWIRCTHSENYYSSDPMDQ VVPGGRPWLVFTEWLSDGRPCCLPEHLPQLQDDAGLAGLRSLHTRILTRRMQRACWLGRS EVQHGPHWLHQDVSRSVLLYGGSGENLFSCSFRLLAEFSCLCLEDRGPCVLVGCQGDSTV VGTSRLRDLYDKFEEELGSRQEKAKAARPPWEPPKTKLDEDLESSSESECESDEDSTCSS SSDSEVFDVIAEIKRKKAHPDRLHDELWYNDPGQMNDGPLCKCSAKARRTGIRHSIYPGE EAPDQQAWSLASDQESIYILTSGHFSFHTKWMTSQPWPAEEATTDLPDDVCAPVTCTFVL AIKPCRPMTNNAGRLFHYRITVSPPTNFLTDRPTVIEYDDHEYIFEGFSMFAHAPLTNIN INSCDMYLCINTPLSHFRLPFEMNLLDVSSDTLPSTCVKLDSSSSLHIEFEIPLCKVIRF NIDYTIHFIEEMMPENFCVKGLELFSLFLFRDILELYDWNLKGPLFEDSPPCCPRFHFMP RFVRFLPGKSSGLSVTCDTDMGPLTVHGTYGEKHAPISESTPELLMSGKALCSKALVPEE EIANMLQWEELEWQKYAEECKGMIVTNPGTKPSSVRIDQLDREQFNPDVITFPIIVHFGI RPAQLSYAGDPQYQKLWKSYVKLRHLLANSPKVKQTDKQKLAQREEALQKIRQKNTMRRE VTVELSSQGFWKTGIRSDVCQHAMMLPVLTHHIRYHQCLMHLDKLIGYTFQDRCLLQLAM THPSHHLNFGMNPDHARNSLSNCGIRQPKYGDRKVHHMHMRKKGINTLINIMSRLGQDDP TPSRINHNERLEFLGDAVVEFLTSVHLYYLFPSLEEGGLATYRTAIVQNQHLAMLAK >gi568815593f:31432393_31652899|GENSCAN_predicted_CDS_1|3771_bp atgtcgttccacccgggacgagggtgtccccgaggacgaggaggacatggagccagaccc tcagcaccatcctttaggccccaaaatctgaggctgcttcaccctcagcagcctcctgtg caatatcaatatgaacctccaagtgccccttccaccactttctcaaactctccagccccc aattttctccctccacgaccagactttgtacccttccccccacccatgcctccgtcagcg caaggccctcttcccccctgcccaatcaggccgcctttccccaaccaccagatgaggcac cccttcccagttcctccttgttttcctcccatgccaccaccaatgccttgtcctaataac cccccagtccctggggcacctcctggacaaggcactttccccttcatgatgccccctccc tccatgcctcatcccccgccccctccagtcatgccgcagcaggttaattatcagtaccct ccgggctattctcaccacaacttcccacctcccagttttaatagtttccagaacaaccct agttctttcctgcccagtgctaataacagcagtagtcctcatttcagacatctccctcca tacccactcccaaaggctcccagtgagagaaggtccccagaaaggctgaaacactatgat gaccacaggcaccgagatcacagtcatgggcgaggtgagaggcatcggtccctggatcgg cgggagcgaggccgcagtcccgacaggagaagacaagacagccggtacagatctgattat gaccgagggagaacaccatctcgccaccgcagctacgaacggagcagagagcgagaacgg gagagacacaggcatcgagacaaccgaagatcaccatctctggaaaggtcctacaaaaaa gagtataagagatctggaaggagttacggtttatcggttgttcctgaacctgctggatgc acaccagaattacctggggagattattaaaaatacagattcttgggccccacccctggag attgtgaatcatcgctccccaagtagggagaagaagagagctcgttgggaggaagaaaaa gaccgttggagtgacaaccagagttctggcaaagacaagaactatacctcaatcaaggaa aaagagcccgaggagaccatgcctgacaagaatgaggaggaagaagaagaacttcttaag cctgtgtggattcgatgcactcattcagaaaactactactccagtgaccccatggatcag gtggtacctggagggaggccatggctggtgttcactgagtggctcagtgatggcaggccc tgctgtcttcctgagcatcttccacagttgcaggatgatgcaggactagcaggcctcagg tccttacacacccgcatcctcacacgcaggatgcagagggcatgttggcttgggaggtca gaagtccagcatgggcctcactggctccatcaggatgtcagcaggtctgtgctcctttat ggaggctctggagaaaatctgttttcttgctcattcaggttgctggcagagttcagttgc ttgtgtctggaggacagaggaccttgtgtccttgttggctgtcagggagattctacagtg gttggaacgagtaggcttcgtgacttatatgacaaatttgaggaggagttggggagcagg caagaaaaggccaaagctgctcggcctccgtgggaacctccaaagacgaagctcgatgaa gatttagagagttccagtgaatccgagtgtgagtctgatgaggacagcacctgttctagc agctcagactctgaagtttttgacgttattgcagaaatcaaacgcaaaaaggcccaccct gaccgacttcatgatgaactttggtacaacgatccaggccagatgaatgatggaccactc tgcaaatgcagcgcaaaggcaagacgcacaggaattaggcacagcatttatcctggagaa gaggcccctgaccagcaagcatggtcacttgcttctgaccaggagtccatctatattcta acttcaggccacttctctttccacacaaagtggatgaccagtcagccctggcctgcagag gaggctaccactgatcttcctgatgatgtttgtgctcctgtcacctgcacatttgttttg gccatcaagccctgtcgtcctatgaccaacaatgctggcagacttttccactaccggatc acagtctccccgcctacgaactttttaactgacaggccaactgttatagaatacgatgat cacgagtatatctttgaaggattttctatgtttgcacatgcccccctgaccaatatcaat attaatagctgtgacatgtacctctgtattaatactccactctcacattttagattgcct tttgaaatgaacctcctcgatgtttcctcagatactttaccctcaacctgtgtgaaacta gactcgtcatcttccctgcatattgaatttgagattccactgtgtaaagtaattagattc aacatagactacacgattcatttcattgaagagatgatgccggagaatttttgtgtgaaa gggcttgaactcttttcactgttcctattcagagatattttggaattatatgactggaat cttaaaggtcctttgtttgaagacagccctccctgctgcccaagatttcatttcatgcca cgttttgtaagatttcttccaggtaaatcttctggcttgtcagtgacctgtgatactgac atgggaccactcacagtgcatggcacatatggtgaaaaacatgctcccatctcagagtca actccagagctcttaatgagcggtaaagctttgtgcagcaaagccctggtgcctgaggag gagattgccaatatgcttcagtgggaggagctggagtggcagaaatatgcagaagaatgc aaaggcatgattgttaccaaccctgggacgaaaccaagctctgtccgtatcgatcaactg gatcgtgaacagttcaaccccgatgtgattacttttccgattatcgtccactttgggata cgccctgcacagttgagttatgcaggagacccacagtaccaaaaactgtggaagagttat gtgaaacttcgccacctcctagcaaatagtcccaaagtcaaacaaactgacaaacagaag ctggcacagagggaggaagccctccaaaaaatacggcagaagaatacaatgagacgagaa gtaacggtggagctaagtagccaaggattctggaaaactggcatccgttctgatgtctgt cagcatgcaatgatgctacctgttctgacccatcatatccgctaccaccaatgcctaatg catttggacaagttgataggatatactttccaagatcgttgtctgttgcagctggccatg actcatccaagtcatcatttaaattttggaatgaatcctgatcatgccaggaattcatta tctaactgtggaattcggcagcccaaatacggagacagaaaagttcatcacatgcacatg cggaagaaagggattaacaccttgataaatatcatgtcacgccttggccaagatgaccca actccctcgaggattaaccacaatgaacggttggaattcctgggtgatgctgttgttgaa tttctgaccagcgtccatttgtactatttgtttcctagtctggaagaaggaggattagca acctatcggactgccattgttcagaatcagcaccttgccatgctagcaaag >gi568815593f:31432393_31652899|GENSCAN_predicted_peptide_2|359_aa MSDSAGGRAGLRRYPKLPVWVVEDHQEVLPFIYRAIGSKHLPASNVSFLHFDSHPDLLIP VNMPADTVFDKETLFGVTSTDHYFLSDGLYVPEDQLENQKPLQLDVIMVKPYKLCNNQEE NDAVSSAKKPKLALEDSENTASTNCDSSSEGLEKDTATQRSDQTCLEPSCSCSSENQECQ TAASTGEILEILKKGKAFVLDIDLDFFSVKNPFKEMFTQEDLVDIVDTRIHQLEDLEATF ADLCDGDDEETVQRWASNPGKFVNICGGGIKTLEALANQSFQKKALFTLIRNQETLVNPV VTGSSIVTAQCNSGCSWSSLDDYCPSDQVDTIQEKVLNMLRALYGNLDLQVYAAESPPS >gi568815593f:31432393_31652899|GENSCAN_predicted_CDS_2|1080_bp atgagtgactccgcgggagggcgcgctggtctccggcgttaccccaagctcccagtgtgg gtggtggaggatcatcaggaggttctaccctttatataccgggccataggctcaaagcat cttcctgccagtaatgtaagttttttacatttcgactcacatccagacctccttattcct gtgaatatgccagcagacaccgtgtttgataaggaaacactctttggggttacaagtaca gatcattatttcctaagtgatggtctgtatgtacctgaagaccagctagagaaccaaaaa cctttacaattggatgtaattatggtaaaaccttataaactctgtaacaatcaagaagaa aacgatgcagtgtcttctgctaagaaaccaaagctagccctggaagattcggaaaacact gcctctactaactgtgactcttcttcagaaggactggaaaaggacacagcaacacagaga agtgaccagacttgcctagaaccatcatgttcatgttcttctgaaaatcaggaatgccag actgctgccagcactggggaaattctggaaattttgaagaaagggaaggcatttgtttta gatattgacttggattttttttcagtcaagaatcccttcaaagaaatgttcactcaggaa gatttggtagatattgttgatactcgaattcatcaattagaggatttagaagccactttc gctgatttgtgtgatggtgatgatgaagaaacggtacagagatgggcttcaaaccctggc aaattcgtgaatatttgtggaggaggaattaaaacactagaagcattagccaatcagagc ttccaaaagaaggctttattcacccttatccggaaccaggagacactggttaatccagtt gtaactggcagcagcattgtaactgcccagtgcaactcaggttgcagttggtcaagtctg gatgattactgtccttctgaccaagttgacactattcaagaaaaggtcctcaatatgcta cgtgccctctatggaaatctagacctccaagtgtatgcagcagagtctcctccatcttga >gi568815593f:31432393_31652899|GENSCAN_predicted_peptide_3|198_aa MPKHHTLGHQFLSPNTCVYLLNGFDQNVDSDVDNEVQAEVVSDRDDELIGNCSKGDSCYA LAKRLVAFCPYPRDLWNFELERDNLGYLIEEISKQQNIQEETEHKSLESLQPEDAIEKKN TFSGEKFKQAAEICISNEELNVNSKDNAGNVSRAESPLEHCLLELREEGHCLPDPRVVDP PTACTEHLEKPQIINASR >gi568815593f:31432393_31652899|GENSCAN_predicted_CDS_3|597_bp atgccgaagcaccatactttggggcatcagtttctgagccccaacacttgtgtatacttg ttgaatggttttgaccaaaatgttgatagtgatgtggacaatgaagtccaggctgaggtg gtctcagatagagatgatgaacttattgggaactgcagcaaaggtgactcttgctatgct ttggcaaagagactggtggcattttgcccctaccctagagatctgtggaactttgaactt gagagagataatttagggtatctgatagaagaaatttctaagcagcaaaacattcaagag gaaacagagcataaaagtttggaaagtttgcagcctgaagatgcgatagagaagaaaaac acattttctggggagaaattcaagcaggctgcagaaatttgcataagtaatgaggagctg aatgttaatagcaaagacaatgcaggaaatgtctccagggcagagtccccactggagcac tgcctactggagctgagagaagagggccactgtcttccagaccccagagtggtagatcca ccaacagcttgtactgagcacctggaaaagccacagataatcaatgccagccgatga >gi568815593f:31432393_31652899|GENSCAN_predicted_peptide_4|168_aa MVETVFCTADTEHVTEMSKQEEEGVKKCKPGQQSETLSENRERERKKKKKKEKEKEKEKE KEKEEKEKEEEKEKKERERKEERKEGERKREREREKEGKKEKKERKGREEGRKKERKKER KKERKKERKKERKNRNTTLRLYDNMGSPCIITGDTVTGGDRHNGNIGE >gi568815593f:31432393_31652899|GENSCAN_predicted_CDS_4|507_bp atggtggaaacagtattttgcacagccgacacagagcatgtgaccgaaatgagcaaacag gaagaagagggtgtgaaaaaatgcaaacctgggcaacagagtgagaccctgtctgaaaac agagagagagagaggaagaagaagaagaagaaggagaaggagaaggagaaggagaaggag aaggagaaggaggagaaggagaaggaggaggagaaagagaagaaggagagagaaagaaag gaagaaaggaaggaaggagaaaggaagagagagagagaaagagagaaagaaggaaagaaa gaaaagaaagaaagaaaaggaagggaggaaggaaggaagaaagaaagaaagaaagaaaga aagaaagaaagaaagaaagaaagaaagaaagaaagaaagaacagaaacacaacacttcgt ctttatgataacatgggttccccgtgtattatcacaggggatacagtgactgggggtgat aggcacaatggaaacattggagaatga >gi568815593f:31432393_31652899|GENSCAN_predicted_peptide_5|572_aa MDKGTNKGYGDTQRLASKGSHDHMKGKGDSGEEIEFLESDVGETGVPGEKSHRHGENVQT PDRQWPQPGIHFSSYGHYNETTLNEITLFEDLLYSPHCIKNRKRSSECLSAQLVIYDAWI CASIDLNGLVIIKAEEQLKSRTSPPQPLRTIPPPNALGPLWSLVWWDSAVRSAQCYAEAT SRPLARSVPPRWRPKPRLYGDSEQPSPWMPSSHDPQVSEAGAPGLLAHPERRLARGGARS GKSPPTLLFLQGNPRPPSLSLGEGSSRPSNQPGGGNPRPSWRLGGSPAQRGEERTPAPVR AGPGQRDAAGRRLQAAEEPQAEPKAPGLRLPRLPANRGSAARGRRGGGTGGGRGGGSCAG TRRAAPGSGRARKPREPRPSREQVKRPSRCSRDARLREWRYSGTPASAQIYAETLIRDTG VGRGQGLAWAGRRDWNGRHEGIPELQTCPGTGRWAVLGDGAESESVPCSWPNLLPPPKTH PRSTREDAEVRVEFQGQEELCRQAHERVLRLNLMSLSFAKIQLPGARQEEAGQSPLSSMV WKGIMHSHSVTPAFEEAKAQKNRELAPLDNQN >gi568815593f:31432393_31652899|GENSCAN_predicted_CDS_5|1719_bp atggataaaggaaccaacaaaggttatggagacacccagaggctagcaagcaagggtagc catgaccacatgaagggcaaaggagacagcggggaggaaatagaattcctggagtctgat gtgggggaaaccggagtacccggagaaaagtcacacagacatggggagaacgtgcaaact ccagacagacagtggcctcaaccaggaatccatttttcttcttatggacattataatgaa acaacgttgaatgaaataacattatttgaagacctgctgtattcacctcactgtattaag aataggaagagaagctctgagtgcctgagtgcccagcttgtcatatacgatgcttggatt tgtgcaagcattgatctcaatggcctggtaattataaaagctgaagaacagctgaaatca agaacttctccaccccagcccctgaggacaataccacctccaaatgccttgggcccactc tggtctctggtgtggtgggacagcgctgtccggtctgcgcagtgctatgcagaggcaact tcacgtcccctagctcgctctgttccacctagatggcgccctaagccccgcctgtatgga gattcggagcagccatctccctggatgccttcctcccacgatccccaggtttctgaagcg ggcgcaccaggactcctggctcacccagagagacgccttgcccgcggcggggctcgctcg gggaaatccccgcccaccttgttattcctgcaggggaatccccgccccccgtccctgtca ctcggggaagggagttcccgcccctcgaatcaaccagggggagggaatccccgcccatcc tggaggctcggcggatcccctgcgcagcgaggcgaggagcggaccccagcgccggtgcgt gccggccccgggcagcgggacgcggcggggcggcggctgcaggcagccgaggagccgcag gccgaacccaaggcaccgggattgcgcctcccgcggctgccggcgaaccgcggctctgca gctcggggcaggcgcggcggcggcaccggtggtggccgcggtggcggcagctgcgcgggg acccgccgggcggcgcctgggtctggacgcgcgaggaagccgcgggagcctcggccaagc cgcgagcaggtgaagcgaccgtcccgctgcagccgggacgcgcggctccgggagtggagg tactcaggtactccggcctcagcccaaatctatgcagagaccttgataagggacactgga gtgggaagaggacaaggtctggcatgggctggtcgcagagactggaatggcaggcacgag ggcattccagagctccagacctgccctgggactggccgttgggcagtgttgggggatgga gctgagtctgagtctgtcccctgcagctggcctaacctcttgcctccaccaaagacacat ccccgcagcaccagggaggatgctgaggtcagagttgagtttcaaggacaggaagagcta tgcagacaagcccatgagagagtgctgcgtctaaatttaatgtccctctcttttgcaaag atccagttaccaggtgcaaggcaggaagaggctggacagtctcctctgtcctccatggtt tggaagggcatcatgcactcacactcggtaacacctgcatttgaggaagctaaggcccag aaaaacagagaacttgccccattagacaaccagaactag