GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:23:56 Sequence gi568815597r:33047180_33281432 : 234253 bp : 47.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3991 4234 244 0 1 72 53 134 0.097 6.24 1.02 Intr + 11158 11228 71 2 2 64 35 62 0.071 -2.60 1.03 Intr + 17247 17390 144 2 0 96 91 65 0.237 8.08 1.04 Intr + 23469 23837 369 2 0 -118 78 351 0.004 8.80 1.05 Intr + 34999 35175 177 0 0 116 89 190 0.924 22.02 1.06 Intr + 36775 36948 174 0 0 84 105 100 0.912 11.44 1.07 Intr + 44871 45043 173 2 2 66 82 203 0.426 16.24 1.08 Intr + 47369 47534 166 2 1 82 76 169 0.728 15.06 1.09 Intr + 49528 49690 163 0 1 92 73 210 0.995 19.45 1.10 Intr + 50888 51000 113 0 2 95 105 200 0.986 22.50 1.11 Term + 52356 52478 123 2 0 79 39 46 0.401 -2.82 1.12 PlyA + 54383 54388 6 1.05 2.00 Prom + 58880 58919 40 -6.16 2.01 Init + 59472 59651 180 2 0 87 77 108 0.848 8.78 2.02 Intr + 70723 70937 215 0 2 75 99 223 0.997 19.61 2.03 Intr + 72865 72999 135 1 0 100 84 165 0.996 16.98 2.04 Intr + 73238 73259 22 2 1 99 64 7 0.878 -3.05 2.05 Term + 73764 73892 129 2 0 118 53 135 0.987 11.18 2.06 PlyA + 74400 74405 6 1.05 3.03 PlyA - 76195 76190 6 1.05 3.02 Term - 82644 82575 70 2 1 95 36 84 0.619 1.31 3.01 Init - 86424 86279 146 0 2 78 115 73 0.515 6.87 3.00 Prom - 91926 91887 40 -7.56 4.07 PlyA - 93158 93153 6 -0.45 4.06 Term - 100548 99998 551 1 2 107 37 1069 0.998 98.16 4.05 Intr - 111189 111074 116 2 2 106 110 152 0.999 19.29 4.04 Intr - 112765 112509 257 1 2 93 115 678 0.891 67.24 4.03 Intr - 118387 118292 96 1 0 93 105 191 0.753 21.51 4.02 Intr - 134337 133846 492 0 0 49 85 897 0.360 78.70 4.01 Init - 135003 134815 189 0 0 63 74 162 0.622 9.31 4.00 Prom - 160256 160217 40 -2.46 5.02 PlyA - 161002 160997 6 1.05 5.01 Sngl - 162994 162116 879 1 0 45 42 284 0.613 15.51 5.00 Prom - 169550 169511 40 -4.46 6.00 Prom + 188923 188962 40 -4.46 6.01 Init + 192385 192438 54 0 0 98 89 67 0.907 9.11 6.02 Intr + 203581 203771 191 0 2 -103 71 359 0.477 13.48 6.03 Intr + 209111 209475 365 2 2 82 69 261 0.732 18.43 6.04 Intr + 214403 214529 127 0 1 62 47 57 0.033 -1.26 6.05 Intr + 228921 228984 64 0 1 109 110 29 0.604 5.92 6.06 Intr + 229169 229415 247 1 1 140 96 441 0.998 47.13 6.07 Intr + 232945 233278 334 2 1 112 75 285 0.957 24.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_1|638_aa MKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPPRRRAPPGRQRLEERTGPAGPEGKEQP PALASQSAEIAASARPPPRLGRLLGFQKACRCWSLNPHILMALLRSLVPPDKKHPQVWRG RPPLHLAPNVGLFSRVKVRSSVVIEDKSMRDSRRGLSQRRRRRKKKKRGSSSKKKKRRKK RKKKKRKKRKRRKNRKKKKKRKNKRKKKRKKEEKKEEEEERRKKEEEDEEGRGRGRRKRK RKKRKKRRSRKKKETAAAAAAGERLGKWWPGECPVECVAYFLRRRLQQRLHPARQLLLQG MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ IKYAAKHGIQLLSFDNEMELAKVVKSHPSANFHIGSGCPDPQAYAQSIADARLVFEMGTE LGHKMHVLDLGGGFPGTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAF TVAVSIIAKKEVLLDQPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKYS SSCCMLALTHSTPFIGSSEEEMMVPAHCHAPHRDLCFG >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_1|1917_bp atgaagcggagccgctgccgtgaccgaccgcagccgccgccgcccgaccgccgggaggat ggagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggccgccaccccgtctg ggaaggcttctgggattccagaaagcctgcaggtgttggagcctcaaccctcatatcctc atggccctgctgaggtctcttgtcccacctgacaagaaacacccacaggtgtggaggggc aggccaccccttcatctggcgcccaacgtggggcttttctctagggtgaaggtacgctcg agcgtggtcattgaggacaagtcaatgagagattcccgaagaggcctatctcaaagaaga agaaggagaaagaagaagaagagaggtagcagcagcaagaagaagaagaggaggaagaag agaaagaagaagaagaggaagaagaggaagaggaggaagaataggaagaagaagaagaag aggaagaacaagaggaagaagaagaggaagaaggaagaaaagaaagaagaggaagaagaa agaagaaagaaggaggaggaagatgaagaaggaagaggaagaggaagaaggaagaggaag aggaagaagaggaagaaaagaagaagcagaaagaagaaagaaacagcagcagcagcagca gctggagaaaggctgggaaaatggtggccaggagagtgtccggtagagtgtgttgcatac tttctaaggcggcggctgcagcagcggctccatccagcccgtcagctcctcctgcaaggc atggctggctacctgagtgaatcggactttgtgatggtggaggagggcttcagtacccga gacctgctgaaggaactcactctgggggcctcacaggccaccacggacgaggtagctgcc ttcttcgtggctgacctgggtgccatagtgaggaagcacttttgctttctgaagtgcctg ccacgagtccggcccttttatgctgtcaagtgcaacagcagcccaggtgtgctgaaggtt ctggcccagctggggctgggctttagctgtgccaacaaggcagagatggagttggtccag catattggaatccctgccagtaagatcatctgcgccaacccctgtaagcaaattgcacag atcaaatatgctgccaagcatgggatccagctgctgagctttgacaatgagatggagctg gcaaaggtggtaaagagccaccccagtgccaattttcacattggcagtggctgtcctgac cctcaggcctatgctcagtccatcgcagacgcccggctcgtgtttgaaatgggcaccgag ctgggtcacaagatgcacgttctggaccttggtggtggcttccctggcacagaaggggcc aaagtgagatttgaagagattgcttccgtgatcaactcagccttggacctgtacttccca gagggctgtggcgtggacatctttgctgagctggggcgctactacgtgacctcggccttc actgtggcagtcagcatcattgccaagaaggaggttctgctagaccagcctggcagggag gaggaaaatggttccacctccaagaccatcgtgtaccaccttgatgagggcgtgtatggg atcttcaactcagtcctgtttgacaacatctgccctacccccatcctgcagaagtactcg tcttcctgttgcatgctggctctcacccactccactcccttcattggctcctcagaagag gagatgatggtcccagcccactgccacgccccccaccgggatctttgctttggctga >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_2|226_aa MKDKNHMIISVDGEQAFDKIQHPFMIKTFNKLGMEGMYLNMIRAIYDKLTANVILNKDPK KPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQA CHITYAMSRVAWEALRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIIARG LVSQVGKAVRVKQTPGGRFCFFGQCLLEQGWPRSIAVSSADSPGIP >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_2|681_bp atgaaggataaaaatcatatgatcatctcagtagatggagaacaagcatttgacaaaatt cagcatcctttcatgataaaaactttcaacaaattaggtatggaaggaatgtacctcaac atgataagggccatatatgacaagctcacagctaatgttatactcaacaaggatcctaag aaaccatccacggagcagcccctgtacagcagcagcctgtggggcccggcggttgatggc tgtgattgcgtggctgagggcctgtggctgccgcaactacacgtaggggactggctggtc tttgacaacatgggcgcctacactgtgggcatgggttcccccttttgggggacccaggcc tgccacatcacctatgccatgtcccgggtggcctgggaagcgctgcgaaggcagctgatg gctgcagaacaggaggatgacgtggagggtgtgtgcaagcctctgtcctgcggctgggag atcacagacaccctgtgcgtgggccctgtcttcaccccagcgagcatcattgcaaggggc ctggtcagccaggttggcaaggcagtcagagtaaagcagacacctggtggtcgcttttgc ttctttgggcagtgcctgttagaacagggctggccacggagtattgctgtgtccagtgcc gacagccctggcatcccctga >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_3|71_aa MGGLQAAVITIPALLSSSTLLVISCINGKRAAGGPRSPAPAHHPHEGPRNSDDLLILRNI TLYCPKDHQNG >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_3|216_bp atgggtggcttgcaagctgcagtgatcaccatcccggccctgctctcctcatccactctg ctggtcatcagctgtataaatggaaaacgtgccgccggtggtcccaggtcacctgcccca gcccaccatccccatgaggggcccagaaattctgatgacttgctgatacttcggaacatc accctttattgtccaaaagatcaccagaatgggtaa >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_4|566_aa MVGVGRGQPAGGPAPPPPAAGAELLGSRPRGRGGSRGHRQLGFGWFRVERLRWTEAVAAK LAGPSVLPPTAPRSLSRPPAPRAPLSAAPGAMACSLKDELLCSICLSIYQDPVSLGCEHY FCRRCITEHWVRQEAQGARDCPECRRTFAEPALAPSLKLANIVERYSSFPLDAILNARRA ARPCQAHDKVKLFCLTDRALLCFFCDEPALHEQHQVTGIDDAFDELQRELKDQLQALQDS EREHTEALQLLKRQLAETKSSTKSLRTTIGEAFERLHRLLRERQKAMLEELEADTARTLT DIEQKVQRYSQQLRKVQEGAQILQERLAETDRHTFLAGVASLSERLKGKIHETNLTYEDF PTSKYTGPLQYTIWKSLFQDIHPVPAALTLDPGTAHQRLILSDDCTIVAYGNLHPQPLQD SPKRFDVEVSVLGSEAFSSGVHYWEVVVAEKTQWVIGLAHEAASRKGSIQIQPSRGFYCI VMHDGNQYSACTEPWTRLNVRDKLDKVGVFLDYDQGLLIFYNADDMSWLYTFREKFPGKL CSYFSPGQSHANGKNVQPLRINTVRI >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_4|1701_bp atggtgggggtggggcggggtcaaccggctggtggccccgcccctcccccgcccgctgcg ggggcggagttgcttgggtcccgcccacgggggcggggaggcagccgcggccaccggcag ctcggattcggctggttccgggttgagaggctgcgctggaccgaagcggtggctgctaag ctcgcggggccctcggtgctgcctccgacagcgccgcgctctctcagccgcccccctgcc cctcgggcccccctctctgctgcccctggcgccatggcgtgcagcctcaaggacgagctg ctgtgctccatctgcctgagcatctaccaggacccggtgagcctgggctgcgagcattac ttctgccgccgctgcatcacggagcactgggtgcggcaggaggcgcagggcgcccgcgac tgccccgagtgccggcgcacgttcgccgagcccgcgctggcgcccagcctcaagctggcc aacatcgtggagcgctacagctccttcccgctggacgccatcctcaacgcgcgccgcgcc gcgcgaccctgccaggcgcacgacaaggtcaagctcttctgcctcacggaccgcgcgctt ctctgcttcttctgcgacgagcctgcactgcacgagcagcatcaggtcaccggcatcgac gacgccttcgacgagctgcagagggagctgaaggaccaacttcaggcccttcaagacagc gagcgggaacacaccgaagcgctgcagctgctcaagcgacaactggcggagaccaagtct tccaccaagagcctgcggaccactatcggcgaggccttcgagcggctgcaccggctgctg cgtgaacgccagaaggccatgctagaggagctggaggcggacacggcccgcacgctgacc gacatcgagcagaaagtccagcgctacagccagcagctgcgcaaggtccaggagggagcc cagatcctgcaggagcggctggctgaaaccgaccggcacaccttcctggctggggtggcc tcactgtccgagcggctcaagggaaaaatccatgagaccaacctcacatatgaagacttc ccgacctccaagtacacaggccccctgcagtacaccatctggaagtccctgttccaggac atccacccagtgccagccgccctaaccctggacccgggcacagcccaccagcgcctgatc ctgtcggacgactgcaccattgtggcttacggcaacttgcacccacagccactgcaggac tcgccaaagcgcttcgatgtggaggtgtcggtgctgggttctgaagccttcagtagtggc gtccactactgggaggtggtggtggcggagaagacccagtgggtgatcgggctggcacac gaagccgcaagccgcaagggcagcatccagatccagcccagccgcggcttctactgcatc gtgatgcacgatggcaaccagtacagcgcctgcacggagccctggacgcggcttaacgtc cgggacaagcttgacaaggtgggtgtcttcctggactatgaccaaggcttgctcatcttc tacaatgctgatgacatgtcctggctctacaccttccgcgagaagttccctggcaagctc tgctcttacttcagccctggccagagccacgccaatggcaagaacgttcagccgctgcgg atcaacaccgtccgcatctag >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_5|292_aa MKAEIKMFFETNKNKDTTYQNLWGTFKAVCRGNFIALNTHKRKQERSKIDTLTSQLKQLE KEEQTNSKASRRQQITKIRAELKEIETKQTLQKKINESRSWFFEKINKIHRPLARLIKKK REKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLSRLNQE EVESPNRPITGSEIEAIINSLPTKKSPGPDGVTAEFYQRYKEELVSFLLKLFQSIEKERI LPNSFYEASIILIPKPGRDTTTKKENFRPISLMNINAKILNKILANRIQQHI >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_5|879_bp atgaaggcagaaataaagatgttctttgaaaccaataagaacaaagacacaacataccag aatctctggggcacatttaaagcagtgtgtagagggaactttatagcactaaatacccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaacaactagag aaggaagagcaaacaaattcaaaagctagcagaaggcaacaaataactaagatcagagca gaactgaaggaaatagagacaaaacaaacccttcaaaaaaaaatcaatgaatccaggagc tggttttttgaaaagatcaacaaaattcatagaccactagcaagactaataaagaagaaa agagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccgatccc acagaaatacaaactaccatcagggaatattataaacacctctatgcaaataaactagaa aacctagaagaaatggataaattcctggacacatacaccctctcaagactaaaccaggaa gaagttgaatctccaaatagaccaataacaggctctgaaattgaggcaataattaatagc ttaccaaccaaaaaaagtccaggaccagacggagtcacagccgaattctaccagaggtac aaagaagagctggtatcattccttctgaaattattccaatcaatagaaaaagagagaatc ctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcagagacaca acaacaaaaaaagagaattttaggccaatatccctgatgaacatcaatgcaaaaatcctt aataaaatactggcaaaccgaatccagcagcacatctaa >gi568815597r:33047180_33281432|GENSCAN_predicted_peptide_6|461_aa MAEGKGEAGACFTGAQDRKKEEGRRKKEKEEEKEKKKKRRRKEEERRKKEEEEEEEEEEE EEEGEEEGAWLMLESVFSSDRSRQRPALPTRGSPRGRTGCAAGEAGARRGPPGAGRLARG LAERTGGRAALQTQPPAAAAAAASALAEPRSEQQATRKVCARLRSEPEPEPEPGACAGAP ARPCSGRRARGCGRVPEAPPAQPAWCSALIQGNWSGLVYSGKGDTVISPIKSVSDDIYVC NSWRIMMAEPRFNNPYFWPPPPTMPSQLDNLVLINKIKEQLMAEKIRPPHLPPTSASSQQ PLLVPPAPAESSQAVMSLPKLQQVPGLHPQAVPQPDVALHARPATSTVTGLGLSTRTPSV STSESSAGAGTGTGTSTPSTPTTTSQSRLIASSPTLISGITSPPLLDSIKTIQGHGLLGP PKSERGRKKIKAENPGGPPVLVVPYPILASGETAKEGKTYS >gi568815597r:33047180_33281432|GENSCAN_predicted_CDS_6|1383_bp atggcggaaggcaaaggagaagcaggcgcctgcttcacgggggcgcaggacagaaagaaa gaagaaggaagaagaaagaaggagaaggaggaggagaaggagaagaagaagaagagaaga agaaaagaagaagaaagaagaaagaaggaagaagaagaagaagaagaagaagaagaagaa gaagaagaaggagaagaagaaggtgcatggttgatgcttgagagcgtttttagttctgat cggagtcgccagcgcccggcgctgcccactcgcgggagcccccggggccggacgggctgc gcagccggagaggcgggggcccgacgcggcccccccggagccgggcgcctggcgcggggg ctcgccgagcgcactgggggccgcgcggcgctgcagacccagcctcccgccgccgccgcc gccgccgcctcggcgcttgcagaacccagaagtgaacagcaggcgacccggaaggtttgc gcgcggctccgcagcgagccagagccggagcccgagcccggagcctgtgccggagcccca gcccggccctgctcgggccgccgggcgcggggctgcggccgcgtcccggaggcgccgcca gcacagccagcctggtgctcggcactcatccagggtaattggtcaggcttggtgtactca ggaaaaggagacaccgttatctctccaataaagtctgtatctgatgacatttatgtatgc aattcttggagaatcatgatggccgagcctcgatttaacaacccctacttctggccccct cctcccaccatgcccagccagctggacaacctggttctgattaacaagatcaaggagcag ctgatggccgagaagatcaggccgcctcacctgccgcccacgtcggcctcgtcgcagcag ccgttgctagtgccgccggcacccgccgagagcagccaggccgtcatgtcgctgcccaag ctgcagcaggtgccggggctgcatccacaggcggtgccgcagcccgacgtggcgctgcac gcacggccggccaccagcaccgtcacaggtctggggctgtccacccggaccccgtctgtg agcacttctgagtcaagcgcgggcgcgggcacgggcacgggtaccagcaccccgtccaca cccaccaccaccagccagagccgcctcatcgcctcgtcccccaccctcatctcagggatc accagcccccctctcctggactccatcaagacaatccagggccacggcctgcttggcccc cccaagtccgaacgcggccgcaaaaagatcaaggcggagaacccggggggtccgcctgtc cttgtagtcccctatcccatcctggcctcgggcgagactgccaaggagggcaagacgtac agn