GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:09:55 Sequence gi568815581r:47971438_48177004 : 205567 bp : 47.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1358 1537 180 0 0 83 78 42 0.786 2.54 1.02 Intr + 2082 2213 132 1 0 51 76 179 0.991 13.62 1.03 Intr + 2494 2618 125 2 2 83 33 91 0.593 3.40 1.04 Intr + 2963 3011 49 1 1 109 71 71 0.968 5.75 1.05 Intr + 3722 3900 179 0 2 37 101 349 0.999 30.74 1.06 Intr + 4077 4216 140 2 2 121 65 159 0.823 16.16 1.07 Intr + 4432 4576 145 1 1 -18 95 207 0.840 11.08 1.08 Intr + 5275 5385 111 0 0 111 100 78 0.999 11.88 1.09 Intr + 6395 6473 79 1 1 122 110 48 0.975 9.52 1.10 Intr + 7392 7480 89 1 2 106 69 64 0.969 5.99 1.11 Intr + 9156 9361 206 2 2 109 47 150 0.995 11.00 1.12 Intr + 9726 10022 297 0 0 61 77 388 0.553 30.99 1.13 Intr + 11952 12078 127 0 1 92 46 26 0.072 -0.52 1.14 Intr + 16257 16374 118 2 1 73 81 -4 0.027 -2.56 1.15 Term + 18361 18437 77 2 2 110 34 64 0.052 1.20 1.16 PlyA + 19988 19993 6 1.05 2.14 PlyA - 21366 21361 6 1.05 2.13 Term - 25103 24979 125 0 2 84 49 69 0.047 1.15 2.12 Intr - 25266 25153 114 1 0 96 47 30 0.030 0.12 2.11 Intr - 42666 42610 57 1 0 97 96 34 0.109 3.96 2.10 Intr - 45226 45137 90 2 0 60 80 45 0.065 0.87 2.09 Intr - 52299 52158 142 0 1 61 59 79 0.005 2.23 2.08 Intr - 57073 57035 39 1 0 136 96 55 0.930 9.52 2.07 Intr - 57739 57688 52 0 1 80 90 45 0.939 2.81 2.06 Intr - 60826 60719 108 0 0 28 76 214 0.513 13.60 2.05 Intr - 61304 61249 56 2 2 75 113 22 0.990 1.18 2.04 Intr - 61865 61774 92 0 2 78 53 137 0.988 8.91 2.03 Intr - 62507 62426 82 2 1 68 89 127 0.996 10.01 2.02 Intr - 65482 65414 69 1 0 62 86 116 0.770 8.18 2.01 Init - 66340 66230 111 1 0 63 63 103 0.969 3.59 2.00 Prom - 75219 75180 40 -5.16 3.00 Prom + 79639 79678 40 -6.36 3.01 Init + 79682 80191 510 1 0 55 88 423 0.799 31.93 3.02 Intr + 83579 83713 135 1 0 69 77 50 0.403 2.76 3.03 Intr + 84949 85161 213 0 0 70 81 227 0.904 19.11 3.04 Intr + 85595 85684 90 1 0 78 83 56 0.765 4.29 3.05 Intr + 85907 86065 159 1 0 77 91 37 0.893 3.08 3.06 Term + 86858 88204 1347 1 0 96 52 1887 0.987 177.39 3.07 PlyA + 90021 90026 6 1.05 4.09 PlyA - 90306 90301 6 1.05 4.08 Term - 100142 99998 145 1 1 112 42 165 0.999 11.68 4.07 Intr - 103663 103569 95 1 2 73 101 115 0.999 10.06 4.06 Intr - 104741 104564 178 1 1 72 94 173 0.961 16.22 4.05 Intr - 105604 105428 177 0 0 77 89 212 0.211 19.23 4.04 Intr - 136250 136007 244 0 1 79 50 171 0.756 8.96 4.03 Intr - 138674 138530 145 2 1 143 76 95 0.999 13.66 4.02 Intr - 140661 140564 98 1 2 117 96 36 0.998 7.03 4.01 Init - 140914 140902 13 1 1 58 105 12 0.958 0.34 4.00 Prom - 146652 146613 40 -5.26 5.00 Prom + 146773 146812 40 -8.26 5.01 Init + 147253 147362 110 0 2 35 87 126 0.940 6.89 5.02 Intr + 147537 147749 213 0 0 91 78 143 0.876 11.43 5.03 Intr + 153620 153647 28 2 1 24 77 46 0.182 -4.78 5.04 Term + 155973 156293 321 2 0 9 43 254 0.253 7.82 5.05 PlyA + 156766 156771 6 1.05 6.04 PlyA - 157313 157308 6 1.05 6.03 Term - 165900 165799 102 0 0 121 47 46 0.506 2.08 6.02 Intr - 191132 191032 101 0 2 62 116 116 0.724 11.63 6.01 Intr - 199222 199172 51 2 0 105 60 71 0.785 4.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 52874 52692 183 2 0 98 54 218 0.944 12.44 S.002 Term + 182054 182113 60 1 0 95 43 101 0.898 4.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_1|684_aa XSVLLFPILHVGKLRHRIVDLLKKLDNENPGPSDHKAQLLALTEEICMALKPQIWAQTLR DWLVDRRHCSLKWQSLVLTIREKINAAIQDMPESEEIAQLLSGSYIHYFHCLRILDLLKG TEASTKNIFGRYSSQRMKASVGQGPVDWQEIIALYEKDNTYLVELSSLLVRNVNYEIPSL KKQIAKCQQLQQEYSRKEEECQAGAAEMREQFYHSCKQYGITGENVRGELLALVKDLPSQ LAEIGAAAQQSLGEAIDVYQASVGFVCESPTEQVLPMLRFVQKRGNSTVYEWRTGTEPSV VERPHLEELPEQVAEDAIDWGDFGVEAVSEGTDSGISAEAAGIDWGIFPESDSKDPGGDG IDWGDDAVALQITVLEAGTQAPEGVARGPDALTLLEYTETRNQFLDELMELEIFLAQRAV ELSEEADVLSVSQFQLAPAILQGQTKEKMVTMVSVLEDLIGKLTSLQLQHLFMILASPRY VDRVTEFLQQKLKQSQLLALKKELMVQKQQEALEEQAALEPKLDLLLEKTKELQKLVRWE REACQWEDSQSVQNEALSTPASAHLVSLFQIEADISKRTFSLPFLYRRLLPESSPPGHAE GTEGIHPAGSPWQEQLCVFQHSPPSLSPKPRAWQSHLSNRSRHWLRLAVSERTSRPTGED GGIMAFSDIMDAWYFETTVLDHRM >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_1|2055_bp ntcagtgtcttattattccccattttacatgtgggaaaactgaggcacagaatagtagac ttgctcaagaaacttgataacgagaacccaggaccatctgaccacaaagctcagctctta gctctcactgaggaaatatgcatggctctgaagccacaaatctgggcccagactttgaga gattggctggtggacagaaggcactgcagcctgaaatggcagagtctggtgctgacgatc cgcgagaagatcaatgctgccatccaggacatgccagagagcgaagagatcgcccagctg ctgtctgggtcctacattcactactttcactgcctaagaatcctggaccttctcaaaggc acagaggcctccacgaagaatatttttggccgatactcttcacagcggatgaaggcaagt gtgggccaagggccggtagattggcaggagattatagctctgtatgagaaggacaacacc tacttagtggaactctctagcctcctggttcggaatgtcaactatgagatcccctcactg aagaagcagattgccaagtgccagcagctgcagcaagaatacagccgcaaggaggaggag tgccaggcaggggctgccgagatgcgggagcagttctaccactcctgcaagcagtatggc atcacgggcgaaaatgtccgaggagaactgctggccctggtgaaggacctgccgagtcag ctggctgagattggggcagcggctcagcagtccctgggggaagccattgacgtgtaccag gcgtctgtggggtttgtgtgtgagagccccacagagcaggtgttgccaatgctgcggttc gtgcagaagcggggaaactcaacggtgtacgagtggaggacagggacagagccctctgtg gtggaacgaccccacctcgaggagcttcctgagcaggtggcagaagatgcgattgactgg ggcgactttggggtagaggcagtgtctgaggggactgactctggcatctctgccgaggct gctggaatcgactggggcatcttcccggaatcagattcaaaggatcctggaggtgatggg atagactggggagacgatgctgttgctttgcagatcacagtgctggaagcaggaacccag gctccagaaggtgttgccaggggcccagatgccctgacactgcttgaatacactgagacc cggaatcagttccttgatgagctcatggagcttgagatcttcttagcccagagagcagtg gagttgagtgaggaggcagatgtcctgtctgtgagccagttccagctggctccagccatc ctgcagggccagaccaaagagaagatggttaccatggtgtcagtgctggaggatctgatt ggcaagcttaccagtcttcagctgcaacacctgtttatgatcctggcctcaccaaggtat gtggaccgagtgactgaattcctccagcaaaagctgaagcagtcccagctgctggctttg aagaaagagctgatggtgcagaagcagcaggaggcacttgaggagcaggcggctctggag cctaagctggacctgctactggagaagaccaaggagctgcagaagctggtgagatgggaa agggaggcctgccagtgggaggactcccagtctgtgcaaaatgaggccctgagcacccct gcttctgcccacttggtatcactctttcagattgaagctgacatctccaagagaaccttt agcctccctttcctttaccgccgcctgcttcctgagtccagccctcctggccatgctgaa ggcactgaagggatccacccggctggttctccctggcaagagcagctctgtgtctttcag cattcccctccctccctctcgcccaaaccacgagcatggcaaagccatttgtcaaaccgc tcccgccactggctgcgcttggctgtgtctgaaaggacaagccgccccacaggggaagat ggaggaatcatggccttctctgatatcatggatgcctggtattttgaaacaactgttctt gaccacaggatgtag >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_2|378_aa MQRPEAWPRPHPGEGAAAAQAGGPAPPARAGEPSGLREPSLYTIKAVFILDNDGRRLLAK YYDDTFPSMKEQMVFEKNVFNKTSRTESEIAFFGGMTIVYKNSIDLFLYVVGSSYENELM LMSVLTCLFESLNHMLREAQELVSTFRKNVEKRWLLENMDGAFLVLDEIVDGGVILESDP QQVIQKVNFRADDGGLTEQSVAQQNGHRGTSETQTPGARAQPDGTHRNTGPGALARNRTP ATSRANWGRRVKEAGVVIPTCEVIEDALGCWEQKQCGLNPEALSQPGPTGAVTLVELPSG AGVTVPKRRNEKWGSSVFHSLCPSTESGLPEEGKEDRASHPTVFSSRRKMGHAPQEPVIA NLALLLPDRAQPSPARSH >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_2|1137_bp atgcagcggcccgaggcctggccacgtccgcacccgggggagggggccgcggcggcccag gccgggggcccggcgccgcctgctcgagccggggagccctcggggctgcgggaaccttcc ctctacaccatcaaggctgttttcatcctagataatgacgggcgccggctgctggccaag tattatgatgacacattcccctccatgaaggagcagatggttttcgagaaaaatgtcttc aacaagaccagccggactgagagtgagattgcattttttgggggtatgaccatcgtctac aagaacagcattgacctcttcctatacgtggtgggctcatcctacgagaatgagctgatg ctcatgtctgttctcacctgcctgtttgagtctctgaaccacatgttaagggaggctcag gagcttgtgtccaccttcaggaagaacgtggagaagcgctggttgctggagaacatggac ggagccttcttggtgctggacgagattgtggatggcggtgtgattctggagagtgacccc cagcaagtgatccagaaggtgaattttagggcagatgatggcggcttgactgaacagagt gtggcccagcagaatgggcaccggggcacgtctgaaacccagaccccaggagccagggcc cagccagacggcacgcacagaaacaccggcccgggggctctggcacggaacaggacgcca gcaacaagcagagccaactgggggcgcagagtgaaggaggcaggcgttgtcattcccact tgtgaggtgattgaggatgctctcgggtgctgggagcagaagcaatgcggcctgaaccca gaggccctgagccagccagggccaacaggagctgtcactctagtggagctgccttcagga gctggggtcaccgtgccaaagaggaggaatgagaagtgggggtcatctgttttccatagc ctctgcccatctaccgagtctggtttgccagaagaggggaaggaggatagggcttctcac ccgactgtcttctcctccagaaggaaaatgggtcacgctccacaagagcctgttatcgcc aacctggccctgctgctgcctgatagagcccagccctcccccgccagatctcattaa >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_3|817_aa MLSLKKYLTEGLLQFTILLSLIGVRVDVDTYLTSQLPPLREIILGPSSAYTQTQFHNLRN TLDGYGIHPKSIDLDNYFTARRLLSQVRALDRFQVPTTEVNAWLVHRDPEGSVSGSQPNS GLALESSSGLQDVTGPDNGVRESETEQGFGEDLEDLGAVAPPVSGDLTKETMTRLDRSSH SGGGHAKGSRPLNSLLAGHRVAQPASAPGSRASVQDIDLIDILWRQDIDLGAGREVFDYS HRQKEQDVEKELRDGGEQDTWAGEGAEALARNLLVDGETGESFPAQVPSGEDQTALSLEE CLRLLEATCPFGENAEFPADISSITEAVPSESEPPALQNNLLSPLLTGTESPFDLEQQWQ DLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDF LLFSPEVESLPVASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELP DPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSDSGLSLDSSHSPSSLSSSEG SSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMS YQDPAQLSCLPYLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDE HRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSLIRDIRRRGKNKMAAQNCRKRK LDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSP SQYALQYAGDGSVLLIPRTMADQQARRQERKPKDRRK >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_3|2454_bp atgctttctctgaagaaatacttaacggaaggacttctccagttcaccattctgctgagt ttgattggggtacgggtggacgtggatacttacctgacctcacagcttcccccactccgg gagatcatcctggggcccagttctgcctatactcagacccagttccacaacctgaggaat accttggatggctatggtatccaccccaagagcatagacctggacaattacttcactgcc cggcggctcctcagtcaggtgagggccctggacaggttccaggtgccaaccactgaggta aatgcctggctggttcaccgagacccagaggggtctgtctctggcagtcagcccaactca ggcctcgccctcgagagttccagtggcctccaagatgtgacaggcccagacaacggggtg cgagaaagcgaaacggagcagggattcggtgaagatttggaggatttgggggctgtagcc cccccagtcagtggagacttaaccaaagagaccatgacccgcctggaccgcagctctcac agtggtgggggccatgccaaaggcagccggccccttaactctttgctggctggtcaccgg gtggcccagcccgcctctgctcctgggtccagggcttctgttcaggacatagatctgatt gacatcctttggcgacaggatattgatctgggggctgggcgtgaggtttttgactatagt caccgccagaaggagcaggatgtggagaaggagctgcgagatggaggcgagcaggacacc tgggcaggcgagggcgcggaagctctggcacggaacctgctagtggatggagagactggg gagagcttccctgcacaggtgcctagtggggaggaccagacggccctgtccctggaagag tgccttaggctgctggaagccacctgcccctttggggagaatgctgagtttccagcagac atttccagcataacagaagcagtgcctagtgagagtgagccccctgctcttcaaaacaac ctcttgtctcctcttctgaccgggacagagtcaccatttgatttggaacagcagtggcaa gatctcatgtccatcatggaaatgcaggccatggaagtgaacacatcagcaagtgaaatc ctgtacagtgcccctcctggagacccactgagcaccaactacagccttgcccccaacact cccatcaatcagaatgtcagcctgcatcaggcgtccctggggggctgcagccaggacttc ttactcttcagccccgaggtggaaagcctgcctgtggccagtagctccacgctgctcccg ttggcccccagcaattctaccagcctcaactccaccttcggctccaccaacctgacaggg ctcttctttccaccccagctcaatggcacagccaatgacacagcaggcccagagctgcct gaccctttggggggtctgttagatgaagctatgttggatgagatcagccttatggacctg gccattgaagaaggctttaaccctgtgcaggcctcccagctggaggaggaatttgactct gactcaggcctttccttagactcgagccatagcccttcttccctaagcagctctgaaggc agttcttcctcttcttcctcctcctcttcctcttcttcctctgcttcttcctctgcctct tcctccttttctgaggaaggtgcggttggctacagctctgactctgagaccctggatctg gaagaggccgagggtgctgtgggctaccagcctgagtattccaagttctgccgcatgagc taccaggatccagctcagctctcatgcctgccctacctggagcacgtgggccacaaccac acatacaacatggcacccagtgccctggactcagccgacctgccaccacccagtgccctc aagaaaggcagcaaggagaagcaggctgacttcctggacaagcagatgagccgggatgag caccgagcccgagccatgaagatccctttcaccaatgacaaaatcatcaacctgcctgtg gaggagttcaatgaactgctgtccaaataccagttgagtgaagcccagctgagcctcatc cgagacatccggcgccggggcaagaacaagatggcggcgcagaactgccgcaagcgcaag ctggacaccatcctgaatctggagcgtgatgtggaggacctgcagcgtgacaaagcccgg ctgctgcgggagaaagtggagttcctgcgctccctgcgacagatgaagcagaaggtccag agcctgtaccaggaggtgtttgggcggctgcgagatgagaacggacgaccctactcgccc agtcagtatgcgctccagtacgccggggacggcagtgtcctcctcatcccccgcacgatg gccgaccagcaggcccggcggcaggagaggaagccaaaggaccggagaaagtga >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_4|364_aa MNKRHHTLTSCSWFSDILHQKPIGRKHLMRDRGYRIKALDKLPAQEQRVKYSEVSLNKTN NGKDDDKEITVIHSVELIHLLCLEKGPGSEWDSRFRIARLRWPRPRTGVGIPPPLTWDAG ECTQVPGEGRRMDIQSGAAFRPLSGEANQRPGGVSDGQPVGPGSRGVSVTLYTRKLAGTM GKKQNKKKVEEVLEEEEEEYVVEKVLDRRVVKGKVEYLLKWKGFSDEDNTWEPEENLDCP DLIAEFLQSQKTAHETDKSEGGKRKADSDSEDKGEESKPKKKKEESEKPRGFARGLEPER IIGATDSSGELMFLMKWKNSDEADLVPAKEANVKCPQVVISFYEERLTWHSYPSEDDDKK DDKN >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_4|1095_bp atgaacaagagacaccatactcttacctcctgttcttggttctccgacatcctacaccaa aagcccattggaaggaaacatctgatgagggacagaggatacaggatcaaggcattggat aaactgccagcacaggagcagagagttaaatattctgaagtgtctttaaacaaaactaac aacgggaaagacgatgacaaggaaatcacagtcatccactcagtggagctcatacatctt ttgtgccttgaaaaaggcccgggctctgagtgggattcccgcttccgaatagcgcggctc cgctggccaaggccccggactggagtcgggatcccccctccactcacttgggacgccgga gagtgcacgcaggtgccgggtgagggacggcgaatggatatccaatccggggccgcgttc cgcccactgtcaggggaagccaatcagcggccagggggcgtcagcgacgggcagccagtg ggtcccgggagcagaggggtcagcgtcaccctttacaccagaaagctggcgggcactatg gggaaaaaacaaaacaagaagaaagtggaggaggtgctagaagaggaggaagaggaatat gtggtggaaaaagttctcgaccgtcgagtggtaaagggcaaagtggagtacctcctaaag tggaagggattctcagatgaggacaacacatgggagccagaagagaacctggattgcccc gacctcattgctgagtttctgcagtcacagaaaacagcacatgagacagataaatcagag ggaggcaagcgcaaagctgattctgattctgaagataagggagaggagagcaaaccaaag aagaagaaagaagagtcagaaaagccacgaggctttgctcgaggtttggagccggagcgg attattggagctacagactccagtggagagctcatgttcctgatgaaatggaaaaactct gatgaggctgacctggtccctgccaaggaagccaatgtcaagtgcccacaggttgtcata tccttctatgaggaaaggctgacgtggcattcctacccctcggaggatgatgacaaaaaa gatgacaagaactaa >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_5|223_aa MWLHRPVPELPGKSTFFGTSDEFIEKRRQGLQHFLEKVLQSVVLLSDSQLHLFLQSQLSV PEIEACVQGRSTMTVSDAILRYAMSNCGWAQEERQSSSHLAKGDQPKRPYIHTTESKSEV ICSKFWYPKCGDVEEMNFNLQMLRASLEVEQFMTRPEGWWCAYATLVTHSTAMRACLMVW KREMSVACFLFHDGMWRIKGDQLLTIRLQVSTVEPDVLFEDQH >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_5|672_bp atgtggctgcacaggcctgttcctgaacttcctgggaagtcaaccttcttcggcacctca gatgagttcattgagaagcgacgacaaggtctgcagcacttccttgaaaaggtcctgcag agtgtggttctcctgtcagacagccagttgcacctattcctgcaaagccagctctcggtg cctgagatagaagcctgtgtccagggccgaagtaccatgactgtgtctgatgccattctt cgatatgctatgtcaaactgtggctgggcccaggaagagaggcagagctcttctcacctg gctaaaggagaccagcctaagaggccttacatccatactaccgagtccaagagtgaagta atctgctccaagttctggtaccctaaatgtggagatgttgaagaaatgaacttcaatctc cagatgctgcgtgcttctctggaggtggaacagttcatgacacgtccagagggctggtgg tgtgcatatgccaccttggtgactcacagcacagcaatgcgggcatgtctgatggtgtgg aaaagggaaatgtctgtagcctgcttcctgttccatgatgggatgtggaggataaaaggt gaccagctgctcaccatcaggctgcaagtgtctacagtggaaccagatgtcctctttgag gaccagcactag >gi568815581r:47971438_48177004|GENSCAN_predicted_peptide_6|84_aa XEEHDLEEDESGTRRKGVDYASYYQGLWDCHGDQPDELSFQRGDLIRILSKEYNMYGWWV GELNSLVGIVPKEYLTTAFEVEER >gi568815581r:47971438_48177004|GENSCAN_predicted_CDS_6|255_bp natgaagagcatgatctagaagaggatgagagtggcactcgacgaaaaggagtagactat gccagttactaccagggcctatgggattgccatggtgaccagccagatgaactgtccttc caacggggtgacctcatccgtattctgagcaaggagtataacatgtatggctggtgggtg ggagaactgaacagcctcgttgggattgttccaaaggagtatctcaccactgcctttgaa gtggaagaaagatga