GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:17:54 Sequence gi568815578f:38246517_38476666 : 230150 bp : 47.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1078 1269 192 0 0 60 90 179 0.569 14.27 1.02 Term + 10570 10716 147 0 0 137 50 49 0.910 3.90 1.03 PlyA + 11064 11069 6 -0.45 2.00 Prom + 12272 12311 40 -4.66 2.01 Init + 13983 14050 68 2 2 86 33 170 0.973 9.84 2.02 Intr + 14590 14717 128 1 2 65 57 179 0.751 12.82 2.03 Intr + 26040 26095 56 1 2 25 84 65 0.126 -1.50 2.04 Intr + 31669 31745 77 0 2 126 72 31 0.653 3.61 2.05 Term + 31976 32105 130 2 1 95 42 56 0.542 -0.65 2.06 PlyA + 32189 32194 6 1.05 3.06 PlyA - 33124 33119 6 1.05 3.05 Term - 42271 42161 111 1 0 77 43 120 0.693 4.96 3.04 Intr - 46067 45808 260 0 2 65 95 123 0.882 7.88 3.03 Intr - 47852 47765 88 2 1 60 93 62 0.907 3.44 3.02 Intr - 49124 48880 245 0 2 94 73 91 0.440 5.42 3.01 Init - 53338 53323 16 1 1 103 64 28 0.395 0.38 3.00 Prom - 54119 54080 40 -7.36 4.00 Prom + 56035 56074 40 -8.66 4.01 Init + 57708 57837 130 2 1 98 45 201 0.964 15.17 4.02 Intr + 61051 61165 115 2 1 131 94 108 0.997 15.21 4.03 Intr + 62414 62542 129 2 0 105 32 175 0.996 13.41 4.04 Intr + 63975 64136 162 0 0 105 39 174 0.995 13.29 4.05 Intr + 65358 65421 64 0 1 113 91 59 0.916 7.42 4.06 Term + 65741 65749 9 1 0 106 38 0 0.307 -5.01 4.07 PlyA + 66868 66873 6 -0.45 5.02 PlyA - 67079 67074 6 -3.24 5.01 Sngl - 68428 67541 888 1 0 56 37 1867 0.640 174.49 5.00 Prom - 70059 70020 40 -7.66 6.00 Prom + 71261 71300 40 -6.96 6.01 Init + 71472 71489 18 2 0 44 110 5 0.725 -1.44 6.02 Intr + 71897 71960 64 1 1 117 94 84 0.829 10.19 6.03 Intr + 73667 73758 92 0 2 89 100 89 0.890 9.91 6.04 Intr + 77354 77530 177 1 0 93 91 127 0.992 13.62 6.05 Intr + 78222 78317 96 2 0 94 64 28 0.527 1.21 6.06 Intr + 79824 79916 93 2 0 61 103 107 0.478 9.66 6.07 Intr + 81072 81139 68 2 2 69 97 106 0.955 7.30 6.08 Intr + 84532 84574 43 1 1 111 117 23 0.619 5.74 6.09 Intr + 87914 87977 64 1 1 93 103 27 0.276 2.99 6.10 Intr + 89082 89158 77 1 2 83 109 -6 0.108 0.23 6.11 Term + 90630 90668 39 2 0 108 49 41 0.286 -0.61 6.12 PlyA + 93877 93882 6 1.05 7.00 Prom + 94578 94617 40 -5.16 7.01 Init + 100001 100124 124 1 1 60 95 157 0.812 11.95 7.02 Intr + 103032 103146 115 1 1 86 75 95 0.999 7.51 7.03 Intr + 104295 104423 129 0 0 122 89 95 0.964 12.81 7.04 Intr + 107768 107923 156 2 0 74 61 206 0.946 15.63 7.05 Intr + 108830 108893 64 2 1 86 90 98 0.969 8.52 7.06 Intr + 114188 114251 64 1 1 123 91 33 0.968 5.39 7.07 Intr + 117459 117550 92 1 2 111 100 133 0.947 16.51 7.08 Intr + 118060 118236 177 0 0 79 77 164 0.934 14.52 7.09 Intr + 120199 120312 114 0 0 77 90 73 0.663 7.04 7.10 Intr + 122479 122646 168 0 0 82 79 130 0.992 11.64 7.11 Intr + 124222 124289 68 0 2 99 106 56 0.999 6.20 7.12 Intr + 124764 124806 43 0 1 95 101 39 0.971 4.14 7.13 Intr + 126556 126619 64 0 1 132 95 94 0.990 12.79 7.14 Intr + 127421 127497 77 0 2 62 71 93 0.833 4.23 7.15 Intr + 130561 130683 123 0 0 103 7 73 0.107 1.48 7.16 Intr + 134881 134954 74 0 2 13 110 68 0.027 -0.30 7.17 Intr + 142363 142410 48 1 0 101 75 42 0.147 2.00 7.18 Term + 146151 146241 91 0 1 125 42 65 0.654 2.79 7.19 PlyA + 146306 146311 6 1.05 8.10 PlyA - 148965 148960 6 1.05 8.09 Term - 156222 156089 134 1 2 113 54 164 0.988 13.75 8.08 Intr - 174874 174836 39 2 0 99 75 45 0.252 2.50 8.07 Intr - 175852 175775 78 2 0 126 81 22 0.756 4.82 8.06 Intr - 184596 184525 72 1 0 84 111 16 0.790 2.88 8.05 Intr - 186008 185974 35 1 2 73 65 50 0.742 -0.93 8.04 Intr - 188110 187988 123 0 0 98 106 63 0.762 8.80 8.03 Intr - 188810 188681 130 0 1 28 100 63 0.781 1.25 8.02 Intr - 189198 189016 183 1 0 36 2 144 0.367 0.36 8.01 Init - 194318 194237 82 2 1 64 61 101 0.605 6.14 8.00 Prom - 217724 217685 40 -4.16 9.00 Prom + 217929 217968 40 -5.46 9.01 Init + 222453 222462 10 2 1 67 78 0 0.249 -2.52 9.02 Term + 226283 226485 203 0 2 16 54 255 0.800 12.45 9.03 PlyA + 229017 229022 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_1|112_aa MTSNDALILISGTCERAVTRQRGVKATNGIKFANQLTLKQGDHPGLDKWAQCNHQVEEDA EEGRDTCASRQKDGPPPPIRHSLELSVDEPAFSMDCQPPKQEPRPYRGPGTQ >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_1|339_bp atgacctccaatgacgccctcatcctaatctctggaacctgtgaacgtgctgttacaaga caaaggggagttaaggctaccaatggaattaagtttgctaatcagctgactttaaaacaa ggagaccatcctggattagacaaatgggcccagtgtaatcaccaggtggaagaggatgca gaagagggaagggacacatgtgcttccaggcagaaggatggacccccaccacccatcaga cacagcctagaattatctgtggatgaacctgccttctccatggactgccagcccccaaag caggaaccacgtccttatagagggcctggcacacagtag >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_2|152_aa MVTGCQRRAARLLSWARGLWVRGESQTLLSFDFFICKMGWEEDDDDDDDDDDDDDSACCL DLVMRSGEELRLLLANGRHQLASHGRPLSLYGPTSELWGYPRTPQASSLRESSKTVLQEP TRAAFTWFLIKQSLGSAEWEWVFMDLAVETSE >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_2|459_bp atggtgacgggctgccagcggcgggcggcgcggctcctctcctgggcgcggggtctgtgg gtccgcggggagtcacagactctcttaagcttcgatttcttcatctgtaaaatggggtgg gaggaggatgatgatgatgatgatgatgatgatgatgatgatgatagtgcctgctgctta gatttggtgatgaggagtggcgaggaactgaggcttcttcttgccaatggccggcaccaa cttgccagccatggacggcccctttccctctatggccccacttcagagctctggggctac ccaaggacaccccaggcctcttccttacgggagtcctccaagactgtcctccaggagcct acacgtgctgctttcacctggtttcttataaagcagagtctcgggtctgcggagtgggaa tgggtcttcatggatctagctgttgaaacatcagaatga >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_3|239_aa MVGGRGAQAPSWSPTPAKEISLKPTPAPSPLCFTSPRGKAHFLNRLTGPSTSHPIGSRTA TFKYNHPGAYDHADFQKQAPRPDPEYRKPGPRSVTIKRPLSQRKMTKLNGCLVENPAHYW ILTWAFHGLSKIFTHRMPQRALPATKCLLCTLPCAASSPRSEQSLLVPRSCCQSNWVQDN HCTTKPLLMNSYCFPGQGWCFPQGCPEEALHLDISHRHGDTQRSIVHLERNQIAINNRT >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_3|720_bp atggtgggaggccgaggtgcccaggctccaagctggtcacctacacctgccaaagagatc tctctgaagccgacaccagcaccatcccccctctgcttcacatcaccaaggggcaaggcc cacttccttaacaggctcacagggccctccacatcccaccccattggttctcgcactgcc actttcaagtacaaccacccaggagcttatgaccacgcagatttccaaaagcaagcccca cgccctgatcctgagtacaggaaacctgggcctaggtctgtcaccatcaaacggccactg tcccagaggaagatgactaagctaaatggatgcctggtggaaaacccagcacattactgg attctgacatgggccttccatgggctcagcaagatattcacccataggatgccccagagg gccctgccagccaccaagtgcctactgtgcaccttgccctgtgctgcaagctccccaagg agtgaacagtcgctcctcgtgccccgaagctgctgccaatccaactgggtacaagacaac cactgtacaacaaagccactcttgatgaactcctactgttttccaggccagggctggtgc tttccacagggatgcccggaagaagccttgcatctggatatcagccacagacacggagat actcaaagaagcattgtgcacctggagagaaaccagatcgccattaacaatcgaacatag >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_4|202_aa MARGPCNAPRWASLMVLVAIGTAVTAAVNPGVVVRISQKGLDYASQQGTAALQKELKRIK IPDYSDSFKIKHLGKGHYSFYSMDIREFQLPSSQISMVPNVGLKFSISNANIKISGKWKA QKRFLKMSGNFDLSIEGMSISADLKLGSNPTSGKPTITCSSCSSHINSVHVHISKSKVGW LIQLFHKKIESALRNKMNSQPC >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_4|609_bp atggccaggggcccttgcaacgcgccgagatgggcgtccctgatggtgctggtcgccata ggcaccgccgtgacagcggccgtcaaccctggcgtcgtggtcaggatctcccagaagggc ctggactacgccagccagcaggggacggccgctctgcagaaggagctgaagaggatcaag attcctgactactcagacagctttaagatcaagcatcttgggaaggggcattatagcttc tacagcatggacatccgtgaattccagcttcccagttcccagataagcatggtgcccaat gtgggccttaagttctccatcagcaacgccaatatcaagatcagcgggaaatggaaggca caaaagagattcttaaaaatgagcggcaattttgacctgagcatagaaggcatgtccatt tcggctgatctgaagctgggcagtaaccccacgtcaggcaagcccaccatcacctgctcc agctgcagcagccacatcaacagtgtccacgtgcacatctcaaagagcaaagtggggtgg ctgatccaactcttccacaaaaaaattgagtctgcgcttcgaaacaagatgaacagccag ccctgctag >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_5|295_aa MPIKSVYTYHHHHHHHHHQSHHHHHHHHHHHLHHYHHHYHHPHHHHHQSHHTTIIIIPTI TITTIIIPTITIIIIIPTITITTIIIPTITITIIIIIPTITITTIIIPTITITIIIIIPT IIITTIIIPTITITIIIIIPTITVTIIIPTITITIIIIIPTITIIIIIIPTITIIIIIPT ITIINLTITTIIIIIPTITITTIIIPTITITITIIIIIIPTITIINLTITTIIIIFPTIT ITTIIIPTITITIIIIIIPTITILNLTTTIIIIIFPTITITTIIIPSPSSSSTSP >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_5|888_bp atgcctattaaatcggtgtatacttaccaccatcaccatcaccatcaccatcatcagtct caccatcaccatcaccatcatcatcatcatcatctccaccattaccatcaccattatcat catccccaccatcaccatcatcaatctcaccataccaccatcatcatcatccccaccatt accatcaccaccatcatcatccccaccatcaccatcattatcatcatccccaccatcacc atcaccaccatcatcatccccaccatcaccatcaccatcattatcatcatccccaccatc accatcaccaccatcatcatccccaccatcaccatcaccattattatcatcatccccacc atcatcatcaccaccatcatcatccccaccatcaccatcaccatcattatcatcatccct accatcaccgtcaccatcatcatccccaccatcaccatcaccatcattatcatcatcccc accatcaccatcatcattataatcatccccaccatcaccatcatcatcatcatccccacc atcaccatcatcaatctcaccatcaccaccatcatcatcatcatccccaccattaccatc accaccatcatcatccccactatcaccatcaccatcaccatcatcattatcatcatcccc accatcaccatcatcaatctcaccatcaccaccatcatcatcatcttccccaccattacc atcaccaccatcatcatccccaccatcaccatcaccatcattattatcatcatccccacc atcaccatcctcaatctcaccaccaccatcatcatcatcatcttccccaccattaccatc accaccatcatcatcccatcaccatcatcatcatcaacctcaccataa >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_6|276_aa MLPEAQVCEKVTNSVSSELQPYFQTLPVMTKIDSVAGINYGLVAPPATTAETLDVQMKGE FYSENHHNPPPFAPPVMEFPAAHDRMVYLGLSDYFFNTAGLVYQEAGVLKMTLRDDMHPP RSFSHLLLQIPKESKFRLTTKFFGTFLPEPTGLTFYPAVDVQAFAVLPNSSLASLFLIGM HTTGSMEVSAESNRLVGELKLDRLLLELKHSNIGPFPVELLQDIMNYIVPILVLPRVNEK LQKGFPLPTPARVQLYNVVLQPHQNFLLFGADVVYK >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_6|831_bp atgctcccagaagcccaggtctgcgagaaagtgaccaattctgtatcctccgagctgcaa ccttatttccagactctgccagtaatgaccaaaatagattctgtggctggaatcaactat ggtctggtggcacctccagcaaccacggctgagaccctggatgtacagatgaagggggag ttttacagtgagaaccaccacaatccacctccctttgctccaccagtgatggagtttccc gctgcccatgaccgcatggtatacctgggcctctcagactacttcttcaacacagccggg cttgtataccaagaggctggggtcttgaagatgacccttagagatgacatgcatcctccg agatccttttctcatctcttgctacagattccaaaggagtccaaatttcgactgacaacc aagttctttggaaccttcctacctgagcccaccggccttaccttctaccctgccgtggat gtccaggcctttgccgtcctccccaactcctccctggcttccctcttcctgattggcatg cacacaactggttccatggaggtcagcgccgagtccaacaggcttgttggagagctcaag ctggataggctgctcctggaactgaagcactcaaatattggccccttcccggttgaattg ctgcaggatatcatgaactacattgtacccattcttgtgctgcccagggttaacgagaaa ctacagaaaggcttccctctcccgacgccggccagagtccagctctacaacgtagtgctt cagcctcaccagaacttcctgctgttcggtgcagacgttgtctataaatga >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_7|596_aa MGALARALPSILLALLLTSTPEALGANPGLVARITDKGLQYAAQEGLLALQSELLRITLP DFTGDLRIPHVGRGRYEFHSLNIHSCELLHSALRPVPGQGLSLSISDSSIRVQGRWKVRK SFFKLQGSFDVSVKGISISVNLLLGSESSGRPTVTASSCSSDIADVEVDMSGDLGWLLNL FHNQIESKFQKVLESRICEMIQKSVSSDLQPYLQTLPVTTEIDSFADIDYSLVEAPRATA QMLEVMFKGEIFHRNHRSPVTLLAAVMSLPEEHNKMVYFAISDYVFNTASLVYHEEGYLN FSITDDMTFSSSGSTLKLLHVSLLQIPPDSNIRLTTKSFRPFVPRLARLYPNMNLELQGS VPSAPLLNFSPGNLSVDPYMEIDAFVLLPSSSKEPVFRLSVATNVSATLTFNTSKITGFL KPGKVKVELKESKVGLFNAELLEALLNYYILNTFYPKFNDKLAEGFPLPLLKRVQLYDLG LQIHKGSLDSQPHLWSPRPLSHRRCYGSLNERTRDSVHHVALDLALFCLNGLSSSNSLEI GIHDISNEALRLTGFRELHPVGVPGESQLDVDILTPKVTWDKCVEDGRASSSSLGP >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_7|1791_bp atgggggccttggccagagccctgccgtccatactgctggcattgctgcttacgtccacc ccagaggctctgggtgccaaccccggcttggtcgccaggatcaccgacaagggactgcag tatgcggcccaggaggggctattagctctgcagagtgagctgctcaggatcacgctgcct gacttcaccggggacttgaggatcccccacgtcggccgtgggcgctatgagttccacagc ctgaacatccacagctgtgagctgcttcactctgcgctgaggcctgtccctggccagggc ctgagtctcagcatctccgactcctccatccgggtccagggcaggtggaaggtgcgcaag tcattcttcaaactacagggctcctttgatgtcagtgtcaagggcatcagcatttcggtc aacctcctgttgggcagcgagtcctccgggaggcccacagttactgcctccagctgcagc agtgacatcgctgacgtggaggtggacatgtcgggagacttggggtggctgttgaacctc ttccacaaccagattgagtccaagttccagaaagtactggagagcaggatttgcgaaatg atccagaaatcagtgtcctccgatctacagccttatctccaaactctgccagttacaaca gagattgacagtttcgccgacattgattatagcttagtggaagcccctcgggcaacagcc cagatgctggaggtgatgtttaagggtgaaatctttcatcgtaaccaccgttctccagtt accctccttgctgcagtcatgagccttcctgaggaacacaacaaaatggtctactttgcc atctcggattatgtcttcaacacggccagcctggtttatcatgaggaaggatatctgaac ttctccatcacagatgacatgaccttcagctccagcgggagcaccctaaaacttcttcat gtctctttgctgcagataccgcctgactctaatatccgactgaccaccaagtccttccga cccttcgtcccacggttagccaggctctaccccaacatgaacctggaactccagggatca gtgccctctgctccgctcctgaacttcagccctgggaatctgtctgtggacccctatatg gagatagatgcctttgtgctcctgcccagctccagcaaggagcctgtcttccggctcagt gtggccactaatgtgtccgccaccttgaccttcaataccagcaagatcactgggttcctg aagccaggaaaggtaaaagtggaactgaaagaatccaaagttggactattcaatgcagag ctgttggaagcgctcctcaactattacatccttaacaccttctaccccaagttcaatgat aagttggccgaaggcttcccccttcctctgctgaagcgtgttcagctctacgaccttggg ctgcagatccataaggggtctttggatagtcagccacatctatggtctccaaggcccctt tctcatcgccgctgctatgggagcctcaatgagagaaccagggattctgtacatcatgta gccctggatctggcgctgttctgtctgaatggcctgagcagcagcaacagcctagaaatt ggaatacatgatatttctaatgaggctctcagactaactggatttagggagctgcaccca gttggtgtgccaggggaaagccagctggatgtcgatatattgacacccaaggtgacttgg gataaatgtgttgaagatggcagagcctccagcagcagtctaggtccctga >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_8|291_aa MDRYNKICSTVVTLGVEMGLELLQALHGTEYSGESKTDAFGPLQKEVQTKHSPLIVVTAP IMENWPFLQINNESVTKEHLQNKQKSKAILRRPSPSFAIARHRTPSLEICRHLDFSHAVC QVSAATRRQGAGPCGLCCTSDGFAPASALSLLQHSDLHPLRGFHCPRGENAPGIVIVMSA TKARGVSHDCRPMCVRGVLWESMDPEVPDGSAVCRGSMGCVVFIVAGRWTMGPQGTISNV EQMEKVAGSSSSNNNNNKIAEQIMEEHAIYNECGRSEQIGKRTFADSEIDT >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_8|876_bp atggacagatacaacaaaatctgcagcacggtggtcactcttggtgtggagatgggtctg gagctcctgcaggccttgcatggtactgagtacagtggtgaatcaaagacagatgccttt ggacccttgcagaaagaagtccaaactaagcacagtcccctcatcgtagtaacagctcct atcatggaaaattggccattcctacagattaataatgaatcagtgaccaaggaacatctt cagaacaaacagaaaagcaaagcaatccttcgccgtccctcgccgtccttcgccatcgca cgccaccgcaccccatctctcgaaatctgcagacatcttgatttttcccacgctgtctgt caggtctccgccgccactcgacgccagggcgccgggccttgtgggctgtgctgcacctcg gacggcttcgcaccagccagcgccctctctctcctgcagcactctgatctgcaccccctg aggggcttccactgtccgcggggtgagaatgcccctgggattgtcattgtgatgtcagcg accaaagccagaggagtgtcacatgactgccgccccatgtgtgtgagaggcgtcctctgg gagagcatggatcctgaggtcccagatgggtctgctgtgtgcagaggctccatgggatgt gttgtcttcatagtagcagggcgctggaccatgggtccgcaaggcaccatcagtaatgtc gaacaaatggagaaagttgcaggtagcagcagcagcaacaacaacaacaacaaaatagca gagcaaattatggaagagcatgccatctataatgaatgtggcagatctgagcaaattggc aagcgtacctttgcagactcagagatagatacttga >gi568815578f:38246517_38476666|GENSCAN_predicted_peptide_9|70_aa MGRVAHLAGELFDAAAPGSRKLPEQIPAGWLEWPSSSLGALGESLTGGLTDRLRTAGQGG ESASPMARVA >gi568815578f:38246517_38476666|GENSCAN_predicted_CDS_9|213_bp atgggtagggtggcccacttggctggggagttatttgacgccgccgcccctggcagtcgg aagttgcctgagcagatcccagccggctggctcgagtggccttcgtcgtcccttggcgcc ctgggagagtcgctgacgggtggactgacggaccgcctgaggacggccggccagggcggt gaaagcgccagccctatggcgcgggtcgcgtga