GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:24:09 Sequence gi568815583f:59904964_60105938 : 200975 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2839 2920 82 0 1 64 86 63 0.725 4.88 1.02 Term + 6004 6251 248 2 2 63 47 154 0.787 3.77 1.03 PlyA + 6279 6284 6 1.05 2.00 Prom + 7544 7583 40 -4.05 2.01 Init + 18063 18160 98 2 2 59 98 128 0.908 8.73 2.02 Intr + 18264 18401 138 0 0 115 61 26 0.633 1.26 2.03 Intr + 19733 19844 112 2 1 59 -30 115 0.678 -3.74 2.04 Term + 22049 22204 156 1 0 65 50 159 0.889 6.75 2.05 PlyA + 22458 22463 6 1.05 3.00 Prom + 25850 25889 40 -6.05 3.01 Init + 26825 27013 189 1 0 63 48 146 0.010 7.16 3.02 Intr + 40067 40262 196 1 1 13 38 166 0.002 2.17 3.03 Intr + 40814 40897 84 0 0 59 58 107 0.163 3.67 3.04 Term + 59911 60119 209 2 2 55 33 170 0.605 4.72 3.05 PlyA + 61632 61637 6 1.05 4.00 Prom + 67864 67903 40 -4.25 4.01 Init + 69768 69862 95 2 2 64 69 134 0.933 9.00 4.02 Intr + 79621 79733 113 1 2 74 -5 96 0.440 -2.00 4.03 Intr + 85328 85503 176 0 2 26 103 198 0.568 13.84 4.04 Intr + 89216 89351 136 1 1 58 57 70 0.486 0.12 4.05 Intr + 90256 90532 277 2 1 -5 75 244 0.656 9.45 4.06 Intr + 91808 91892 85 2 1 117 17 49 0.451 -0.40 4.07 Intr + 96940 97301 362 0 2 48 -24 243 0.608 2.19 4.08 Intr + 97476 97644 169 0 1 110 97 6 0.847 2.73 4.09 Intr + 98036 98245 210 1 0 54 -23 355 0.125 19.19 4.10 Intr + 98715 98802 88 2 1 27 61 77 0.121 -2.48 4.11 Intr + 99262 99606 345 2 0 112 8 198 0.709 8.64 4.12 Term + 99789 100978 1190 1 2 21 46 1277 0.993 107.70 4.13 PlyA + 103533 103538 6 1.05 5.00 Prom + 115697 115736 40 -3.75 5.01 Init + 127326 127444 119 2 2 65 84 111 0.806 8.12 5.02 Term + 133230 133392 163 0 1 38 42 171 0.532 3.93 5.03 PlyA + 133578 133583 6 1.05 6.07 PlyA - 135854 135849 6 1.05 6.06 Term - 138798 138701 98 1 2 61 48 150 0.596 5.45 6.05 Intr - 149237 149082 156 0 0 124 98 95 0.696 13.16 6.04 Intr - 149695 149496 200 0 2 80 3 112 0.705 -0.03 6.03 Intr - 160546 160431 116 1 2 72 76 106 0.356 6.13 6.02 Intr - 161301 161215 87 0 0 86 68 55 0.123 2.55 6.01 Init - 168792 168622 171 0 0 81 37 129 0.096 6.59 6.00 Prom - 169291 169252 40 -12.03 7.00 Prom + 169447 169486 40 -3.55 7.01 Init + 181880 181964 85 1 1 65 71 100 0.895 7.03 7.02 Term + 183491 183741 251 0 2 60 47 207 0.742 8.68 7.03 PlyA + 185056 185061 6 1.05 8.03 PlyA - 186647 186642 6 1.05 8.02 Term - 196868 196428 441 2 0 84 38 130 0.244 1.87 8.01 Init - 200232 200185 48 0 0 58 94 56 0.543 4.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 98036 98278 243 1 0 54 35 372 0.841 23.42 S.002 Init - 114421 114343 79 1 1 71 28 125 0.829 6.07 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_1|109_aa MEKEDVESGSATCEERMLYTSVLLNHQVLEVLARAIRQEKEIQSIRLGKEEVKLSLFVDD MIVYLENPIVSAQNLLKLTSNFSKVSGYKINVQKSQEFLYTNNREPNHE >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_1|330_bp atggaaaaggaggatgtagaaagcggaagtgcaacttgtgaggaaagaatgttatatacg tctgtgctgctcaatcaccaagtgttggaagttctggccagggcaatcaggcaggagaaa gaaatacagagtattcgattaggaaaagaggaagtcaaattgtccctgtttgtagatgac atgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgacaagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagaattcctatac accaataacagagagccaaatcatgagtga >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_2|167_aa MRETWQLLTPALVILMFPSAARDGISWNEKEPRWFFKKSLQRPLLLEIRSKKGVAPKTIQ GSSNLCFQFNEGSPEIKETSTTKFGHTGAGTQPSEGAFFAGDNIQPKGKLQSLLLHGLDE AGNRHPQQTNTGTENQTPHVLSHKWELNNENTWTQEGGTSQPGPVGE >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_2|504_bp atgcgggagacttggcagctcctgaccccagcactggtcattctgatgttcccctctgcc gcccgagatggcatttcctggaatgaaaaggagccaaggtggttcttcaaaaagtctttg cagagaccattactgttggaaattagaagcaagaaaggagttgccccaaaaacaatacaa gggagttctaacctttgtttccaatttaatgaaggctcccctgagattaaagaaacgtca acaaccaagtttggccacacaggtgctggaacacagccaagtgaaggagccttctttgct ggagataacatacaacctaaagggaaacttcagtccttgttattacacggacttgatgaa gccggaaaccgtcatcctcagcaaactaacacaggaacagaaaaccaaacaccgcatgtt ctcagtcataagtgggagttgaacaatgagaacacatggacacaggaagggggaacatca caaccggggcctgttggggaatag >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_3|225_aa MRKATDKDPSVGAITASVVAEATCSGIATESGSDFIIRECNDSLVSISHPEASKSNCSTV CFNPGDWLARRLVPLCCGGPMLILTQRPGPTLLKVVAASQTANHHYFLERPSETLSIKEP ISNRRMLSGPREQVTERIRIYNAPGKLTTVLRFLLLAMWNYNKRDVTVFHLDQQLQVLNM ENSEKTEVVLFACSSFKPITDMHLRLFELAKSTSMEQEDTELSNS >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_3|678_bp atgagaaaagccactgataaggacccatcagtaggagccattactgcatcagttgttgcc gaggctacctgctcaggcattgctactgaaagtgggtcagatttcatcatcagggaatgt aatgacagcttggtctccatatcccatcctgaagcatcaaaatctaactgttcaacagtc tgcttcaatcctggagactggctggcgaggaggctggtccctctttgctgtggaggtcct atgctgatcctaacacaacgcccaggccccacactgctgaaagtggttgctgccagccag acggctaatcatcattatttcctggaaaggccctctgagaccttgagcataaaagagcct atttcaaaccggaggatgctctctggtccacgtgagcaggttaccgagcgcattaggatt tataacgctcctgggaagctgaccactgtattacgcttcctgctgctagctatgtggaac tacaacaagagagatgtcacagttttccatttagatcaacaacttcaagttcttaacatg gaaaattcagagaagactgaagtggttctctttgcttgcagttcctttaaacctatcacc gacatgcacctcagattgtttgagctggcaaagtctacatcaatggaacaggaagataca gagttgtcaaactcataa >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_4|1081_aa MERSDQRSLNCNALYEGQQVGVMVGDTIAGLSKKAVERNGNQTKRKVEEGNDEIRAEISE TENRIIIEIGSADGDIVLATSSEDSQTLEALISIFTSEQSPKQYIVPTRRRLNSLDRKPK SYPAGPSTANRSSPSGPHSWEPVTLSYLFICALKDDSDFQASRKRQKANRLLRGKVFAGV VDAAAAPTLPTQLATREKSLTLLFGGGRQFRGGRLDPENDGIQVPQPSGSAAVFKTAQLN SWAHSPFLGESSARAGEHSSPAYLGSSFILVRPAEHRRKCPANPGKFALLGMPRPPTIFY RSTYQYERMHIFTKRILKIPGGVYAGTLPQQERSCLAHSNCFVPRPAVLGSRKAVDQRLS ILGNTSLAHYSSSRISQAPQLFGLPSSFLPRTFDPNLAAQSNPPNAAPRYDLKRDYFFRI KKVTLFFSSSTSVNIEKRALASQPRQPRARAISILMKLFNLLNSSCLSYVLQFDDDDDDG SDDDEDNDNNNNDSFNPSKANPNRREPAARTRRSDPRHIPRSPRRKTLWFFPQVVEDTAV LLHTNYTKLKQLHRKVKSNFDLTLRGLGGKGDCHMIAKKWHINEAPLQGSFLLLSPLKTI RWFEGGHGGSHLAQRRRGAHSSALRSPNTSLPPSAPGLRSRARPPAGRSRSVADRAARDA AGLSSRAQVRAESNRTRTLRAECASTRKVAGERVRALADRAPQAPPPTPEVCYSVWLTPP VSLCIHATFPYYPPPSQIRAVRRPARTQSKKRARKKMPRPGRNTYSDQKPPYSYISLTAM AIQSSPEKMLPLSEIYKFIMDRFPYYRENTQRWQNSLRHNLSFNDCFIKIPRRPDQPGKG SFWALHPSCGDMFENGSFLRRRKRFKVLKSDHLAPSKPADAAQYLQQQAKLRLSALAASG THLPQMPAAAYNLGGVAQPSGFKHPFAIENIIAREYKMPGGLAFSAMQPVPAAYPLPNQL TTMGSSLGTGWPHVYGSAGMIDSATPISMASGDYSAYGVPLKPLCHAAGQTLPAIPVPIK PTPAAVPALPALPAPIPTLLSNSPPSLSPTSSQTATSQSSPATPSETLTSPASALHSVAV H >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_4|3246_bp atggaaaggagtgatcaacgttccctgaactgcaatgcattatacgagggtcagcaagtg ggagtcatggttggcgacacaattgctggcctcagcaaaaaagcagtagaaaggaatgga aatcaaaccaaaaggaaggtggaggaaggaaatgacgagatcagagcagaaatcagtgaa acagaaaacagaataataatagagattggatctgctgatggagacattgtcttagctact tcatcggaagactcgcagactctagaagccctgataagtatcttcacaagtgaacaatca ccaaagcaatatatagtgcccaccagaagacgactcaattccttggacagaaagcccaag agctacccagctggcccttcaacggcaaatcgctcctcgcccagtggtcctcactcctgg gagccagtgacactttcttatttatttatttgcgccctgaaggacgactcggatttccag gccagccggaagaggcagaaagcgaacaggctgctcagagggaaagtttttgcaggagtt gtggacgccgcagcagccccaactctcccaacccagcttgcgacccgagagaagtcgctc acacttctctttgggggagggcgacagttccggggaggacgcttggatcccgaaaatgac gggatccaggtcccacaaccctcaggaagcgctgcggtcttcaaaactgcgcagctaaat tcctgggcgcattctccctttcttggcgagtccagtgcccgtgccggggagcattccagc ccagcttacctcggcagctcctttattttggttaggccagctgagcacagaagaaaatgc cctgccaatcccggaaagtttgcgctactgggaatgcctcgaccgcccacgatattttat cggtctacgtatcagtatgaacggatgcacatatttactaagaggatcctaaaaattcct gggggagtctatgctggtacccttccacagcaggagcgctcctgccttgcccacagtaac tgttttgtaccgaggccagctgtactcgggagcaggaaagctgtggaccagcgcctttcc attctgggaaacacaagccttgcccactattcctcatcacggatctcccaagctccccag ctctttggactccccagctccttcctccccaggaccttcgatccgaatctggccgcacaa agtaaccccccaaacgccgcacctcgttatgacctgaagcgcgattattttttccgaatt aagaaggtgacattattcttttcttcaagcacttctgttaatattgaaaaacgtgcgctt gcatctcaaccgaggcagcccagagccagagccatttcaatattaatgaaattgtttaat ctcctaaattcatcgtgtttaagttacgttttgcaatttgatgatgatgatgatgacggc agcgacgacgatgaggataatgataataataataatgattcatttaacccttccaaggcg aatccgaaccgaagggagcctgctgctcgcaccaggcggagtgacccaaggcacattccc cgctccccaagaaggaaaactctatggttctttccccaagttgtagaagatactgctgtc ttactacacactaactatacaaagctgaagcagctgcataggaaagtcaagtcaaatttc gacctaactctccggggactggggggcaagggcgattgtcatatgatagctaagaagtgg cacattaatgaagcgccgctacaggggtcttttctgctcctgtcaccgcttaaaactatc agatggttcgagggaggacatggaggcagccacctagctcagcggagacgcggagcccac agcagcgccctccggagccctaacacgtcgctgccaccatccgcgccgggactccgcagc cgagctcggccgcccgcaggacgctccaggagcgtcgcggaccgggcggcacgggacgct gcggggctgagctcaagagcccaggttcgcgccgagtccaaccggacccggacgctgcgc gcggagtgcgcgtcgacccgcaaagttgctggcgaaagagtccgggcgctggctgatcga gcgccgcaggccccacccccgacccccgaagtctgttactcggtctggctgaccccgccg gtgtctctgtgcatccatgctacctttccctattacccacccccttcccagatccgagca gtccgccggcccgcgcggacccagagcaagaagagggcgaggaagaagatgcctcggccc ggccgcaacacgtacagcgaccagaagccgccctactcgtacatctcgctgaccgctatg gccatccagagctctcccgagaagatgctgccgctgagcgagatctacaagttcatcatg gaccgcttcccctactacagggagaacacgcagcgctggcagaacagtctgcgccacaac ctctccttcaacgactgcttcatcaagatcccgcggcggccggaccagccaggcaagggc agcttctgggcgctgcacccaagctgcggggacatgttcgagaacggcagcttcctgcgg cgccgcaagcgcttcaaggtgcttaagtccgaccacctggcgcccagcaagccagccgac gcggcgcagtacctgcagcagcaggccaagctgcggctcagcgcgctggcggcctcgggc acgcacctgccacagatgcccgccgccgcctacaacttgggcggcgtggcgcagccctcg ggcttcaagcaccccttcgccatcgagaacatcatcgcgcgggaatacaagatgcctggg gggctggccttctccgccatgcagccggtgcccgctgcctacccgctccccaaccagttg actaccatgggcagctcgctgggcaccggctggccacacgtgtatggctccgccggcatg atcgactcggccacccccatctccatggcgagtggcgactacagcgcctacggcgtgccg ttgaagccgctgtgccacgcggcgggccaaacgctgcccgccatccccgtgcccattaag cccacgccggccgccgtgcccgcgctgcctgcgctgccagcgcccatccccaccttgctc tcgaactcgccgccctcgctcagccccacgtcctcgcaaacagccaccagccaaagcagc cccgccacccccagcgaaacgctcaccagcccggcctccgccttgcactcggtggcggtg cactga >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_5|93_aa MEAEKSHDVPPAISKLETQEAWWSNSSQIKKVEEPGELIVGQPEDTLCVDSLEQKRHLGS PGNTKGSKRSALALSPGVTPGESAAPGGCLQRA >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_5|282_bp atggaggctgagaaatcccatgatgtgccacctgctatcagcaaactggagacccaggaa gcctggtggtctaattccagtcagataaagaaggtcgaagaaccaggggagctcatagta ggacagcctgaagacactctgtgtgtagacagcctggagcagaaacgccacctgggaagc cctgggaacacaaaaggatcaaagcggagcgcactggcgctgagtcctggagtcactcca ggggagtcggcagccccaggtggctgcctccagagggcctga >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_6|275_aa MTSVLMRRGRDTRDAHTQRKGHGRTQQKGGHLQAKERGLRRNQTCRPLDLGLPATRTNGL WKVVSHFLLTVAQMGLLEALGYQWQLAGSCQLVTGNQPLPVEPGSSKVKNTHPGEILPRR SRFLRIIPTHKHFAHQPLSQDLLLRNPDKTMTDSADLKKLNLMPAVEKSRKKLVAWKCQR AQKLMTLDTSGKCWQSALDPLDRRLEDSTLRNPTNQHAQLISPNMKSCSPGLNLNQDEGM VGQGVFDFVPQELTLADKQQIGFFRDTVKITKEKS >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_6|828_bp atgactagtgtgctcatgagaagaggaagagacaccagggatgcgcacacccagagaaaa ggccatgggaggacacagcaaaaaggtggccatctgcaagccaaggagagaggcctcagg agaaatcaaacctgccgaccccttgatcttggacttccagccaccagaactaatggtttg tggaaggtcgtgtctcatttcctgctcacagtggcgcagatggggcttttggaggctctg ggctatcagtggcagctggcaggatcctgtcagctggtgactggaaatcagccactacct gtggaacctggtagcagcaaagtgaaaaacacacatcctggggaaatcctaccaaggcgt tccagatttctcaggatcatacccacacataaacattttgcgcatcagcctttgtctcag gatctgcttttgaggaatccagacaagacaatgactgattcagcagacttgaaaaagttg aatcttatgccagcagtggaaaaatccaggaagaaacttgtagcatggaaatgtcaaagg gctcagaagttaatgacattggatacctctggaaaatgctggcagtcagctttagatcct ctagacaggagattagaagactctacactaagaaatccaactaaccagcatgcacaactg atttccccaaacatgaagtcctgcagcccaggtcttaacctgaatcaggatgaggggatg gtgggtcaaggagtcttcgatttcgtgcctcaggaactaactctggctgacaaacagcaa ataggatttttccgagacaccgttaagatcacaaaagagaagagctga >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_7|111_aa MTELLTHMKDGDWRTAPISMPDSRYETPGTHIVKGGVIVRLFSLTSSGPRENERDEGPVT GRGMNHTEELEDSKLTFKGRNIQTGPIHSEPVKNILFVMMDKSKIPQECSH >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_7|336_bp atgacagaattactgactcacatgaaggatggggactggcggacggcacccatcagcatg cctgatagccggtatgagactccagggactcacatcgtcaaaggaggagtcatcgttaga cttttctcactgacatccagtggcccaagggagaacgagagagatgaagggccagtcaca ggaaggggtatgaaccacactgaggagctagaagattcaaaattaactttcaagggtagg aacatacagacaggaccaattcactcagaacccgtaaaaaatatactctttgtgatgatg gataagtctaaaatccctcaagaatgtagccactga >gi568815583f:59904964_60105938|GENSCAN_predicted_peptide_8|162_aa MKDIQEFSEEALAQDQVTLLRSKSLVGILFNAVASGSHSEVYGSVLLVCPVLPSCWSSRK GQKVTRSLLLHLLLLLFLLNPLKWTFKCELCNHALAKEQPQRQLQLVQRDGLVGIPDSPD FCSQTLTNLATRWLMPVLTAWTRKCLSLPKVCRRSAAGNTEV >gi568815583f:59904964_60105938|GENSCAN_predicted_CDS_8|489_bp atgaaagacattcaagagtttagtgaggaggctcttgctcaagaccaggttactttgcta cggagcaagtccttggtaggcattctgttcaatgcggtagcctctggaagtcacagtgag gtttatggttcagtactgctggtttgccctgttcttcccagctgctggagcagccgcaag gggcaaaaggtgaccaggtcacttctactccacctgcttctgttgttgtttcttttaaat cctcttaagtggacttttaagtgtgaactttgcaaccatgccctggctaaagaacaacct cagaggcagctgcagcttgttcaaagagatgggctggtgggaatcccagactctcctgat ttctgttcacaaactctcactaatctggcaaccaggtggttaatgccagtgttgaccgca tggacacgcaagtgtctttctttgcccaaagtctgcaggaggtcagcagctggaaacact gaagtctga