GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:59:55 Sequence gi568815582f:55225092_55429311 : 204220 bp : 42.60% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10764 10782 19 2 1 73 94 30 0.268 2.51 1.02 Term + 17563 17786 224 2 2 39 53 188 0.398 6.50 1.03 PlyA + 19159 19164 6 1.05 2.00 Prom + 22885 22924 40 -6.05 2.01 Init + 25878 25985 108 2 0 100 89 43 0.461 5.87 2.02 Intr + 38250 38394 145 2 1 53 41 158 0.786 6.53 2.03 Term + 42942 43066 125 1 2 70 39 88 0.404 -0.33 2.04 PlyA + 44674 44679 6 1.05 3.04 PlyA - 46312 46307 6 1.05 3.03 Term - 46576 46500 77 2 2 116 38 70 0.770 1.82 3.02 Intr - 49038 48787 252 1 0 28 53 162 0.323 3.18 3.01 Init - 57162 56949 214 0 1 61 35 220 0.529 12.95 3.00 Prom - 66743 66704 40 -5.95 4.00 Prom + 66867 66906 40 -6.15 4.01 Init + 74103 74277 175 2 1 49 92 83 0.320 4.36 4.02 Intr + 74741 74830 90 0 0 50 57 80 0.234 0.15 4.03 Intr + 75665 75837 173 0 2 11 95 134 0.410 5.14 4.04 Intr + 90348 90426 79 2 1 111 89 24 0.023 3.01 4.05 Intr + 98597 98870 274 0 1 40 48 338 0.138 20.67 4.06 Intr + 99188 99323 136 2 1 92 83 -27 0.561 -3.15 4.07 Intr + 101245 101502 258 0 0 104 105 217 0.996 21.74 4.08 Intr + 102205 102314 110 0 2 131 103 119 0.999 15.86 4.09 Intr + 102495 102802 308 0 2 93 79 297 0.874 24.47 4.10 Intr + 103609 104320 712 2 1 92 18 757 0.180 58.67 4.11 Intr + 104590 104703 114 1 0 84 86 110 0.424 9.04 4.12 Intr + 104848 104914 67 1 1 20 4 56 0.214 -11.61 4.13 Intr + 105210 105386 177 2 0 91 40 231 0.672 17.79 4.14 Intr + 105690 105971 282 2 0 26 81 173 0.951 7.09 4.15 Intr + 106286 106737 452 1 2 5 66 276 0.889 8.27 4.16 Term + 106921 107017 97 1 1 72 42 132 0.914 3.46 4.17 PlyA + 107025 107030 6 1.05 5.00 Prom + 110108 110147 40 -5.75 5.01 Init + 116421 116483 63 2 0 82 116 -7 0.131 2.70 5.02 Intr + 122261 122371 111 1 0 60 62 94 0.001 3.66 5.03 Intr + 132757 132840 84 0 0 105 45 89 0.073 5.40 5.04 Intr + 135240 135365 126 2 0 75 33 88 0.013 1.96 5.05 Intr + 145183 145233 51 0 0 74 61 84 0.005 2.49 5.06 Intr + 149702 149941 240 1 0 65 11 162 0.043 3.02 5.07 Intr + 154780 155025 246 0 0 106 -6 164 0.587 5.53 5.08 Intr + 155047 155201 155 0 2 90 40 162 0.078 9.55 5.09 Intr + 159916 160045 130 1 1 61 100 12 0.031 -0.52 5.10 Term + 170262 170327 66 2 0 122 38 124 0.886 7.76 5.11 PlyA + 172128 172133 6 1.05 6.06 PlyA - 172227 172222 6 1.05 6.05 Term - 176796 176716 81 0 0 88 42 104 0.224 2.51 6.04 Intr - 186469 186260 210 1 0 109 76 29 0.074 1.99 6.03 Intr - 187251 187014 238 2 1 82 31 113 0.056 1.69 6.02 Intr - 193777 193692 86 1 2 46 116 56 0.443 1.90 6.01 Init - 194948 194619 330 2 0 53 38 176 0.252 6.46 6.00 Prom - 195095 195056 40 -4.55 7.03 PlyA - 197344 197339 6 1.05 7.02 Term - 201879 201723 157 2 1 113 42 149 0.896 9.22 7.01 Init - 202462 202410 53 1 2 62 78 38 0.527 0.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 39145 39273 129 2 0 106 61 84 0.812 7.25 S.002 Term + 77917 78054 138 0 0 87 48 115 0.915 4.38 S.003 Init - 122643 122542 102 0 0 56 81 132 0.803 9.59 S.004 Term + 132757 132903 147 0 0 105 49 114 0.836 6.12 S.005 Init + 151134 151286 153 2 0 83 68 85 0.909 6.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_1|80_aa MVKYKEARQGPGIQKALCSWDKAKGLIELINTSHLQMAKLKEHTVTHAHWGFRSCKHSTL DIAMGSEPDDLPVCMLPLGV >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_1|243_bp atggtgaagtacaaggaagcaaggcaaggacctgggatacagaaagccctctgttcttgg gataaggcaaagggtctaattgagctgattaacacaagccacttgcagatggcaaaactg aaagagcacactgtaacacatgcccactggggcttcaggagctgtaaacactcaacccta gacattgccatgggatcggagcccgatgacctgcctgtctgcatgctccccctaggggtt tga >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_2|125_aa MPCLNILNQGYDFSCLTTHGTKFLVVEVDDTKQLVRDGTPGLEDERAESGFWKPKRSCWR ARLRTHGQLVSKRQGPLPLLVVPGCPLEGYVPQVPHSGICTAGASGALQTVWQGKEAKTK LHIGS >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_2|378_bp atgccctgcttgaacatcttgaaccagggatatgattttagttgcctaactacacatgga actaaattcctagtggttgaagttgatgatacaaagcagctggttagagatgggacccca gggctggaagatgaaagagctgagtccggcttctggaagcccaaaagatcctgctggcgt gcccggctgaggactcatgggcagctggtatccaagcgacaagggccactaccacttctg gtggtccctgggtgtccgctggagggttatgtccctcaagttcctcattcagggatttgc acagctggagcttctggagcattacaaactgtgtggcaggggaaagaagcaaagactaaa ttgcacattggctcttaa >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_3|180_aa MEVLDKVKNNKWTLSGGPTLIDENSYLGHCVEKDSQVGPRTRREEYHSQLIAQAQEGREG TCTPIAAIAMPAAKLPREQICIEPQRGLQLQARVSGSDGELGRNPIQKLISLNCCITEIK RQICKFQVTTTHKKDKGFPWKEANAEMKRGKRIIKALDDHDLLTISMVLPSLECRIVVII >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_3|543_bp atggaggtgttagataaggtcaagaataacaagtggactttgtctggagggccaacactg atagatgaaaacagctatttagggcactgtgttgagaaggattctcaggttgggccacgg actcgcagggaagagtaccacagccaattaatagcacaggcccaggaagggagagaagga acctgcacaccaatagctgccatagcaatgccagcagcgaagttgccacgtgaacagatc tgtattgaaccccagagagggcttcagctgcaagctcgagtgtcagggtcagatggagag ctgggaaggaatcctatccagaagctaattagtttaaactgctgcatcacagaaattaaa aggcaaatttgcaaatttcaagtaaccactacccacaaaaaagacaaaggtttcccctgg aaagaggccaatgctgaaatgaagagaggcaaaagaataataaaggccctcgatgaccat gatcttcttaccatctccatggttttgccttctctggaatgtcgcatagttgtaatcata tag >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_4|1167_aa MPSAWRKDSPGVRSSVCLFYGGWGRWLIPCWLLSSLTLLQPLVTSLLPHTNTFVIERLVS GADKLRSGRRDSVRTGMSLLEWVEELMGGWLCYLMMRKSPPPSVKLIARKELSPLLRDIA LTLPMRGHLSFRLTVYIPFRKSRVVANVSSVRMGTLFCSQPYPSTLKPYQVQTPPPAERR GCSPDREPSSGSLAAPRPLREARARGSQALPSEEMPRDLSKQPAAARDRTWRVSRRNLRD PEPRPRGGAIRPTRGPGSCREAPRLLSPLSPPPSPSCLSPPLFSCLFFCILSSLAAPVAS RLLRAAGEEFLASASSSTTCCESTQRSVSDVASGSTPAPALCCAPYDSRLLGSARPELGA ALGIYGAPYAAAAAAQSYPGYLPYSPEPPSLYGALNPQYEFKEAAGSFTSSLAQPGAYYP YERTLGQYQYERYGAVELSGAGRRKNATRETTSTLKAWLNEHRKNPYPTKGEKIMLAIIT KMTLTQVSTWFANARRRLKKENKMTWAPKNKGGEERKAEGGEEDSLGCLTADTKEVTASQ EARGLRLSDLEDLEEEEEEEEEAEDEEVVATAGDRLTEFRKGAQSLPGPCAAAREGRLER RECGLAAPRFSFNDPSGSEEADFLSAETGSPRLTMHYPCLEKPRIWSLAHTATASAVEGA PPARPRPRSPECRMIPGQPPASARRLSVPRDSACDESSCIPKAFGNPKFALQGLPLNCAP CPRRSEPVVQCQYPSGAEGSGPPAALGVSMQKTPTYRPARQLHTLCHSSLPRGDSLVPAV LLREFTRVGQAPGLLEKSKTVGQTHTTDLRQASDCQSVSPDAGWFFHCVKDLVSAMAAIC ERILEMGPTFRIHLQVKKLPDLARDQELSLCLRDRHTETLLAAVLARRAGVRCLRGRCAG KDALRPGILATARRGLHTSVPLVAGKETDKVTDKAVAPAAQRTNRCVTNPWSQADHLAGP AHGKWWDLSGCLSGNCICFRVRIWVPGSERTAARREAGTRGSALEGSSEEQMRREGRTKG SWGVRPRPGFLGSCVSNAGSRPGQAFQASAEARLSVTEIQASIVRLLGRMIENLWRTMRG NVEVGSAREAEAARHLLAPLYGKDSFAVGRDFRDHDLQKLQTSSSDEKVKPKREGRDPPV ALADPRLRVRKIKLRKDKHYFQSALRD >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_4|3504_bp atgccatctgcatggaggaaagattcgccaggtgtaagaagctctgtgtgcctattttat ggagggtggggccggtggttaataccctgctggctcctgtcttccctgactcttctgcag cctctggtgacttccctgctccctcataccaacacattcgtgattgagagattggtgtct ggtgctgataaacttaggtcagggagacgggactctgtaagaactgggatgtccctgctg gagtgggtggaagagctgatgggaggatggctctgttacctgatgatgagaaaatctccc cctccaagtgtgaaactcatcgcaaggaaagaactttctcctcttttgcgggacattgcc ctgacactgcctatgaggggacatctttctttccgccttactgtttatattcccttcaga aaatccagagttgttgcgaatgtcagctctgtgcggatggggactttgttttgttctcag ccatatcccagcaccttaaaaccctaccaggtacagacgccacccccagcggagcgcagg ggctgcagcccggaccgggagccttcttcaggctccctggctgccccgcggccactgcgg gaggcgcgcgcgcgaggcagccaagccttgccttcggaggagatgcccagggatcttagc aagcagcctgcggctgccagggatcggacctggcgagtttcccgacggaatctgagggat cccgaacctcggcctcgagggggcgctatccggcctactcgaggaccaggcagctgcaga gaagctccaagactcctgtctcccctctcccctcctccttctccctcttgcctttctccc ccacttttctcctgtctcttcttttgtattctctcctccctcgccgccccggttgcctct cgcctcctccgggccgcaggggaggagtttctggcgtcggcaagttccagcaccacatgc tgcgaatctacccaacgctctgtctcagatgtggcatcaggctccaccccagcgcccgct ctctgctgcgcaccctacgatagtcgactgctgggcagtgcgcgaccggagctgggcgcc gccttgggcatctatggagcaccctatgcggccgctgcagctgcccagagctaccctggc tacctgccctatagcccagagcccccctcactgtatggggcactgaatccacagtatgaa tttaaggaggctgcagggagttttacatccagcctggcacaaccaggagcctattatccc tatgagcggactctggggcagtaccaatatgaacggtatggcgcagtggaattgagtggc gccggtcgccgaaagaacgcgacccgggagaccaccagtacactcaaggcctggctcaac gagcaccgcaaaaacccctaccccactaagggtgagaagatcatgctggccatcatcacc aagatgaccctcacccaggtgtccacctggttcgccaacgcacgccggcgcctcaagaaa gagaacaaaatgacatgggcgcccaagaacaaaggtggggaggagaggaaggcagaggga ggagaggaggactcactaggctgcctaactgctgacaccaaagaagttactgctagccag gaggcccgggggctccggctgagtgacctggaagacctggaggaagaggaggaggaggag gaggaagctgaagacgaggaggtagtggccacagctggggacaggctgacggagttccga aagggcgcgcagtcactgcctgggccgtgcgctgcagctcgagagggccgattggagcgc agggagtgcggcctggctgcgccccgcttctccttcaatgacccttccggatcggaagaa gctgacttcctctcggcggagacaggcagccctaggttgaccatgcactacccatgcttg gagaaaccgcgcatctggtctctggcgcacaccgcgacagccagcgctgttgaaggtgca cccccagcccggcctaggccacgaagtcctgagtgccgtatgattcctggacagcctcct gcctctgcccggcgactctcagtccccagagactccgcgtgcgacgagtcttcctgcata cccaaagcctttggaaaccccaagtttgccctgcagggactaccgctgaactgtgcgccg tgcccgcggaggagcgagcctgtagtgcagtgccagtacccgtctggagcagaaggtagt gggcccccagcggcgctgggagtatctatgcaaaagacacccacctaccgccccgcccgg caattgcacaccctctgccattccagtctgcccagaggagactcgctggtccctgctgtg ctgctcagagagtttactagggttgggcaggcaccagggcttttagagaagtctaagact gtgggccagacgcacaccacagatttaaggcaggccagtgactgccaatcagtgtcccct gatgctggctggttctttcactgtgttaaggacctggttagcgcaatggctgcgatttgc gaaagaatcttggaaatgggccccacgtttcgaattcatctccaggttaagaagctgcca gaccttgccagggaccaggagctctcactttgcctaagagacagacacacagaaaccctc ctagcagctgtccttgcacgcagagctggggtgaggtgcttgagaggtcggtgtgcaggc aaagatgcattgcggccagggattcttgcgacagcgcggcgtgggctgcatacctcagtt cctttagtagctgggaaagaaactgacaaggtaacagacaaggccgtggcgcctgcagct cagcggactaaccgctgcgtaaccaatccgtggagccaagctgaccacctcgcgggtcct gcgcatgggaagtggtgggacttgtcaggatgtttatctggaaactgcatctgtttccgg gttcgtatttgggttccagggagtgagcgcaccgctgctcgcagggaagcagggacccgg ggcagtgccctagaggggagctccgaagagcaaatgaggcgggaaggcaggacgaaaggc agctggggtgtgcgcccaaggcctggattcctggggagctgcgtttccaacgccgggtcc cggccaggccaggccttccaggccagcgcggaggccagactatcagtcactgaaatccag gcgagcatagtccgcctcctgggcaggatgattgaaaacctttggaggacgatgagggga aatgtggaggttggaagtgctagggaagccgaagcagcgcgccacctcctagctcctctc tatggaaaagacagctttgctgtcggaagggactttagagatcatgacttacaaaaactg caaacctcctcttcagatgagaaggttaagcccaaacgggagggcagggaccctccagtc gccctcgctgacccgcggcttagggtccggaaaattaaactgcgaaaagataaacattat ttccagtcggctttgcgggattaa >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_5|423_aa MHRRVDFKVFCQNENSGYLWKGRSGRRAEHWMLAQVHTSFRLLEKGFMKGGNAEKVTVGH PPSSKLPSDASLNQVRRPLLKSPESMWNSLEGSSLDPQLLCQLPGTEKKLKEHVLNIFSE NRMSTDELLYSDLGAYGPNTGAVLKDKEGKKKLAQNTHHLNPLNIEGGLEGKQTSLPLSF LVLRYRSHITRPFQWDTLGSGVSGHRLQEERGPWLDTTLSLLICKAQGGLLKPLGSVIAP LPFIQPLERAPECPVIKIAFRLESIIADTHYCFPGGHQSRQILYKPRTANLAVYDGSLCV ACGLPGEDSGPNTCHASNSLNPSPSALPDPGEKQQLLTAGTARPILPCPSPGFRAQRLRR DFVPKLIVSRLCELAWGCFYRGLLGGVTVKLLPKENDNAYLKPTQCEEDENEDLYDDPLS LNE >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_5|1272_bp atgcataggagagtggatttcaaagtcttttgccaaaatgaaaacagtggttatctctgg aagggcaggagtgggagacgtgctgagcactggatgttggcacaagtccacacctctttc cgactgctggagaaaggcttcatgaaagggggcaatgcagaaaaagtaactgttggccac ccaccgtccagcaaactgcccagtgatgccagcctcaaccaagtcagaaggccacttctg aaatcccctgagagcatgtggaattctctggagggctcatcacttgatccccaactcctg tgccaattgcctggcacagaaaagaagctcaaagaacatgtgttaaatatcttctcagaa aaccgtatgagcactgatgaactgctgtattctgacttgggggcttatggacccaacacc ggagctgtgttgaaagataaagaagggaagaaaaaattggcccagaacacacaccatttg aaccccttaaatattgaaggaggactggaaggcaagcagacatctcttccactctctttc ctggtcctgcgctacaggtcacacataacaaggccctttcagtgggacacactgggctct ggggtatcagggcatcgtctacaagaagagcgagggccttggttggacacaacccttagt ttactcatctgtaaagcccaaggaggcctcctgaagcctcttggctctgttattgcccct ctccctttcatacaacccttggagcgagcacctgagtgccctgtgataaagatagcattt cgacttgaaagcataattgctgacacacattactgtttccccggaggacatcagtccaga caaatcctctacaaacctaggacagcaaacttggcagtttatgatggaagcctgtgtgta gcttgtggtctccctggtgaagactcgggtccaaacacctgccacgccagcaacagcctc aacccttccccctctgctctccctgacccaggtgaaaagcagcagctgctgactgctggc actgcccgtcccatcctgccatgcccatcaccaggattcagagctcaaaggcttaggaga gattttgttcccaagctcattgttagtcgtctctgtgaactggcatgggggtgtttttat agaggactcttgggtggggtgactgtgaagttgttgccaaaggaaaatgacaatgcttat ctgaagcctactcagtgtgaagaagatgagaatgaagatctttatgatgatccactttca cttaatgaataa >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_6|314_aa MMDLCREIREDLTGKNDMNRVLKEGRESVMWMDIWKKRIPGTRSSKCKGPEAGTYLACCR RARTPVKPESSVLEREVDQTPEKGGLGEGVYTMSGLSGHCKVFLYKKAGKKKGDQDIGRR VEALTVPIAVICQNRAPGRLWEENPSSSIPASGGPGVPRLATASLQPLPLSFHGLLVCGS RYTFSLGPPEIQDDCILKSLTAYICKDSLSKYGHLSEVCSSSSSSWATLPCFKVFWNFVQ TCCAEDPLPVLPLQEREEAATYTLHFPELRQLASSCIQPRGSTAGTWRQQAEWKYRLNGA EEQMEILALVGESD >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_6|945_bp atgatggatttatgtagggagatcagagaagacctcactgggaagaatgatatgaacaga gtcctgaaggaggggagagagtcagtcatgtggatggatatttggaagaagagaattcct ggcaccagaagcagcaagtgcaagggccctgaggctggaacatatttggcctgctgcaga agagcgagaacaccagtgaagccagagtcaagtgtgctagagagagaagtcgatcagacg ccagagaaagggggacttggggaaggggtgtataccatgtcgggcctttcaggtcattgt aaagtcttcctgtacaagaaagctgggaagaagaagggtgaccaggatattggcaggagg gttgaagccttgacagttcccattgctgtgatctgccagaacagggctcctggaaggctt tgggaggagaatccttcctcatctattccggcttctggtggccctggtgttcctcggctt gcaacagcatcactccaacctctgcctctgtctttccatggccttcttgtctgtggttct cgctatacattttctttaggcccacctgaaatccaggatgactgcatcttgaagtcctta actgcttacatctgcaaagactctttgtctaaatatgggcacctttctgaagtttgctct tcttcctcctcttcttgggctactctcccatgcttcaaggttttctggaattttgttcag acctgttgtgctgaggatccacttcccgtccttcccctgcaggagagggaggaggctgca acctacacactacatttcccagagctgcgtcagctggcctccagctgcattcaaccaagg ggaagcactgctgggacctggaggcaacaggcagaatggaaatatcgtcttaatggtgca gaagagcaaatggagattttggctctagtgggggaatcagattaa >gi568815582f:55225092_55429311|GENSCAN_predicted_peptide_7|69_aa MERPTRQRTEASSNNPVRLELPIKAGTPAATLRNAVMMLSGKAEGTGSLHLRIAAPNLGH QPEDPAREK >gi568815582f:55225092_55429311|GENSCAN_predicted_CDS_7|210_bp atggagaggcccacaagacaacgaactgaagcctccagcaacaaccctgtgaggctggaa ctgcccataaaggctggcaccccagcagccaccttgagaaatgcagtgatgatgctgagt ggcaaggcagaaggcacagggtccctgcacctgaggattgctgctccaaacctgggccac cagcctgaagatcccgccagggagaaataa