GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:29:27

Sequence gi568815592r:36266648_36485671 : 219024 bp : 45.85% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3794   4081  288  2  0   61   47   311 0.140  20.66
 1.02 Intr +   4613   4716  104  2  2   83   49   157 0.643  11.12
 1.03 Intr +  24673  24830  158  2  2  127   58   234 0.960  24.03
 1.04 Intr +  26414  26479   66  1  0   96   89    89 0.949   8.90
 1.05 Intr +  27543  27752  210  2  0   81   69   462 0.994  42.71
 1.06 Intr +  28717  28799   83  0  2  101  110    88 0.796  10.74
 1.07 Intr +  31193  31196    4  2  1  117  111     0 0.322  -1.67
 1.08 Intr +  31931  31982   52  1  1  112   48    20 0.272  -1.12
 1.09 Intr +  35214  35411  198  1  0   90   22    96 0.445   2.52
 1.10 Intr +  39645  39729   85  1  1   86  111    61 0.898   7.08
 1.11 Intr +  40940  41065  126  2  0   82   47    95 0.466   4.59
 1.12 Intr +  44614  44798  185  1  2   12  100   111 0.531   4.13
 1.13 Term +  45116  45234  119  0  2  105   45    25 0.546  -1.40
 1.14 PlyA +  48141  48146    6                               1.05

 2.11 PlyA -  49133  49128    6                               1.05
 2.10 Term -  50744  50709   36  2  0  125   48    14 0.252  -1.56
 2.09 Intr -  52963  52709  255  1  0  125   97   205 0.917  22.74
 2.08 Intr -  54572  54508   65  0  2  103  109    80 0.994  10.04
 2.07 Intr -  55795  55664  132  2  0   89   20    66 0.534   0.52
 2.06 Intr -  56886  56646  241  0  1  140   85    30 0.885   5.02
 2.05 Intr -  57543  57482   62  1  2   67   78    66 0.938   1.95
 2.04 Intr -  58767  58636  132  1  0  112   24    67 0.886   3.32
 2.03 Intr -  60106  59863  244  1  1  109   89   110 0.701  10.27
 2.02 Intr -  62067  61910  158  1  2  113   24   153 0.635  11.13
 2.01 Init -  64043  63434  610  2  1   68   72   342 0.931  26.09
 2.00 Prom -  70273  70234   40                              -2.26

 3.00 Prom +  73100  73139   40                              -3.76
 3.01 Init +  96531  96633  103  2  1   60   68    96 0.110   3.20
 3.02 Intr +  96660  96784  125  1  2   43  113   123 0.133  10.60
 3.03 Term +  98036  98152  117  1  0   12   48    90 0.048  -4.06
 3.04 PlyA +  99530  99535    6                               1.05

 4.11 PlyA -  99567  99562    6                               1.05
 4.10 Term - 100115  99998  118  1  1   22   55   108 0.468  -1.09
 4.09 Intr - 100328 100228  101  2  2   84  109   121 0.987  12.71
 4.08 Intr - 102424 102282  143  2  2   79  105   236 0.999  24.47
 4.07 Intr - 104913 104683  231  1  0   93  105   201 0.981  20.04
 4.06 Intr - 106931 106806  126  0  0  107  100   121 0.994  15.85
 4.05 Intr - 109388 109224  165  0  0   61   71   181 0.867  13.63
 4.04 Intr - 119022 118887  136  0  1  115   94    32 0.274   6.64
 4.03 Intr - 120165 119928  238  2  1   76  -32   108 0.226  -4.78
 4.02 Intr - 120707 120588  120  1  0  103   48   101 0.926   7.21
 4.01 Init - 121478 121411   68  2  2   90   47    94 0.982   4.10
 4.00 Prom - 123240 123201   40                              -5.46

 5.04 PlyA - 123934 123929    6                               1.05
 5.03 Term - 129891 129585  307  2  1   79   55   117 0.700   2.09
 5.02 Intr - 133937 133772  166  0  1  100   91    85 0.403   9.02
 5.01 Init - 159318 159267   52  0  1   69  110    18 0.280   3.42
 5.00 Prom - 161972 161933   40                              -3.16

 6.00 Prom + 164530 164569   40                              -3.56
 6.01 Init + 175948 176063  116  0  2   62  102    78 0.398   6.28
 6.02 Intr + 182216 182367  152  2  2   66   73    60 0.212   2.11
 6.03 Intr + 208142 208415  274  0  1   78   88   155 0.655  11.10
 6.04 Intr + 212474 212576  103  2  1   44   70   162 0.916  10.18
 6.05 Intr + 212944 213064  121  0  1   94   80    64 0.999   6.27
 6.06 Intr + 214915 215112  198  2  0  111   19   269 0.985  21.62
 6.07 Intr + 218067 218177  111  1  0  111   93   103 0.997  13.45
 6.08 Term + 218353 218438   86  2  2   88   50    59 0.975  -0.08
 6.09 PlyA + 218535 218540    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_1|559_aa
XKSEAEEMEEQVFKGDPDTPHSISFSGSGFLSFYQAGAVDALRDLAPRMLETAHRFAGTS
AGAVIAALAICGIEMGEACVLGPLGSLLGDSTETEGASLSEKNCYTANPERETLSRTADA
SSSPKEQAQQPNEYLRVLNVGVAEVKKSFLGPLSPSCKMVQMMRQFLYRVLPEDSYKVTT
GKLHALYCSCFVPVYCGLIPPTYRGVRYIDGGFTGMQPCAFWTDAITISTFSGQQDICPR
DCPAIFHDFRMFNCSFQFSLENIARMTHALFPPDLVILHDYYYRGYEDAVLYLRRLSKYR
WGPSLPPATTIPLSVLMILTTLDAVYLNSSSKRVIFPRVEVYCQIELALGNECPERSQPS
LRARQASLEGATQPHKEWVPKGDGRGSHAVALLVSSKPKSAVPLVHVKETVSKPYVTESP
AEDSNWVNKVFKKNKQKTSGTRKGFPRHSGSKKPSSKVQEPGGGVEDEQKLVRLRIRVWE
GIPGTRDGPEQGNSLCKQLTAVARMEVAERPGSELGDVVEGLLPAHSTSLCSPLLRQFGS
PTDLIPAGSRNAALKCGIP

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_1|1680_bp
nnaaagtcagaggccgaggagatggaagaacaggtgttcaagggggacccggacacccct
cactccatctccttctcgggcagtggattcctctccttctaccaggcgggggctgtggac
gccctgcgggacctggccccccggatgctggaaacagcccaccgctttgcggggacatcg
gcaggtgctgtgatcgccgccctggccatctgcgggattgaaatgggtgaggcctgtgtt
ctgggtcccctgggaagtctcttgggggattccacagagacagaaggagcgagcctcagc
gagaagaactgctacactgctaacccagagagagaaacactgtcccggacagcagatgca
tctagcagccccaaggaacaagcccaacagcctaatgagtatctcagagtcctcaacgtg
ggtgtggccgaggtgaagaaatccttcctggggcccttgtccccgtcctgtaagatggtg
cagatgatgaggcagtttctgtaccgggtcctgcccgaggactcctacaaggtcaccacg
gggaagctccatgccctatactgcagctgcttcgtcccggtgtactgtggcctcatcccc
ccgacttaccgcggtgtgaggtacatcgatgggggcttcacgggcatgcagccctgtgcc
ttctggaccgacgccatcaccatctccaccttcagtgggcagcaggacatctgtccccgg
gactgcccggccatcttccacgacttccgcatgttcaactgctccttccagttctccctg
gagaacatcgccaggatgacccacgcattgttccccccggacctggtgatcctgcacgat
tactactaccgagggtacgaggatgcagttttgtacttgaggcggctgagtaagtaccgg
tggggccccagtttgcccccagcaaccaccatcccactttctgtgctgatgattttgact
acactagatgctgtttatcttaattcttcctccaagagagtgattttcccccgggtggaa
gtgtactgccagatagaactcgcccttggcaatgagtgccctgaacgcagtcaaccaagc
cttcgagcacggcaggccagtctggaaggagccacacaacctcacaaggagtgggttccc
aaaggggatggaaggggcagccatgctgtagctcttcttgtctcttcaaaaccaaaaagc
gccgtgcctctggttcatgtgaaggaaaccgtcagcaagccttatgtaacggagagccct
gctgaagactcaaactgggtgaataaggtcttcaagaagaacaagcaaaagacaagtggc
accagaaaaggcttcccaagacattcgggatccaaaaaaccaagcagcaaagtgcaggaa
cctggtggaggtgtggaagatgagcagaagttagtcagactgaggataagggtgtgggag
ggtattccaggcacacgggatggcccagaacaaggaaatagtctatgcaagcagttaact
gcagttgcaaggatggaggtagcggagaggcctggaagtgaacttggagatgtggtggaa
ggtctgctccctgcccactcaacttccctctgctctccacttctgagacagtttgggtca
cctacagacctcatcccagccggatccaggaatgctgccctgaagtgtggaattccttag

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_2|644_aa
MENPRCPRRPLAEKKARSLDRPQAPGKGSESWDCHWLSLPTAPSRKALHWTTSDWARHSD
SPAPSAEAHCTTAAAPTPEETGDFLPSEQRPSQDTKKGWLKTMLNFFVRTGPEEPREKAS
RRPRGKEGISQHPEPLEAAGEPALRKKAHHDKKPSRKKQGHKKHAAEVTKAAQDQEARGR
EEGLSKAAAALRSGEADLGPARRGGEDSDHQSFLIKVDGTGALDVSPHATGHQQEEELKK
PDRESESLVTFLPVAEQSLASQLGVALPNPAPAVRKKSQEKKTSLKRTSKTNPKKHGSEE
AKRGAADVSSPEAWPPKKSSFLPLCVSGHRPSISSSYGLEEPKVQEAPSTEAGAPGPSVL
PTPSESQEPGEELPLDRASEYKEFIQKIISMLQDAEEQQGEEQPQVQQEEVGVENPAPHC
RRKSQEKRSSFRRAFYHKKHTSKEPRRAGAAGAASPEARRPKRPSFLPLCVGGHRPSTSS
SLDPEDLECREPLPAEGEPVVISEAPSQARGHTPEGAPQLSGACESKEIIIQKLVALLQE
VDGQLGQQIRRHPSFKRFFYEFSDSSLSKLVATLRSQVAHSSKLDRNRARRLYQFDVSLA
NKFAGSNSHAMCILMGLRDHYNCTQFPYREDQPNITSPKVESPD

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_2|1935_bp
atggagaatccaaggtgcccaaggaggcccctggcggagaagaaagccaggtctctggac
aggccgcaggcccccgggaaaggctcggagtcgtgggactgccattggctctccctgccc
actgccccctccaggaaggcgcttcactggacgaccagtgattgggccagacattcagac
agcccagctccatctgcagaggctcactgcaccaccgctgcagcccccactcccgaggag
accggagattttctccccagcgagcagaggccttcgcaagacaccaagaaggggtggctg
aagaccatgctgaacttcttcgtgaggacgggccctgaggagcccagagaaaaggccagc
aggaggccaagggggaaggagggtatctcccagcatccggagcccctggaagcagcaggg
gagccagccctcaggaagaaagcccaccacgacaagaagcccagccgcaagaagcaaggt
cacaagaaacacgcggccgaggtgactaaggcagctcaagaccaggaggccagaggccga
gaggaagggttgtccaaggcagctgctgccttgcgctccggggaggctgacctgggccca
gctcgcaggggtggggaagattctgatcaccagtccttcctcatcaaagtggatggtact
ggagctttggatgtttctccccatgccacaggtcatcagcaagaagaggagctcaaaaag
cctgaccgtgagtcggaatctctagtgacctttttgcctgtggctgagcaatctctggcc
tcacagctgggggtggccctgccaaacccagcaccagctgttaggaagaaatcccaagag
aaaaagacaagcctcaagagaacctcaaagacaaaccccaagaaacacggctccgaggag
gccaagaggggggctgcagatgtttccagtccagaggcctggccacccaagaagtccagc
tttctgcccctgtgtgtcagcggccatcggccttccatctccagcagctatggcttggaa
gaacctaaagtccaggaggccccatctacagaggctggggctccaggtccctccgtgctt
cccaccccatcagagagccaggaacctggagaggagcttccgctggacagagcctcggaa
tacaaagaattcattcagaagatcatttccatgctccaagatgcagaagaacagcaagga
gaggagcaacctcaagtccagcaggaagaggtgggtgtagaaaacccggccccacactgc
agaaggaaatctcaagaaaaaaggtcgagcttcaggagagcgttttaccataagaaacac
acctccaaggaacccagaagagcgggggcagcaggggctgccagcccagaggcccgacga
cccaagaggcccagctttctgcccctgtgtgttggtggccatcggccctccacctccagc
agccttgatccagaagatctcgagtgccgggagcccctgcccgcagaaggggagccagtt
gtgatctcagaagcaccctcccaggctagaggccacacgccagaaggggcacctcagctg
agtggagcatgtgaatctaaggagatcatcatccagaagcttgtggcacttctccaagaa
gtggatggccaactggggcagcagatcaggcgccaccccagctttaagaggtttttttac
gagttctcggactcctccctcagcaagctggtagccaccctgcgcagccaggtggctcac
tcctctaagctggacaggaaccgcgccaggagactctaccagtttgacgttagcttagct
aacaaatttgctggcagcaacagccatgccatgtgcatcctcatgggcctaagagaccac
tacaattgcacccagttcccatacagggaggaccagccgaacatcacaagtcctaaggtt
gaaagtccagattga

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_3|114_aa
MKPRTLAVSLTALQVAHLEFVLSDVPTCSEFLPSGVKLQTFAVSVTALKAPRLESFVPPR
GLMGSLAPGVKLQTFTCSGGMKGSSSAAKVGAQAEEVPRASEGCEDCQHAVISQ

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_3|345_bp
atgaagccgcggaccctcgcggtgagtcttacagctcttcaggtggcgcatctggagttt
gttctttctgatgttccgacgtgttcagagtttcttccttctggagtgaagctgcagacc
ttcgcggtgagtgttacagcccttaaggcaccgcgtctggagtcgttcgttcctccccgt
gggctcatgggctcgctggctccaggagtgaagctgcagaccttcacgtgcagcggtggg
atgaagggctcctcaagtgccgccaaagtgggagcccaggcagaggaggtgccgagagcg
agcgagggctgtgaggactgccagcacgctgtcatctctcaataa

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_4|481_aa
MRGPFLSYVRRLTGSFLVLGSGPRGSPPLTLSVSYTALKASRTAPAGPFRSPLTTGRPYT
NQMRQPLRSDYLGNSTKGGPAGNEVRTTRKGVESPFAAAKASGVCFAGSDPNSLSADISP
RIPRDECKAFDNPRVKNDLMELEGELAISPISPVAAMPPLGTHVQARCEAQINLLGEGGI
CKLPGRLRIQPALWSREDVLHWLRWAEQEYSLPCTAEHGFEMNGRALCILTKDDFRHRAP
SSGDVLYELLQYIKTQRRALVCGPFFGGIFRLKTPTQHSPVPPEEVTGPSQMDTRRGHLL
QPPDPGLTSNFGHLDDPGLARWTPGKEESLNLCHCAELGCRTQGVCSFPAMPQAPIDGRI
ADCRLLWDYVYQLLLDTRYEPYIKWEDKDAKIFRVVDPNGLARLWGNHKNRVNMTYEKMS
RALRHYYKLNIIKKEPGQKLLFRFLKTPGKMVQDKHSHLEPLESQEQDRIEFKDKRPEIS
P

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_4|1446_bp
atgcgaggccccttcctatcctatgtgcggagactcacagggtccttcctggtcctgggc
tcgggaccacgcgggtcaccgcccctgactctctcggtgtcatacacagctctgaaggca
tcacgcacagcccctgcgggccctttccgcagcccccttaccactggtcgcccctacacc
aaccagatgaggcaacccctgaggtctgattacttgggaaacagcaccaaagggggaccg
gctggaaatgaagtgagaacgacccggaaaggtgttgaatcccctttcgcagcagccaag
gcttctggagtttgctttgctggctccgatccaaattctctctcggcagatatctctcct
cggatcccaagagatgaatgtaaagcttttgacaatcctagagtgaaaaatgacttgatg
gagctggagggagaattggctatttctcctataagccctgtggcagccatgcctccccta
ggcacccacgtgcaagccagatgtgaagctcaaattaacctgctgggtgaaggggggatc
tgcaagctgccaggaagactccgcatccagcccgcactgtggagcagggaggacgtgctg
cactggctgcgctgggcagagcaggagtactctctgccatgcaccgcggagcacgggttc
gagatgaacggacgcgccctctgcatcctcaccaaggacgacttccggcaccgtgcgccc
agctcaggtgacgtcctgtatgagctgctccagtacatcaagacccagcggcgagccctg
gtgtgtgggcccttttttggagggatcttcaggctgaagacgcccacccagcactctcca
gtccccccggaagaggtgactggcccctctcagatggacacccgaaggggccacctgctg
cagccaccagacccagggcttaccagcaacttcggccacctggatgaccctggcctggca
aggtggacccctggcaaggaggagtccctcaacttatgtcactgtgcagagctcggctgc
aggacccagggggtctgttccttccccgcgatgccgcaggcccccattgacggcaggatc
gctgactgccgcctgctgtgggattacgtgtatcagctgctccttgatacccgatatgag
ccctacatcaagtgggaagacaaggacgccaagatcttccgagttgtggatccaaatggg
ctcgccagactctggggaaatcacaagaaccgggtgaacatgacctacgagaagatgtct
cgtgccctgcgccactattataagcttaatatcattaagaaggaaccggggcagaaactc
ctgttcagatttctaaagactccgggaaagatggtccaggacaagcacagccacctggag
ccgctggagagccaggagcaggacagaatagagttcaaggacaagaggccagaaatctct
ccgtga

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_5|174_aa
MTQLVSSQPVPAMSRNPDHNLLSQPKEHSIVQKHHQEEIIHKLAMQLRHIGDNIDHRMVR
EVRLQFPQPGPVRNQSASGAMRDSAPTPAQRLEQALGAERGQAVAADTPEPSGVGVGCFL
GPLRVQAAEMPGSCNWEGGHSGTQGAPAPTQKGLGSYWLHGACGPSCTFLLQPA

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_5|525_bp
atgacccaacttgtcagctcccagccagttccagccatgtcaaggaacccagatcataat
ctactttctcagcccaaggagcatagcattgttcagaagcatcaccaggaggaaataatt
cacaagttggccatgcagctgagacacattggggacaacattgatcataggatggttcga
gaggtgaggcttcagtttccccagccagggcctgtcaggaatcaatctgcctccggcgcc
atgcgggactcagccccaacccctgctcagagattggagcaggctctgggagcagagaga
ggccaggcagtggcagcagacacccctgagccttcaggggtaggggtagggtgcttcctg
gggcccctgagggtgcaagctgcagagatgcctgggtcctgcaactgggagggtggccac
agtggcacccagggagctcctgccccaactcagaaggggctgggctcctactggctccat
ggagcatgtggccccagctgcaccttcctgctgcagccagcatga

>gi568815592r:36266648_36485671|GENSCAN_predicted_peptide_6|386_aa
MYIQGSDFRANAFKQLQYSYYATSNLAPRMGRSGFHIFGGFVVLLTSGVKPQTFSVSVTA
LKGGVSGVVCSSRWVRGLADFRSEAADLCNLSLDYASQPANLQFPHIMPLAEDIKGSCFQ
SGNKRNHEPFIAPERFGNSSVGFGSNSHSQAPEKVTLLVDGTRFVVNPQIFTAHPDTMLG
RMFGPGREYNFTRPNEKGEYEIAEGISATVFRTVLDYYKTGIINCPDGISIPDLRDTCDY
LCINFDFNTIRCQDLSALLHELSNDGAHKQFDHYLEELILPIMVGCAKKGERECHIVVLT
DEDSVDWDEDHPPPMGEEYSQILYSSKLYRFFKYIENRDVAKTVLKERGLKNIRIGIEVE
NGSKQPQKQVRVYYFRIKSSKGKGQW

>gi568815592r:36266648_36485671|GENSCAN_predicted_CDS_6|1161_bp
atgtatatccagggtagtgactttcgagcaaatgcgtttaagcaactccagtattcttac
tacgctacctccaacctcgctccacgcatggggagatcagggttccacattttcggtggg
ttcgtggtcttgctgacttcaggagtgaagccgcagaccttctcagtgagtgttacagct
cttaaaggtggcgtgtctggagttgtttgttcctcccggtgggttcgtggtcttgctgac
ttcaggagtgaagctgcagacctttgcaacctctcacttgactatgcctctcagccagca
aatcttcagttccctcacataatgccccttgctgaagacatcaaaggttcttgcttccaa
agtgggaataaacggaaccatgaaccttttattgctccagaaagatttggaaacagtagt
gtgggctttggcagtaattcccattcccaagcaccagagaaagtgacgcttcttgtagat
ggcacacgttttgttgtgaatccacagattttcactgctcatccggataccatgctggga
aggatgtttggaccaggaagagagtacaacttcactcggcccaatgagaagggagagtat
gagattgctgaaggcatcagtgcaactgtatttcgcacagtgctggattattacaaaacc
ggtatcatcaattgtcctgatggcatctctatcccagatcttagagatacttgtgattat
ctctgcattaattttgacttcaacactatccgatgtcaagatctgagtgctttactccat
gaactgtctaatgacggtgctcataagcagtttgatcactacctcgaagagctcatcttg
cccatcatggtgggctgtgccaagaaaggagaacgagagtgccacattgttgtgctgacg
gatgaggattctgtggactgggatgaagaccaccctccaccaatgggggaggaatattcc
caaattctttatagctccaagctctacagattcttcaaatatattgagaatagggatgtt
gcaaaaacagtgttaaaggaacggggcctaaaaaacattcgcattggaattgaagttgaa
aatggttcaaagcagccacagaaacaagttcgagtttactacttcaggataaagagcagc
aagggcaaaggacagtggtag