GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:26:17 Sequence gi568815595r:123869158_124080481 : 211324 bp : 40.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1798 1952 155 2 2 113 95 51 0.577 7.17 1.02 Intr + 3726 3738 13 2 1 123 49 4 0.061 -6.56 1.03 Intr + 9223 9327 105 2 0 86 97 72 0.275 7.17 1.04 Term + 13455 13816 362 1 2 17 49 216 0.350 4.31 1.05 PlyA + 16045 16050 6 1.05 2.04 PlyA - 16694 16689 6 1.05 2.03 Term - 20790 20377 414 0 0 73 53 247 0.829 13.98 2.02 Intr - 28456 28354 103 0 1 90 59 84 0.062 4.96 2.01 Init - 40174 40050 125 1 2 94 83 70 0.579 6.79 2.00 Prom - 45147 45108 40 -6.05 3.10 PlyA - 45497 45492 6 1.05 3.09 Term - 46561 45622 940 0 1 88 38 550 0.321 40.58 3.08 Intr - 62077 61945 133 2 1 51 98 137 0.878 9.78 3.07 Intr - 62369 62151 219 0 0 80 84 82 0.896 4.35 3.06 Intr - 64598 64516 83 1 2 75 51 59 0.562 -0.74 3.05 Intr - 75833 75692 142 0 1 72 116 95 0.956 9.19 3.04 Intr - 78162 77646 517 0 1 100 71 341 0.993 25.20 3.03 Intr - 79975 79707 269 2 2 34 84 191 0.728 9.63 3.02 Intr - 84980 84863 118 2 1 89 49 58 0.497 1.12 3.01 Init - 92310 91987 324 0 0 89 94 165 0.703 12.92 3.00 Prom - 98606 98567 40 -5.65 4.06 PlyA - 99392 99387 6 1.05 4.05 Term - 100064 99998 67 1 1 87 42 53 0.364 -2.97 4.04 Intr - 101060 100885 176 2 2 62 84 109 0.930 5.72 4.03 Intr - 106383 106222 162 0 0 121 81 204 0.992 22.25 4.02 Intr - 107824 107707 118 0 1 85 110 156 0.992 17.05 4.01 Init - 111324 111209 116 0 2 38 53 160 0.494 7.23 4.00 Prom - 115454 115415 40 -2.85 5.10 PlyA - 116173 116168 6 1.05 5.09 Term - 117475 117375 101 2 2 94 47 122 0.825 6.01 5.08 Intr - 120343 120241 103 1 1 64 63 63 0.273 0.23 5.07 Intr - 123170 122860 311 0 2 61 39 147 0.237 2.21 5.06 Intr - 124596 124547 50 2 2 92 69 44 0.090 0.31 5.05 Intr - 134417 134320 98 2 2 44 67 115 0.243 2.89 5.04 Intr - 145145 144936 210 2 0 53 68 169 0.560 9.59 5.03 Intr - 150234 150108 127 2 1 62 74 47 0.290 0.46 5.02 Intr - 150410 150335 76 0 1 43 98 39 0.222 -1.95 5.01 Init - 157797 157653 145 0 1 63 42 105 0.184 3.73 5.00 Prom - 158598 158559 40 -6.05 6.00 Prom + 161776 161815 40 -4.95 6.01 Init + 164584 164656 73 0 1 76 115 167 0.782 19.48 6.02 Intr + 165044 165190 147 0 0 81 78 89 0.693 6.49 6.03 Intr + 165585 165658 74 1 2 63 100 -8 0.015 -3.89 6.04 Intr + 175023 175176 154 2 1 86 81 121 0.356 9.92 6.05 Intr + 187921 188099 179 2 2 93 80 26 0.007 1.02 6.06 Intr + 200102 200308 207 1 0 61 56 228 0.426 15.25 6.07 Intr + 206774 207010 237 1 0 -2 62 229 0.061 8.19 6.08 Term + 208350 208520 171 2 0 44 44 94 0.054 -2.56 6.09 PlyA + 208824 208829 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_1|211_aa XHIGRQMSIRVLDCQGGKTRPLLPMKETVHGVTVRLLPIGPTPKQKPQGSLLMLGQASDD IPCSLTPGKGNSTRYPGALKQTDGSDHRPDKVNNETDLHILAKKALVHVPRVPATQSSIQ SPTTPDPGPTAQVPPRPKAPGPLRVGEKGVTSLWAGRKIPCCPRALWCQQHHPPQSHSVQ AGKRLRESRKREGGQEFKRKENSSIRTLNAL >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_1|636_bp naccatattgggaggcagatgagcattagggtcctagactgtcagggtgggaagaccagg ccactgcttcccatgaaagaaacagtccatggtgtgaccgttcgcctccttcccattggt cccacccccaagcagaagcctcagggctccctgctgatgttagggcaggcctcagatgac attccatgtagtctgaccccaggaaaggggaacagcactcggtatccaggggccctgaag cagacagatggcagtgatcacaggccagataaagtgaacaatgagacagacctgcacatc ttggcaaaaaaagctcttgtgcatgtccccagagtacccgctacccaatccagcatccag agtccaaccactccggacccaggaccaactgctcaggttcctcccagacccaaggctcct ggcccactgagagtgggagaaaaaggggtaacctcactctgggcaggaaggaaaattcca tgctgccctagagcactgtggtgccagcagcatcatcctccacagtcccacagtgtgcag gcggggaagagactaagagaatcacggaagagagaaggaggtcaagaatttaagagaaaa gaaaacagttctattagaaccctaaatgccttgtag >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_2|213_aa MEKNTNYPRGKLSPFWSKRLTRLFKSITRNVVSRSVGSTRPRLFKTVDLNVLQSLTQKPN TVHELRKSRTALNKEESHVYLYQQLENGLIHEPPPRFQRMYGNAWVSRQKSAAGAEALWR TSTRFVWRGNVELEPPHRVPTVALPSGAVRSRPPSFRPQNGRSTNSLHHAPGKAAGTQCQ PVKVATGAIPCRPTGVELPKDLEEHPCISMPWM >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_2|642_bp atggagaagaacaccaactatccaagaggaaaactcagtcccttttggagcaagagactt acacgtttgttcaaatctataacaagaaatgttgtcagccgctcagtagggagcacacgg ccaagactttttaaaactgtggacttaaatgttctgcaaagtcttacacaaaagcccaat acagtacatgaactacggaaaagcagaactgctctgaacaaggaagagtctcacgtatat ctttatcaacagcttgaaaatggactaattcatgaacctccacctagatttcagaggatg tatggaaatgcctgggtgtccaggcagaagtctgctgcaggggcagaggccttatggaga acctccactaggtttgtgtggaggggaaatgtggagttggagcccccacacagagtcccc actgtggcactgcctagtggagctgtgagaagcaggccaccatccttcagaccccagaat ggtagatccaccaacagtttgcaccatgcacctggaaaagctgcaggcactcaatgccag cctgtgaaagtagccacaggggctataccctgcagacccactggggtggagctgcccaag gacttggaagagcatccttgcatcagcatgccctggatgtga >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_3|914_aa MPWPRPVRRLLASPNPCRWVTSREAWRGLLRTPKALGLEQEKGSGGASETMKRGIRRDPF RKRKLGGRAKKVREPTAVNSFYREASLPSVWASLRRREMVRSGARPGQAGYFLSRAPFLK ELLFAIYVTGSGCHGINLLDPDLTASVASSDNKKQIPNEASARSERDTSDLEQNWSLQDH YRMYSPIIYQALCEHVQTQMSLMNDLTSKNIPNGIPAVPCHAPSHSGEYECCLKIVSEVQ TDGNSQFASQGKTVSATCTDVLRNSFNTSPGVPCSLPKTDISAIPTLQQLGLVNGILPQQ GIHKETDLLKCIQTYLSLFRSHGKETHLDSQTHRSPTQSQPAFLATNEEKCAREQIREAT SERKDLNIHVRDTKTVKDVQKAKNVNKTAEKVRIIKYLLGELKALVAEQEDSEIQRLITE MEACISVLPTVSGNTDIQVEIALAMQPLRSENAQLRRQLRILNQQLREQQKTQKPSGAVD CNLELFSLQSLNMSLQNQLEESLKSQELLQSKNEELLKVIENQKDENKKFSSIFKDKDQT ILENKQQYDIEITRIKIELEEALVNVKSSQFKLETAEKENQILGITLRQRDAEVTRLREL TRTLQTSMAKLLSDLSVDSARCKPGNNLTKSLLNIHDKQLQHDPAPAHTSIMSYLNKLET NYSFTHSEPLSTIKNEETIEPDKTYENVLSSRGPQNSNTRGMEEASAPGIISALSKQDSD EGSETMALIEDEHNLDNTIYIPFARSTPEKKSPLSKRLSPQPQIRAATTQLVSNSGLAVS GKENKLCTPVICSSSTKEAEDAPEKLSRASDMKDTQLLKKIKEAIGKIPAATKEPEEQTA CHGPSGCLSNSLQVKGNTVCDGSVFTSDLMSDWSISSFSTFTSRDEQDFRNGLAALDANI ARLQKSLRTGLLEK >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_3|2745_bp atgccttggccccgccccgttcgccgtttattggcttctcccaacccctgccggtgggtg acaagccgggaagcttggaggggtctattgcgtacgcccaaggcgctgggcctggagcag gagaaagggagtggaggcgcgtcggagactatgaagcgcggcattcggcgggatcctttc cggaagcggaagctcggcgggcgggccaagaaggtccgggagcccacggcggttaattct ttttaccgtgaggcttcacttccctcggtctgggcttctctgaggcggcgagagatggtc aggtctggagctcgaccgggccaggcaggttatttcttatccagggcacctttcctcaag gaactattatttgcaatatatgtgactggcagtgggtgtcatggtatcaacttactggat cctgacttaactgcctctgtcgcatcttcagataataagaaacagatacctaatgaagct tctgctagaagtgaaagagacacatcagacctagagcaaaactggtcattgcaagatcat tatagaatgtattcacccataatataccaagccctctgtgagcacgtgcagactcagatg tcactgatgaatgacttgacttcaaagaacatccctaatggaattcctgctgtaccatgc catgctccctctcattctggtgagtacgaatgctgtttaaaaatcgtgagtgaagttcaa actgatggcaacagtcagtttgcatcacaaggtaaaacagtttctgcaacctgtactgat gttctacggaattcatttaataccagtcctggagttccatgtagcctgcccaaaactgac atatcagctattccaacattgcagcaactgggccttgttaatggaattctgccacaacaa ggaattcataaggaaacagacctactaaaatgtattcaaacatatttgtctctttttcga tctcatggaaaagaaacgcatctggacagtcagacacaccgaagccctactcagtcacaa ccagctttcttggccactaatgaagaaaaatgtgccagagagcaaattagagaggccaca agtgaaagaaaggatttaaacatacatgtgcgagatacaaaaacagtgaaggatgtacag aaggcaaaaaatgtgaacaagacagctgaaaaagttagaattataaaatatttgttggga gagctcaaggccctggtagcagaacaagaggattcagaaattcagaggttgattacagaa atggaggcatgtatatctgtacttccaacagtaagtggaaacacagatattcaagttgag atagcactggccatgcaaccattaagaagtgagaatgctcagttacgaaggcagttgaga attttgaaccagcaactcagagaacaacagaaaactcaaaaaccatctggtgctgtggat tgcaaccttgaattgttttctcttcagtcattgaatatgtcactgcaaaatcaattggag gagtcactaaagagccaggaattactgcagagtaaaaatgaagagctgttaaaagtgatt gaaaatcagaaagatgaaaacaaaaaatttagtagtatatttaaagacaaagatcaaact atacttgaaaataaacagcaatatgatattgagataacaagaataaaaattgaattggag gaagccctagtcaatgtgaaaagctcccagtttaagttagaaactgctgaaaaggaaaac cagatattggggataacattacgtcagcgtgatgctgaggtgactcgactaagagaatta accagaactttacagactagcatggcaaagcttctctccgatcttagtgtggacagtgct cgctgcaagcctgggaataaccttaccaaatcactcttgaacattcatgataaacaactt caacatgacccagctcctgctcacacttccataatgagctatctaaataagttagaaaca aattacagttttacacattcagagccactttctacaattaaaaatgaggaaaccatagag ccagacaaaacctatgaaaatgttctgtcctccagaggccctcaaaatagtaacactagg ggcatggaggaagcatctgcacctggaattatttctgccctttcaaaacaggattctgat gaagggagtgaaactatggctttaatagaagatgagcataatttggataatacaatttac attccttttgctagaagcactcctgaaaagaaatcaccactttctaagagactatcccct cagccacaaataagagcagctacaacacagctagtcagcaacagcggacttgctgtctct ggaaaagaaaataaactgtgtacacctgtaatctgttcctcttcaacaaaggaagcagaa gatgcacctgaaaaactttccagagcatctgatatgaaggacacacagctcctcaagaaa ataaaggaagcaattggtaagatccctgctgccaccaaggagccagaggaacaaactgca tgtcatggcccatcaggttgtcttagcaacagccttcaagtgaaaggcaatactgtctgt gatggtagtgttttcacttctgacttgatgtctgactggagcatctcttcgttttcaacg ttcacttctcgtgatgaacaagacttcagaaatggccttgcggcattagatgccaacata gctagactccagaagtctttaaggactggtcttctggagaaatga >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_4|212_aa MAQTDKPTCIPPELPKMLKEFAKAAIRVQPQDLIQWAADYFEALSRGETPPVRERSERVA LCNRAELTPELLKILHSQVAGRLIIRAEELAQMWKVVNLPTDLFNSVMNVGRFTEEIEWL KFLALACSALGVTITKTLKIVCEVLSCDHNGGSPRIPFSTFQFLYTYIAKVDGEISASHV SRMLNYMEQEVIGPDGIITVNDFTQNPRVQLE >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_4|639_bp atggctcagacagataagccaacatgcatcccgccggagctgccgaagatgctgaaggag tttgccaaagccgccattagggtgcagccgcaggacctcatccagtgggcagccgattat tttgaggccctgtcccgtggagagacgcctccggtgagagagcggtctgagcgagtcgct ttgtgtaaccgggcagagctaacacctgagctgttaaagatcctgcattctcaggttgct ggcagactgatcatccgtgcagaggagctggcccagatgtggaaagtggtgaatctccca acagatctgtttaatagtgtgatgaatgtgggtcgcttcacggaggagatcgagtggctg aagtttttagcccttgcttgcagcgctctgggagttactattaccaaaactctcaagata gtgtgtgaggtcttatcatgtgaccataatggtgggtcgccccggatcccgttcagcacc ttccagtttctctacacgtatattgccaaagtggatggggagatctctgcatcacatgtc agcaggatgctaaactacatggaacaggaagtaattggccctgatggtataatcacagtg aatgactttacccaaaaccccagggttcagctggagtaa >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_5|406_aa MRKMQTKSSMRCDDLLECLKLKRLNISSAGEDEERMQKRNSLTLRMQNGANTINLHSTVG CVCGGKKKNFGTGGSADMLHLQVLSPLCDTNFDLLVGLQLLWKGPGNVNFSWGNVEAHDP RGFQVKFSSTKMQILADGRCLEDTEMARQPEGYVFSVSHTSSIFPDGKNQIGRKLNVPLL TEQLSQASSTTGRTPFPIRDEMTPHYRFKNTFEDIRLERSCPVGSGHPQTIQDSPAQSLT LQAVWSFHHQQSPKQLSASPSSALAGPWEPAAAGEAGRLCDVAPGATPGDLAAEPAQGLP GEGGALTSNANGGRAPPLLSPPAAAALPAEVLLLSFLRRSFQCPVLTKHILMTADKGKNV KGPSSIFVEEAIQDRALEGTEPASTTPANTFPRANTVQRTADPPPP >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_5|1221_bp atgaggaaaatgcaaactaaaagctcaatgagatgtgatgatttattggaatgtctaaaa ttaaaacgcctgaacatatcaagtgctggtgaggatgaggagaggatgcaaaaacggaac tctcttacacttcgaatgcaaaatggtgctaacaccataaatctccatagtacagttggc tgtgtgtgtggagggaagaagaagaactttggaacaggaggatcagcagacatgctccat cttcaggttctcagccctctttgtgatactaattttgacctgttggtgggactgcagctt ctttggaaaggaccaggaaatgtcaacttttcctgggggaatgtggaggcccatgaccct cgagggttccaggtaaagttctcaagcaccaagatgcaaatcctggcagatggaagatgt ctggaagacactgaaatggcaaggcagcctgagggatatgtattcagtgtttctcacact agctccatatttccagatgggaaaaaccaaattggaaggaagttaaacgtaccactactc actgagcaactgtcccaggcttcctcaaccacaggtagaacacccttccccatacgtgat gaaatgacacctcattacagattcaaaaatacctttgaagatattcgccttgaaagatct tgccctgtgggcagtggccatccccagaccatccaggattctccagcccagtccttaacc ctccaggccgtctggtccttccaccaccagcaaagcccaaaacagctttcagcgtcccct agcagcgccctagcgggaccctgggaacccgccgctgcaggagaggcggggcggctctgt gacgtcgcgcccggcgctacccctggtgacctggcagcggagcccgcgcagggtttgcca ggcgaaggcggagcgctaacgtctaacgctaacggcggtcgtgccccgccgctgctgtca cccccggccgctgctgccctccccgccgaggttctactgctctccttcttaagaaggtcc ttccagtgccctgtcctaacaaagcatatccttatgacagctgacaagggaaaaaatgtt aaaggacctagttcaatttttgtggaggaggcaatacaagacagagccttggagggtaca gaaccagcaagcaccacccctgccaacaccttccctcgtgctaacactgtgcagagaaca gcagatcctcctccaccctaa >gi568815595r:123869158_124080481|GENSCAN_predicted_peptide_6|413_aa MNPPEGAAEEGGAADSDVDAFFRTAGLPAAALGRHPSSPSGARGPDVHRGAAGTRSRAGC RLTGVGRKTSVSPASVCSSPSPHMSETVLSGIFVPRSPGLTQDEKPHFIKGSSVVSHDPP EGWGVGIWLLLPPKGGLWGTRLIYPRTQEEGVLTAIDLSLTTFGFVKSWWGGRSADCFFL WRLSLQDHILYFFLVPPTIPPPLLPLCLQSTFFLLDHPGGKGTHGNLGGGGGGGGGGGGG GGGGGGGNTEGTWLRCLRDWGSPRAESTRVPLCRAQPRDLHQGRPKVASLVSIPLSLRVA YVQASFHTPSRYWPDVLPAMRPESLQDRLREWAEVAAGETNFAGKFLRSPIGKLKEKGHF QIRILPVIQVHSLGSSLIVPALPLELLSSESPCQFYRHSPVQKFNDTTLVLLQ >gi568815595r:123869158_124080481|GENSCAN_predicted_CDS_6|1242_bp atgaacccccctgagggagcagcggaggaaggaggagcagcagactcggacgtggacgcc tttttccggacagcggggctcccggcggccgccctgggtcggcacccctcctcgccgtcg ggggcccggggccctgatgtgcacagaggggcggcggggacccggagccgcgccggctgc agactgacaggagtggggcggaagacatcggtctcgcctgcttctgtgtgtagctcccca tctccccacatgtctgaaacagttttgtctggaatctttgtcccaaggagtccagggctg acccaagatgaaaagccccatttcatcaaaggctcctcagtggtctcacacgaccctcca gagggctggggagtaggaatctggctgttgctgccaccaaagggtggattgtggggcacc aggcttatctaccctagaacacaagaagagggagttttaactgccattgatttatccctt accacatttggctttgtgaagagctggtggggaggacgatctgcagactgcttcttcctc tggagattaagccttcaggaccatatcctgtactttttcttagtgcctcccaccataccg cccccacttttgccactctgtcttcagtccacattctttctattggatcaccctggaggg aaaggaactcatggaaacctaggaggaggaggaggaggaggaggaggaggaggaggagga ggaggaggaggaggaggaggaaatactgagggaacctggttaagatgtctgagagactgg ggatctcccagagctgagtcgacacgtgtccctctatgtcgtgcacagcccagggatctg caccaaggacggcccaaggtcgccagccttgtgagtatacccctgtctctgcgtgtggcc tatgtgcaggcttccttccacacgccatctcgctactggcctgacgtgcttcctgcgatg aggccagaatctctgcaggacagactaagagaatgggctgaagtggcagcaggagaaact aattttgctggaaagtttctaaggagcccaattgggaagctcaaggagaagggccacttc caaatccgcatccttccagtcattcaggtgcatagcttgggatcatccttaattgtcccc gctcttcctctggaacttctcagctctgaatccccttgccagttctaccgccactctcca gttcagaagtttaatgatacaacattggtactattgcaatag