GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:52:33 Sequence gi568815589r:91309349_91510734 : 201386 bp : 40.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 2025 2020 6 1.05 1.07 Term - 2438 2325 114 2 0 96 32 90 0.060 1.79 1.06 Intr - 10309 10199 111 1 0 76 68 56 0.103 2.06 1.05 Intr - 18303 18242 62 1 2 103 70 91 0.040 6.33 1.04 Intr - 19212 19134 79 0 1 94 31 116 0.279 4.71 1.03 Intr - 51567 51425 143 1 2 60 86 98 0.611 5.95 1.02 Intr - 52417 52280 138 2 0 68 78 76 0.673 4.11 1.01 Init - 52541 52463 79 2 1 111 35 142 0.889 10.50 1.00 Prom - 61265 61226 40 -4.55 2.00 Prom + 61988 62027 40 -5.95 2.01 Init + 62030 62105 76 1 1 87 89 63 0.972 7.60 2.02 Intr + 84434 84526 93 0 0 28 71 103 0.006 1.62 2.03 Term + 90656 90930 275 0 2 68 32 192 0.790 6.25 2.04 PlyA + 91180 91185 6 1.05 3.00 Prom + 93910 93949 40 -6.75 3.01 Sngl + 96442 96723 282 0 0 83 47 184 0.732 7.05 3.02 PlyA + 99124 99129 6 1.05 4.02 PlyA - 99640 99635 6 1.05 4.01 Sngl - 101371 99998 1374 1 0 68 32 1494 0.970 137.65 4.00 Prom - 110723 110684 40 -7.45 5.04 PlyA - 111082 111077 6 1.05 5.03 Term - 113871 113482 390 0 0 11 42 200 0.595 1.70 5.02 Intr - 115554 115355 200 1 2 88 47 202 0.769 14.25 5.01 Init - 115703 115592 112 2 1 103 15 141 0.802 8.72 5.00 Prom - 117529 117490 40 -5.05 6.05 PlyA - 118221 118216 6 1.05 6.04 Term - 120496 120323 174 1 0 42 48 174 0.658 5.58 6.03 Intr - 122668 122555 114 1 0 72 94 99 0.731 8.62 6.02 Intr - 133673 133175 499 1 1 45 80 223 0.236 9.26 6.01 Init - 159266 158848 419 2 2 71 53 172 0.239 7.95 6.00 Prom - 159320 159281 40 -5.65 7.03 PlyA - 159437 159432 6 1.05 7.02 Term - 161106 160405 702 0 0 -3 32 262 0.105 4.03 7.01 Init - 161919 161335 585 0 0 44 20 265 0.091 10.54 7.00 Prom - 164847 164808 40 -7.75 8.02 PlyA - 165010 165005 6 1.05 8.01 Sngl - 169427 168846 582 2 0 46 47 349 0.453 22.53 8.00 Prom - 170896 170857 40 -7.05 9.00 Prom + 173518 173557 40 -6.05 9.01 Init + 178749 179152 404 2 2 47 78 294 0.976 20.45 9.02 Term + 179486 179597 112 2 1 50 47 136 0.864 2.75 9.03 PlyA + 179892 179897 6 -0.45 10.03 PlyA - 180498 180493 6 1.05 10.02 Term - 182611 182346 266 2 2 91 39 204 0.956 10.49 10.01 Init - 185817 185685 133 0 1 82 42 152 0.840 10.35 10.00 Prom - 192052 192013 40 -2.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 62777 62853 77 0 2 136 38 54 0.973 2.22 S.002 Init + 193581 193677 97 2 1 56 117 69 0.924 7.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_1|241_aa MAAAVAAAPGALGSLHAGGARLVAACSRRAGPAIWAQGWVPAAGGPAPKRGYSSEMKTED ELRVRHLEEENRVQGKAKRVVKGGIAVSTLNLRVKPEDQCFNFDMHLLSTYNLLSRSEKR EITEISELLLRATAFIPSLPDQFITQSQRVPGPCPATKRNEVRGHQRPRVPRGTSSAILP QCSCSTALAQLLRSYQEAADHQFQGFLVCTFLPVVRGGSVDITIGTHIVHISSIEYGEWT C >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_1|726_bp atggcggccgcggtggcggcggcacctggggccttgggatccctgcatgctggcggcgcc cgcctggtggccgcttgcagccggcgagcgggcccggcgatctgggcccagggctgggta cctgcggccgggggtcccgccccgaaaaggggctacagctctgagatgaagacggaggac gagctgcgggtgcggcacctggaggaggagaaccgagtgcagggaaaagctaaaagggtc gtgaagggtggaatagcagtttccacccttaacctccgggttaagccagaggaccagtgt tttaattttgacatgcatttgttgagcacctataatctgttaagcagaagtgaaaagcgt gagattacagagatttctgagctgctgcttcgtgctacggcttttataccctctttaccc gaccagtttataactcagtctcagcgagttccaggtccttgtcccgcgaccaagaggaat gaggtgcgtggacaccagagaccccgtgttcctagagggacgtcatccgccattttgcct cagtgctcatgctcaactgcacttgcccagctcctgagatcttatcaggaagctgctgat caccagtttcagggcttcctggtgtgcaccttccttccagtagttcgtggtggcagtgtt gacatcactattgggacccatattgtgcacattagcagtatagaatatggggagtggaca tgttaa >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_2|147_aa MDAAGGHYPKQTNSETENQILHAATFQSSRYLKLTIMAANRLSLHEEVVEPWGSPASDLN FPADLCERLFLGHRKSSQSTVNVSGIVQKPNVSGIAQKPRGPHFLGSWRSRSPAVPAMSP GKQFPSAKAEHSSEAASKSAVTCGLVT >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_2|444_bp atggatgcagctggaggccattatcctaagcaaactaactcagaaacagaaaaccaaata ctgcatgccgccactttccagtcaagtcgatacctaaaattaaccatcatggcggccaac cgactaagtctgcatgaggaggtagtagagccctggggctcaccggcaagtgacttaaat ttccctgccgacctctgcgaaaggctatttctgggacacaggaagagctctcaaagtaca gtgaatgttagtggcattgtgcagaagcccaatgttagtggcatcgcgcagaagcccaga ggaccacacttcctcgggagttggcgctcacggagtccagctgtgccagccatgtcccca gggaaacaattcccctctgcaaaggcagaacattcctctgaagcagcttccaaatctgct gtcacttgcggcctggtcacctag >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_3|93_aa MAMQLCSLPAGRMRMRALHRGAVGYQGAGHPNHPRPRPPTLPCAAPHCDWPGWWLWLLCP RPANAPEDHSGSPQGSPVQSLGRNPVGAFSGCD >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_3|282_bp atggctatgcagctgtgcagccttcccgcggggcgcatgcgcatgcgtgcactgcacagg ggcgctgtgggctaccagggcgcaggccaccccaaccacccccgcccccggccgccgact ctgccctgcgctgctccccactgcgattggcctggctggtggctgtggctgctgtgtcct cgcccagccaatgcccctgaggatcactcaggttccccacagggctctcctgttcagtca ctggggcggaacccagtgggagctttctcgggttgtgactga >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_4|457_aa MQTVKKEQASLDASSNVDKMMVLNSALTEVSEDSTTGEELLLSEGSVGKNKSSACRRKRE FIPDEKKDAMYWEKRRKNNEAAKRSREKRRLNDLVLENKLIALGEENATLKAELLSLKLK FGLISSTAYAQEIQKLSNSTAVYFQDYQTSKSNVSSFVDEHEPSMVSSSCISVIKHSPQS SLSDVSEVSSVEHTQESSVQGSCRSPENKFQIIKQEPMELESYTREPRDDRGSYTASIYQ NYMGNSFSGYSHSPPLLQVNRSSSNSPRTSETDDGVVGKSSDGEDEQQVPKGPIHSPVEL KHVHATVVKVPEVNSSALPHKLRIKAKAMQIKVEAFDNEFEATQKLSSPIDMTSKRHFEL EKHSAPSMVHSSLTPFSVQVTNIQDWSLKSEHWHQKELSGKTQNSFKTGVVEMKDSGYKV SDPENLYLKQGIANLSAEVVSLKRLIATQPISASDSG >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_4|1374_bp atgcagaccgtcaaaaaggagcaggcgtctcttgatgccagtagcaatgtggacaagatg atggtccttaattctgctttaacggaagtgtcagaagactccacaacaggtgaggagctg cttctcagtgaaggaagtgtggggaagaacaaatcttctgcatgtcggaggaaacgggaa ttcattcctgatgaaaagaaagatgctatgtattgggaaaaaaggcggaaaaataatgaa gctgccaaaagatctcgtgagaagcgtcgactgaatgacctggttttagagaacaaacta attgcactgggagaagaaaacgccactttaaaagctgagctgctttcactaaaattaaag tttggtttaattagctccacagcatatgctcaagagattcagaaactcagtaattctaca gctgtgtactttcaagattaccagacttccaaatccaatgtgagttcatttgtggacgag cacgaaccctcgatggtgtcaagtagttgtatttctgtcattaaacactctccacaaagc tcgctgtccgatgtttcagaagtgtcctcagtagaacacacgcaggagagctctgtgcag ggaagctgcagaagtcctgaaaacaagttccagattatcaagcaagagccgatggaatta gagagctacacaagggagccaagagatgaccgaggctcttacacagcgtccatctatcaa aactatatggggaattctttctctgggtactcacactctcccccactactgcaagtcaac cgatcctccagcaactccccgagaacgtcggaaactgatgatggtgtggtaggaaagtca tctgatggagaagacgagcaacaggtccccaagggccccatccattctccagttgaactc aagcatgtgcatgcaactgtggttaaagttccagaagtgaattcctctgccttgccacac aagctccggatcaaagccaaagccatgcagatcaaagtagaagcctttgataatgaattt gaggccacgcaaaaactttcctcacctattgacatgacatctaaaagacatttcgaactc gaaaagcatagtgccccaagtatggtacattcttctcttactcctttctcagtgcaagtg actaacattcaagattggtctctcaaatcggagcactggcatcaaaaagaactgagtggc aaaactcagaatagtttcaaaactggagttgttgaaatgaaagacagtggctacaaagtt tctgacccagagaacttgtatttgaagcaggggatagcaaacttatctgcagaggttgtc tcactcaagagacttatagccacacaaccaatctctgcttcagactctgggtaa >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_5|233_aa MAPFAKASTDHYPCIGKRRECATCLDSLARRLKKSRLAREFGREDYRIQLDAEITLSSYQ LKITGRDPARSSQAPAPNDSGRPSGGTRARTFTSSEERRTRVAGGGEEWGVEEWGIVLIC LVRKRVGSVNGDGIRRMPARRLFVSGAVSAQTFSTLFVVSVFYKPSGRLWDLSFHTLHNL PTVKCWQELRADVGICAYVCNDLVIEAQRWILDRYLRLYYVPNKQVLKRYYKT >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_5|702_bp atggctcccttcgccaaggccagcactgaccactatccgtgtatcggtaaacgcagggag tgtgccacgtgccttgacagtctagcgcgacgacttaagaagtcgcggctagcccgggaa tttgggagagaagactaccgtatccagttagacgcagaaataacactaagcagctatcag ctcaaaatcacaggaagagatcccgcgcggtcgtctcaagccccagcaccaaacgacagt gggcgtcccagcgggggcacccgagcgcggaccttcacatcctccgaggaacgccgcacc cgcgtggcgggggggggagaggaatggggggttgaggaatggggaatagtcttaatttgc ttggtgagaaaaagggttggtagcgtgaacggtgacggcattcggagaatgcctgcgcgg cgtttgtttgtgtcaggggctgtgtctgcacagacgttttccaccctgtttgtcgtcagc gtgttttacaagccctcgggaaggttgtgggatttgtcctttcatacattgcataatctg cccacggtgaaatgttggcaagaattgcgggctgatgtgggaatatgcgcgtatgtgtgc aatgatttggtaattgaggcgcagcgttggatattagatcgttatttacgtctatattac gtcccaaataaacaggttttaaaacgatactacaaaacctaa >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_6|401_aa MGKDFMTKTPRAMATKAKIDKWDLIKVKSFYTARETTIRVNRQPTEWEKIFAIYSSDKGL TSRIDKELKQIYKKKTNNPIKKWAKDVNRHFSKEDIYAAKRRMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIRKSGNNSEASLHLAYLPVVHIPHFSWMWDKNLDPLNGRTESAVTQTG LKHAPLLATFRMTRRREELWPFGELRPRGSQSQGCGTLFGSVWFLASPSFWAPPCPSRPD VGTHSRSHMWYIWSSCSLAGSQHLCWHLELPVLPKQPACLAVHNDQILHLLTHTPLTTLR LARPWQQQESIGNPGRPKLAAGGRRGKGVQDKACPVASKRGLGQHWALLPAIPGDCPRAG SFIILPVCGLTKSSTDLHAPPESSITQQQLKTQWHLNLEHF >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_6|1206_bp atgggcaaggacttcatgactaaaacaccaagagcaatggcaacaaaagccaaaatagat aaatgggatctaattaaagtaaagagcttctacacagcaagagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctactcatcggacaaagggcta acatccagaattgacaaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcgaaggatgtgaacagacatttctcaaaagaagacatttatgcagccaaa agacgcatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatctcacgccagttagaatggcgatcattagaaagtcaggaaacaacagt gaagctagtcttcaccttgcttacctgccagttgtccacatacctcatttttcctggatg tgggacaagaacttggacccactgaatggcaggactgaaagtgctgtaacacaaacaggg ctgaaacatgcccctctgcttgccacattccggatgacaagaaggagagaagagctgtgg cccttcggggagctcagacctaggggctcccagagccagggctgtggcaccctctttggg tctgtgtggttcctggcatctccaagcttctgggcaccaccatgtccctctcgtccagat gtgggcacccacagcagaagccacatgtggtacatctggtccagctgcagccttgcaggg agccagcacctgtgctggcacctggagctgcctgtgctgccaaagcagccagcatgcctg gctgtgcacaatgaccagatcctgcacttgctcacccacacacccctcaccactctgcgc ctggctcgcccttggcagcagcaggaaagcattgggaacccaggaagacccaagttagct gctggtggtagaagagggaaaggcgtccaagacaaagcctgtccagtagcaagcaagaga ggcttgggacagcactgggctctactgcctgcaattcctggtgactgcccgagagctggc tcttttatcatcctgccagtatgcggactcacaaaatcctccacagacctacatgcaccg cctgagagcagcatcacacagcagcagctgaaaacacaatggcacctgaatcttgagcac ttctga >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_7|428_aa MVKGSIQREELTILNIYAPNTGAPRFIRQVLRDRQRDLDFHTIIMGDFNTPLSVLDRSTR QKANKDIQDFNSALYQADLTDIYITLHPKSTVYTFFSAPHHTYSKIDHLVGSKALLSKCK RTEITTNFLSDHSAIKLELRIEKLTQNHTTTWKLNNLLLNDYWVNNEMKTDIKMFFETNE NKDTTYQNFWDTFKAINKIDTPLARLIEKNQIDTIKNDKGDITTYPTEIQTTIREYYKHL YANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPEGFTA KFCQRYKKELVLFLLKLFQSTEKEGILPNSFYEASIILIPKPGRDTTKKENFTPISLMNI NVKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIIS IDAEKAFD >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_7|1287_bp atggtaaagggatcaattcaacgagaagagctaactatcctaaatatatatgcacccaac acaggagcacccagattcataaggcaagtccttagagaccgacaaagagacttagacttc cacacaataataatgggagactttaacaccccactgtcagtattagacagatcaacgaga cagaaggctaacaaggatatccaggacttcaactcagctctgtaccaagcagacctaaca gacatctacataaccctccaccccaaatcaacagtatatacattcttctcagcaccacat cacacttactccaaaattgaccacttagttggaagtaaagcactcctcagcaaatgtaaa agaacagaaatcacaacaaactttctctcagaccacagtgcaatcaaactagaactcagg attgagaaactcactcaaaaccacacaactacatggaaactgaacaacctgctccttaat gactactgggtaaataacgaaatgaagacagatataaagatgttctttgaaaccaatgag aacaaagacacaacataccagaatttctgggacacatttaaagcaatcaacaaaattgat acaccactagcaagactaatagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccacctatcccacagaaatacaaactaccatcagagaatactataaacacctc tatgcaaataaactagaaaatctagaagaaatggataaattcctggacacatacaccctc ccaagactaaaccaggaagaagttgaatccctgaatagaccaataacaggctctgaaatt gaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagaaggattcacagcc aaattctgccagaggtacaaaaaggagctggtactattccttctgaaactattccaatca acagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgatacca aagcctggcagagacaccacaaaaaaagagaattttacaccaatatccctgatgaacatc aatgtgaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatacgcaaa tcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatctca atagatgcagaaaaggcctttgactaa >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_8|193_aa MKIRKKQCKKAENSKKQNIFSPPKDHNSSPAREQNWTESDFDELTEVGFRRWVITNSSEL KEHVLTQCKEAKNLEKRLEELLTRITGLEKNINDLINIARELHEAYTSTNSQINQAEERI SEIKDQLNEIKREDKIREKRVKRNEQSLQEIWDYVKRPNLCLIGVPKSDGENGTKLENTL QDIIQENFPNLAR >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_8|582_bp atgaagatcaggaaaaaacagtgcaaaaaggctgaaaattccaaaaaacagaacatcttt tctcctccaaaggatcacaactcctcgccagcaagggaacaaaactggacggaaagtgac tttgacgaattgacagaagtaggcttcagaaggtgggtaataacaaactcctccgagcta aaggagcatgttctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagaggaa ctgctaactagaataaccggcttagagaagaacataaatgacctgataaacatagcacga gaacttcatgaagcatacacaagtaccaatagccaaatcaatcaagcagaagaaaggata tcagagattaaagatcaacttaatgaaataaagcgagaagacaagattagagaaaaaaga gtgaaaaggaatgaacaaagcctccaagaaatatgggactatgtgaaaaggccaaacctg tgtctgattggtgtacctaaaagtgacggggagaatggaaccaagctggaaaacactctt caggatattatccaggagaacttccccaacctagcaagatag >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_9|171_aa MLGSTGRGVDVKQRREGSRCTGCFTDVIAVGSSGSTPLEASRGKLGHTSDSGPACESGSL GPSMEKCRCICEKPLLCKLPVGATAGDGCNSAMSRRGASQALVGSIILGMLLPEEPNKEV IVSRETGQTLRSPGRHQSRFGKRTLGPQQANEEAGHLQSGNAKALAPRQID >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_9|516_bp atgctgggaagcactgggaggggcgtggatgtgaaacagagaagagaaggaagtcgatgc acagggtgctttacagacgttatcgctgtgggcagtagtggctcaaccccactggaagcc tctagaggaaagttaggacacacgtcagactcaggtcccgcgtgtgagtcaggctccctg ggcccttcaatggagaagtgccgctgcatctgtgagaagccactactgtgcaagcttcca gtgggtgccacggcgggcgatgggtgcaactcagccatgtccaggagaggagcatcccag gcacttgttggaagcatcattctagggatgctgttgcctgaggagccaaacaaagaggtg attgtatccagggaaacaggacagactctcaggtcccctgggaggcaccaaagccgattt ggtaaaagaacactgggccctcaacaggcaaatgaagaagctggtcatcttcagagtgga aatgctaaggctctcgcaccaagacaaatagactga >gi568815589r:91309349_91510734|GENSCAN_predicted_peptide_10|132_aa MPAPDTLKLTENDRELRENEEDLTKLKCELLNITNHQGNANQNHTLLYRRPWSHESDEKM ETEPWALDQAAGLVFRPDPHFRAGAVQHRAGFSEDDLEAAAPGLCRTRREAQSRLMDNLA WTQLCLKTGSVE >gi568815589r:91309349_91510734|GENSCAN_predicted_CDS_10|399_bp atgcctgcccctgatactctgaaattgaccgagaatgacagggaactcagagagaatgag gaggaccttacaaaattaaaatgtgagctcctcaacatcactaatcatcagggaaatgca aatcaaaaccacaccctactgtatcggcgtccttggtcccatgaatccgatgagaaaatg gagacagagccatgggcactcgaccaggctgcagggctagtgttcaggcctgacccacac ttcagagctggggctgtccagcaccgtgcagggttcagtgaagatgaccttgaggcagca gctcctgggctgtgcaggaccaggagggaagcccagtcgaggctcatggacaaccttgct tggacacagctttgtctcaaaactggaagtgttgaatga