GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:31:49 Sequence gi568815591r:127511067_127715520 : 204454 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 539 578 40 -2.26 1.01 Init + 25928 26036 109 1 1 74 97 109 0.803 8.82 1.02 Intr + 26684 26771 88 0 1 32 70 57 0.125 -2.67 1.03 Term + 34978 35080 103 1 1 95 55 107 0.687 5.85 1.04 PlyA + 36976 36981 6 1.05 2.00 Prom + 41555 41594 40 -2.46 2.01 Init + 42606 42667 62 2 2 95 78 49 0.871 5.32 2.02 Intr + 43312 43399 88 1 1 79 37 60 0.286 -0.03 2.03 Term + 53243 53308 66 1 0 131 47 27 0.208 0.84 2.04 PlyA + 53938 53943 6 1.05 3.00 Prom + 54685 54724 40 -5.16 3.01 Init + 56433 56557 125 2 2 71 91 100 0.705 8.24 3.02 Term + 63671 63770 100 2 1 91 46 52 0.222 -1.10 3.03 PlyA + 65565 65570 6 1.05 4.03 PlyA - 65936 65931 6 1.05 4.02 Term - 72243 70948 1296 0 0 108 44 1695 0.991 158.70 4.01 Init - 74116 73085 1032 1 0 78 86 1222 0.856 114.90 4.00 Prom - 76305 76266 40 -6.16 5.00 Prom + 76433 76472 40 -12.40 5.01 Init + 77433 77499 67 2 1 104 68 115 0.999 12.35 5.02 Intr + 77969 78097 129 0 0 98 71 104 0.501 10.37 5.03 Intr + 78419 78528 110 0 2 68 49 106 0.989 4.70 5.04 Intr + 79000 79071 72 0 0 101 101 149 0.999 17.10 5.05 Intr + 79897 80022 126 0 0 102 66 256 0.956 25.68 5.06 Term + 80147 80233 87 1 0 153 41 109 0.998 10.36 5.07 PlyA + 80613 80618 6 1.05 6.00 Prom + 83221 83260 40 -4.16 6.01 Init + 84166 84192 27 0 0 84 53 29 0.148 -2.50 6.02 Intr + 84241 84853 613 0 1 100 60 411 0.997 31.46 6.03 Intr + 85262 85380 119 0 2 88 94 126 0.996 13.38 6.04 Intr + 87369 87528 160 2 1 94 109 182 0.997 20.46 6.05 Intr + 88315 88485 171 2 0 100 96 129 0.985 14.81 6.06 Intr + 89128 89274 147 2 0 97 -28 110 0.386 0.51 6.07 Term + 90557 90750 194 0 2 141 41 128 0.957 10.98 6.08 PlyA + 91753 91758 6 1.05 7.07 PlyA - 91954 91949 6 1.05 7.06 Term - 92409 92303 107 1 2 36 49 112 0.050 0.67 7.05 Intr - 102025 101956 70 1 1 80 60 80 0.750 3.15 7.04 Intr - 102466 102384 83 2 2 61 96 97 0.811 7.16 7.03 Intr - 102815 102690 126 0 0 95 48 39 0.628 1.25 7.02 Intr - 104029 103897 133 1 1 86 96 149 0.684 15.72 7.01 Init - 104454 104335 120 0 0 80 90 123 0.848 11.97 7.00 Prom - 105469 105430 40 -0.26 8.00 Prom + 105953 105992 40 -7.26 8.01 Init + 106025 106072 48 1 0 60 65 40 0.262 -0.15 8.02 Intr + 109100 109301 202 1 1 101 77 74 0.719 6.46 8.03 Intr + 128938 129101 164 2 2 26 115 53 0.239 1.49 8.04 Term + 133549 133638 90 0 0 120 41 51 0.666 1.32 8.05 PlyA + 134178 134183 6 1.05 9.00 Prom + 138270 138309 40 -2.06 9.01 Init + 139491 139583 93 2 0 49 47 80 0.388 0.38 9.02 Intr + 140546 140726 181 1 1 28 80 71 0.446 -0.26 9.03 Intr + 141198 141385 188 1 2 83 81 138 0.839 12.01 9.04 Intr + 175547 175696 150 1 0 110 92 128 0.961 15.76 9.05 Intr + 183762 183882 121 2 1 114 105 94 0.998 13.77 9.06 Intr + 187809 187887 79 1 1 129 97 10 0.996 4.61 9.07 Intr + 190097 190257 161 2 2 100 77 164 0.804 16.13 9.08 Intr + 191369 191460 92 0 2 75 77 132 0.996 10.51 9.09 Intr + 192099 192257 159 2 0 67 81 167 0.999 14.08 9.10 Intr + 193773 193879 107 2 2 92 94 77 0.986 7.71 9.11 Term + 194451 194613 163 0 1 10 42 247 0.625 9.71 9.12 PlyA + 195968 195973 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_1|99_aa MVQRRAAAGMLHRSCTLQRLLLCAENILLRRERLHKAIQYKEKEFWFFESCINFLKFCKP KPVKERVQLLQLFSQDGIECLQLFQVHGASCPWIYHSGI >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_1|300_bp atggtgcagcggcgcgctgcagccggcatgctgcaccgaagctgtacccttcaacggcta cttttgtgtgctgagaacattttactgaggagggaacggctgcacaaagcaatacagtac aaagaaaaagagttctggttttttgagagctgtataaactttctgaaattctgcaaacct aaacctgtcaaggaaagggttcagctcctgcagctgttctcacaggatggcatcgagtgc ctgcagcttttccaggtgcatggtgcaagctgtccatggatctaccattctgggatctga >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_2|71_aa MGIQWVKNLKERCFCVEVQERTAVIGLRSILIQYDLILTNYICKTLIPKKIFFQGSLLAK HNGKPQDKIGC >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_2|216_bp atggggatccaatgggtcaagaatctgaaggagaggtgcttctgtgtggaggtgcaggag aggacagcagtcattggattaaggtccattctgatccagtatgacctcatcttaactaat tatatctgcaagaccctaattccaaaaaagatcttcttccagggctctttattggccaaa cacaatgggaagccacaggacaagataggctgttga >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_3|74_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKRFCTAKETTIRVNRLTVPKPPVNLASYCASHP AMTHHSGQEFLLYS >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_3|225_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagattctgcacagcaaaagaaactaccatcagagtg aacaggctgacagtgcccaaacccccagtgaacctggcatcctactgtgctagccatcca gccatgacccatcactctggacaagagtttctactctactcctaa >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_4|775_aa MEKFGMNFGGGPSKKDLLETIETQKKQLLQYQARLKDVVRAYKSLLKEKEALEASIKVLS VSHEADVGLAGVQLPGLTFPDSVDDRCSTHSEDSTGTATSLDTAASLTSTKGEFGVEDDR PARGPPPPKSEEASWSESGVSSSSGDGPFAGGEVDKRLHQLKTQLATLTSSLATVTQEKS RMEASYLADKKKMKQDLEDASNKAEEERARLEGELKGLQEQIAETKARLITQQHDRAQEQ SDHALMLRELQKLLQEERTQRQDLELRLEETREALAGRAYAAEQMEGFELQTKQLTREVE ELKSELQAIRDEKNQPDPRLQELQEEAARLKSHFQAQLQQEMRKTALAEDQLRQQSQVEE QRVAALENQISEVSELLGTYEKAKQKDQLAIQKLKERILQLDLENKTLALAASSRSPLDS HGEESSLDVNVLKDKMEKLKRLLQVAARKSQVTLDVEKLCDLEIMPSSEAADGEKATALY YQQELKQLKEEFERYKMRAQVVLKSKNTKDGNLGKELEAAQEQLAELKEKYISLRLSCEE LEHQHQQEADDWKQELARLQQLHRQELERCQLDFRDRTLKLEEELHKQRDRALAVLTEKD LELEQLRSVALASGLPGRRSPVGGGGPGDPADTSSSDSLTQALQLAAANEPTFFLYAEQL ARKEVEITSLRKQKHRLEVEVHQLQDRLLEEGERHREEVAALQSHIEKNIRDQSREGANL EYLKNIIYRFLTLPDSLGRQQTLTAILTILHFSPEEKQVIMRLPTSASWWPSGKR >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_4|2328_bp atggagaagtttgggatgaatttcgggggcggcccgagcaagaaggacttgctggagact atagagacccagaagaagcagcttctccagtaccaggcacggctcaaggatgtggtccgt gcctataaaagcctgctgaaggagaaagaggcattagaggccagcatcaaggtgctgtcg gtatcccacgaggcagatgtgggcctcgcaggtgtccagcttccaggcctcacctttcct gactctgtggatgaccggtgctccactcacagcgaggatagcactgggaccgccactagc ttggatactgcggccagtctcaccagcaccaagggtgagtttggggtagaagatgacaga ccggcccgtggaccaccacctccaaagtccgaagaggccagttggtccgagagtggcgtt agcagtagcagtggggatgggccatttgcaggtggggaggtggacaaaagactgcaccag ctgaagactcagttggctactttgaccagttctttggctacagtcactcaggagaagtcc cgcatggaggcttcttacttggctgacaagaaaaagatgaaacaggacttagaggatgcc agtaacaaggcggaggaggagagggcccgcctggagggagaattgaaggggctgcaggag caaatagcagaaaccaaagcccggcttatcacgcagcagcatgatcgggcccaagagcag agtgaccatgccttgatgctgcgtgagctccagaagctgctgcaggaggagaggacccag cgccaggacttggagcttaggttagaagagacccgagaagccttggcaggacgagcatat gcagctgaacagatggaaggatttgaactgcagaccaagcagctgacccgtgaggtggag gagctgaaaagtgaactgcaggccattcgagatgagaagaatcagccagatccccggctg caagaacttcaggaagaggctgcccgccttaagagccatttccaggctcagttacagcag gaaatgagaaagacagctcttgcagaggatcaactccgtcagcaatctcaggtagaagaa cagagggtggcagccctggagaatcaaatatccgaggtgtctgagctgctaggcacctac gagaaagccaagcagaaggaccagctggccattcagaagctgaaggagcgcattctgcag ctggacctggagaacaagacactggctctagcagcctccagcaggtcccctttagacagc catggagaggagtccagtctggatgtcaatgtcctgaaagataagatggagaagctgaag aggctgctgcaggttgcggccaggaaaagccaggtgaccctggatgtggagaagctctgt gacctggagataatgcccagctcggaggctgctgatggggagaaggctactgcactctat taccaacaggagctgaaacagctgaaggaagagtttgagaggtacaagatgagagcccag gttgtcctcaaaagcaagaataccaaagatggtaacctgggaaaggagctggaggcagcc caggaacagcttgcagagctgaaggagaagtatatttccctgcggctctcctgcgaggag ctggagcaccaacaccagcaggaggctgatgactggaagcaggagctggcccggctgcag cagctccaccggcaggagctggagcggtgccagctggacttcagggaccgcacactgaaa ctggaggaggagctgcacaagcagcgggatcgtgccctagctgtgctcaccgagaaggac ttggaactggagcaactgcgttctgtggccttggcctctgggctgccaggacgcagaagt cctgtgggtggtggcggtcctggggacccagctgacacatcatcctctgatagcctgacc caagcattacaacttgcagcggccaatgagcccactttctttctgtacgctgagcaactg gcccgcaaggaggtggagatcacatcactgaggaagcagaagcacaggctggaggtcgag gtgcatcagctgcaggatcggctgctggaggagggcgaacggcatcgtgaggaggttgca gccctgcagagccacatcgaaaagaacatcagggaccagagcagggagggagccaatctg gagtacctcaaaaacatcatctaccgcttcctgaccttacctgactccctgggccgccag cagactctcacagccatactgactatcttgcacttcagtccagaggagaaacaagtgata atgcgactcccaaccagtgccagctggtggccttctggcaagagatga >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_5|196_aa MGLTVSALFSRIFGKKQMRILMGGSLAPISIPVPLSVAVGLDAAGKTTILYKLKLGEIVT TIPTIGFNVETVEYKNICFTVWDVGGQDKIRPLWRHYFQNTQGLIFVVDSNDRERVQESA DELQKMLQEDELRDAVLLVFANKQDMPNAMPVSELTDKLGLQHLRSRTWYVQATCATQGT GLYDGLDWLSHELSKR >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_5|591_bp atgggcctcaccgtgtccgcgctcttttcgcggatcttcgggaagaagcagatgcggatt ctcatggggggctccctcgctcccatctccatccctgtgcccctttccgttgcagttggc ttggatgcggctggcaagaccacaatcctgtacaaactgaagttgggggagattgtcacc accatcccaaccataggcttcaatgtagaaacagtggaatataagaacatctgtttcaca gtctgggacgtgggaggccaggacaagattcggcctctgtggcggcactacttccagaac actcagggcctcatctttgtggtggacagtaatgaccgggagcgggtccaagaatctgct gatgaactccagaagatgctgcaggaggacgagctgcgggatgcagtgctgctggtattt gccaacaagcaggacatgcccaacgccatgcccgtgagcgagctgactgacaagctgggg ctacagcacttacgcagccgcacgtggtatgtccaggccacctgtgccacccaaggcaca ggtctgtacgatggtctggactggctgtcccacgagctgtcaaagcgctaa >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_6|476_aa MTVAGLLGLTWEILVSNEHETQAVVRLKSVQGLYLLCECDGTVCYGRPRTSHHGCFLLRF HRNSKWTLQCLISGRYLESNGKDVFCTSHVLSAYHMWTPRPALHVHVILYSPIHRCYARA DPTMGRIWVDAAVPCLEECGFLLHFRDGCYHLETSTHHFLSHVDRLFSQPSSQTAFHMQV RPGGLVALCDGEGGMLYPQGTHLLLGMGCNPMRDGEVRAASERLNRMSLFQFECDSESPT VQLRSANGYYLSQRRHRAVMADGHPLESDTFFRMHWNCGRIILQSCRGRFLGIAPNSLLM ANVILPGPNEEFGILFANRSFLVLRGRYGYVGSSSGHDLIQCNQDQPDRIHLLPCRPGIY HFQAQGGSFWSITSFGTFRPWGKFALNFCIELQGSNLLTVLAPNGFYMRADQSQWDVTYQ NPNPPGKTTTLNGPGTSESRSKRRTSVTTFPTQFSKTPVLCNNTSQQATPKILVCI >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_6|1431_bp atgacagttgcagggctccttggtctgacctgggagatcttggtgagcaatgagcatgag acacaggccgtggtgcgactaaagagcgtgcagggcctctacctgctgtgtgagtgtgat ggcaccgtgtgttatggccgcccaaggaccagccaccatgggtgctttctactgcgtttc caccggaacagcaagtggaccctccagtgcctaatctctggtcgttatttggagtccaat ggcaaggacgtgttttgcacttcccacgtcctctcagcttaccacatgtggaccccccga ccagccctccatgtccacgtgatcctctacagccccatccaccgctgctatgcccgggct gaccccactatgggccgcatctgggtggacgcagcagttccctgcctggaggagtgtggc ttcctgttgcatttccgagatggatgctaccacctggagacctctacacaccacttcttg tcccatgtagaccggctgttctcccaaccctcatcacagacagcttttcacatgcaagtg cggcctggagggcttgtggcactgtgtgatggagaaggaggcatgttatatccacagggc acgcatctgctcttgggcatgggctgcaaccccatgagggatggtgaggtgcgtgctgct tctgagcgcttaaaccgaatgtccttgttccagtttgaatgtgacagtgagagccccact gtgcagcttcgttcagccaatggctactacctatcccagaggcgccacagggcagtaatg gctgatgggcaccccctggagtctgacacgttcttccgaatgcactggaactgtggcagg atcatcctgcagtcctgcagggggcgcttcctgggcattgcacccaacagcctgctgatg gccaatgtcatccttccaggcccaaatgaggaatttgggattttatttgccaatcgctcc ttccttgtattgcgaggtcgttatggctatgtgggctcctcatcgggccatgacctcata cagtgcaaccaggatcagcccgaccgcattcatctactaccctgccgaccgggtatctac cacttccaggcacaggggggatccttctggtcaataacatcctttggcacctttcgccct tggggcaagtttgccctcaacttctgtatcgagcttcaggggagcaacttactcactgta ctggcccccaatggcttctacatgcgagccgaccaaagtcaatgggatgtcacctaccaa aatccaaatcctccaggaaaaactactacactaaatggaccaggaacctcagagtcaaga tccaagagaagaacatctgttacaacttttcctacccagtttagcaaaacacctgtttta tgcaacaatacatcacaacaggccacccccaagatccttgtgtgcatctga >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_7|212_aa MNQLGGLFVNGRPLPLDTRQQIVRLAVSGMRPCDISRILKVSNGCVSKILGRYYRTGVLE PKGIGGSKPRLATPPVVARIAQLKAVLAPAVLTPHSGSETPRGTHPGTGHRNRTIFSPSQ AEALEKEFQRGQYPDSVARGKLATATSLPEDTVRVWFSNRRAKWRRQEKLKWEMQLPEIA TATPTFSNYHLDQLAAINIEARSSTGKKIMTH >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_7|639_bp atgaaccagcttggggggctctttgtgaatggccggcccctgcctctggatacccggcag cagattgtgcggctagcagtcagtggaatgcggccctgtgacatctcacggatccttaag gtatctaatggctgtgtgagcaagatcctagggcgttactaccgcacaggtgtcttggag ccaaagggcattgggggaagcaagccacggctggctacaccccctgtggtggctcgaatt gcccagctgaaggctgttttggctccagctgtcctcactccccatagtggctctgagact ccccggggtacccacccagggaccggccaccggaatcggactatcttctccccaagccaa gcagaggcactggagaaagagttccagcgtgggcagtatcctgattcagtggcccgtgga aagctggctactgccacctctctgcctgaggacacggtgagggtctggttttccaacaga agagccaaatggcgtcggcaagagaagctcaagtgggaaatgcagctgccagaaatcgcc acagctaccccaaccttcagcaactatcaccttgatcagttagcagccatcaacattgag gcaagatcctccactggcaaaaagattatgactcactga >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_8|167_aa MSTLLVVQDEGSKQKRESLVERPVSVGKSCPQERRLEFPCLPQGTRGRAKPVRKAGEDWA RPAQRPAGVGLTPRLGLLTPAVGALFTIAKIWKQPKCPLIDEWIKKLWYIYTVEYCSAIK KNEILSFATTWMELEDIMCLELKQLFCNYKGKAKEIAEMPALRWLNF >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_8|504_bp atgagcacattgctggttgttcaggatgaaggcagtaaacagaaaagggagtccctggta gaaaggcccgtgtctgtagggaagtcttgtcctcaggaaaggcggctggagttcccctgc cttcctcaagggaccagagggcgagccaagcccgtgcgtaaagctggagaggactgggcg cggccagcgcagagacccgctggcgttggtctgacacctcgtcttgggctcttgacacct gctgtgggagcactattcacaatagccaagatttggaagcaacctaagtgtccattgata gatgaatggataaagaaactgtggtacatatacacagtggagtactgttcagccataaaa aagaatgagatcctgtcatttgcaacaacatggatggagctggaggacattatgtgtctg gagctgaagcagctattttgcaactacaaaggaaaagccaaggaaattgctgagatgcct gcactgagatggttgaacttttaa >gi568815591r:127511067_127715520|GENSCAN_predicted_peptide_9|497_aa MLVDIEVFQDYWLKTVHQVLTISMNDTVEIGVKAHIGRRFHKALIFQVLSILLAKVKCRK SSGAGTEIPPGLCLRGSTLPGRIPGYSGAHLAGSQPAPRLDSLSPTPTPTLTPPVRPAAP LVAFASPHMASSAQSGGSSGGPAVPTVQRGIIKMVLSGCAIIVRGQPRGGPPPERQINLS NIRAGNLARRAAATQPDAKDTPDEPWAFPAREFLRKKLIGKEVCFTIENKTPQGREYGMI YLGKDTNGENIAESLVAEGLATRREGMRANNPEQNRLSECEEQAKAAKKGMWSEGNGSHT IRDLKYTIENPRHFVDSHHQKPVNAIIEHVRDGSVVRALLLPDYYLVTVMLSGIKCPTFR READGSETPEPFAAEAKFFTESRLLQRDVQIILESCHNQNILGTILHPNGNITELLLKEG FARCVDWSIAVYTRGAEKLRAAERKLHVEIPNISRHFNCQRQDVDADVDVDVDVDVDVDV DVDVGELSPFDGIDGRK >gi568815591r:127511067_127715520|GENSCAN_predicted_CDS_9|1494_bp atgctcgtagatattgaagtatttcaggattattggctgaagaccgtgcaccaagtgtta acaatttcaatgaatgacacggtagaaataggggtcaaagcccacatcggtaggaggttc cacaaagctctgattttccaagttctctccatacttttagccaaggtaaaatgtcggaag tcgagtggagcgggtacggagattcctccaggactctgcctgcgtggctccaccctccca ggccgcattcccgggtacagcggcgcccacctagctggtagccagcctgcccctcgcctc gactccctttcaccaacaccgacacccacattgacacctccagtccggccagccgctcca ctcgttgcctttgcatctccacacatggcgtcctccgcgcagagcggcggctcctccggg ggacccgcggtccccaccgtgcagcggggcatcatcaagatggtcctctcagggtgcgcc atcattgtccgaggtcagcctcgtggtgggcctcctcctgagcggcagatcaacctcagc aacattcgtgctggaaatcttgctcgccgggcagccgccacacaacctgatgcaaaggat acccctgatgagccctgggcatttccagctcgagagttccttcgaaagaagctgattggg aaggaagtctgtttcacgatagaaaacaagactccccaggggcgagagtatggcatgatc taccttggaaaagataccaatggggaaaacattgcagaatcactggttgcagagggctta gccacccggagagaaggcatgagagctaataatcctgagcagaaccggctttcagaatgt gaagaacaagcaaaggcagccaagaaagggatgtggagtgaggggaacggttcacatact atccgggatctcaagtataccattgaaaacccaaggcactttgtggactcacaccaccag aagcctgttaatgctatcatcgagcatgtgcgggacggcagtgtggtcagggccctgctc ctcccagattactacctggttacagtcatgctgtcaggcatcaagtgcccaacttttcga cgggaagcagatggcagtgaaactccagagccttttgctgcagaagccaaatttttcact gagtcgcgactgcttcagagagatgttcagatcattctggagagctgccacaaccagaac attctgggtaccatccttcatccaaatggcaacatcacagagctcctcctgaaggaaggt ttcgcacgctgtgtggactggtcgattgcagtttacacccggggcgcagaaaagctgagg gcggcagagagaaaactgcatgtagaaattccaaatattagtcgacatttcaactgtcag cgtcaagatgtagatgcagatgtagatgtagatgtagatgtagatgtagatgtagatgta gatgtagatgtaggtgaactttctccatttgatgggatagatggaagaaaataa