GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:59:16 Sequence gi568815595r:187135687_187386058 : 250372 bp : 43.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 9380 9574 195 1 0 104 48 134 0.614 8.41 1.02 PlyA + 11660 11665 6 1.05 2.00 Prom + 12653 12692 40 -2.96 2.01 Init + 19002 19219 218 2 2 77 39 104 0.048 2.56 2.02 Intr + 28851 28911 61 0 1 23 111 62 0.045 0.74 2.03 Intr + 51567 51608 42 2 0 116 100 -7 0.092 1.74 2.04 Intr + 61093 61247 155 0 2 94 18 66 0.171 -0.93 2.05 Intr + 61904 62101 198 2 0 82 109 230 0.223 22.97 2.06 Term + 63865 64384 520 1 1 136 44 1064 0.999 100.57 2.07 PlyA + 65755 65760 6 1.05 3.22 PlyA - 66073 66068 6 1.05 3.21 Term - 84575 84385 191 0 2 91 38 276 0.838 20.51 3.20 Intr - 85448 85349 100 2 1 50 98 132 0.999 10.08 3.19 Intr - 87508 87441 68 2 2 102 57 116 0.849 8.52 3.18 Intr - 88307 88212 96 0 0 93 72 31 0.475 1.98 3.17 Intr - 89823 89638 186 1 0 97 75 171 0.999 16.36 3.16 Intr - 90834 90721 114 1 0 93 92 59 0.992 7.22 3.15 Intr - 94211 94074 138 0 0 74 63 111 0.993 7.64 3.14 Intr - 96947 96835 113 1 2 69 96 99 0.999 8.82 3.13 Intr - 100881 100002 880 1 1 129 101 1011 0.998 96.89 3.12 Intr - 107929 107798 132 2 0 67 115 133 0.580 14.52 3.11 Intr - 114643 114565 79 1 1 132 69 72 0.978 8.82 3.10 Intr - 116066 115948 119 0 2 102 56 97 0.989 8.08 3.09 Intr - 117629 117482 148 2 1 111 91 93 0.999 11.71 3.08 Intr - 121174 120978 197 2 2 68 105 235 0.994 22.33 3.07 Intr - 125186 125055 132 0 0 43 65 352 0.996 29.02 3.06 Intr - 127034 126857 178 2 1 61 106 212 0.882 19.79 3.05 Intr - 141081 140907 175 2 1 55 92 86 0.747 5.64 3.04 Intr - 141359 141319 41 2 2 88 107 3 0.771 -0.78 3.03 Intr - 143147 143073 75 2 0 17 91 71 0.398 0.01 3.02 Intr - 150327 150139 189 0 0 84 100 185 0.927 19.08 3.01 Init - 151015 151001 15 1 0 90 84 -4 0.814 -0.15 3.00 Prom - 151538 151499 40 -11.92 4.00 Prom + 151957 151996 40 -2.86 4.01 Init + 153042 153154 113 2 2 61 54 77 0.381 1.28 4.02 Intr + 163855 163991 137 1 2 44 61 152 0.751 8.31 4.03 Intr + 177874 177921 48 2 0 99 53 47 0.477 0.95 4.04 Intr + 182058 182173 116 1 2 118 90 11 0.658 4.47 4.05 Intr + 185034 185076 43 2 1 123 53 17 0.664 -0.49 4.06 Term + 185224 185375 152 2 2 134 38 50 0.827 2.67 4.07 PlyA + 186006 186011 6 1.05 5.00 Prom + 205100 205139 40 -3.76 5.01 Init + 231999 232127 129 2 0 88 48 150 0.687 9.51 5.02 Term + 232281 232469 189 2 0 74 54 82 0.350 0.85 5.03 PlyA + 232479 232484 6 1.05 6.00 Prom + 234045 234084 40 -5.46 6.01 Init + 234433 234576 144 0 0 64 99 53 0.524 4.12 6.02 Term + 235124 235687 564 1 0 66 45 172 0.270 5.09 6.03 PlyA + 235802 235807 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_1|64_aa VMLMQNIGSRGLWQLHLCDTAGYSPHGCFHGLVLSVCSYSGHKVQAVDGSIILGSEVALF SQLH >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_1|195_bp gtcatgctgatgcaaaatataggctcccgaggcctttggcagctccacctctgtgacact gcaggatacagcccccatggctgctttcacgggctggtgttaagtgtctgcagctattcc gggcacaaggtgcaagctgtggatggctctatcattctggggtctgaggtggccctcttc tcacagctgcactag >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_2|397_aa MTILPKAIYKFNALIIKIPSFFTELEKTVLKSIWNQKRARIAKARLSKKRKSGGITLPDF ILYYKAIVTKIACLGGKRAKSEGSKVRFDMDNLAVSGTNHPATPNHKVAVLAQGHCSMDP IAQLFSTQSATSTDIELGTCLQTLPTCSDMFEMNELMNRWKLPSLTTDETMCKSVTTDEW KKVFYEKMEEAKPADSWDLIIDPNLKHNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYV VILFHMFLDRAQRAGSVRMRVFKQLCYECGTARLDESSMLEENIEGLVDNLITSLREQCY GERGGQYRIHVASRQDNRRHRGEFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKS QDQTGSGWNFCSIPWCLFWATVLLLIIYLQFSFRSSV >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_2|1194_bp atgaccatactgccaaaagcaatctacaaattcaatgcactcatcatcaaaataccatca ttcttcacagaattagaaaaaacagttctaaaatccatatggaaccaaaaaagagcccgc atagccaaagcaagactaagcaaaaagaggaaatctggaggcatcacattacctgatttc atactatactataaggccatagtcaccaaaatagcatgtttgggtggaaaaagagctaag agtgagggctccaaagtacgttttgacatggacaacttggctgtgtctggcactaaccat cctgccacccctaaccacaaggtggctgttcttgcacaaggccattgcagcatggatcct attgcacagttattcagtacacagtcagctacaagcactgacatagagcttggcacatgt ctgcaaaccctacccacatgctcggatatgtttgaaatgaatgaattaatgaaccggtgg aaattgccttccctcactactgacgagaccatgtgtaaaagcgtgaccacagatgagtgg aagaaagtcttctatgagaagatggaggaggcaaagccggctgacagctgggacctcatc atagaccccaacctcaagcacaatgtgctgagccctggttggaagcagtacctggaattg catgcttcaggcaggttccactgctcctggtgctggcacacctggcagtcgccctacgtg gtcatcctcttccacatgttcctggaccgcgcccagcgggcgggctcggtgcgcatgcgc gtcttcaagcagctgtgctatgagtgcggcacggcgcggctggacgagtccagcatgctg gaggagaacatcgagggcctggtggacaacctcatcaccagcctgcgcgagcagtgctac ggcgagcgtggcggccagtaccgcatccacgtggccagccgccaggacaaccggcggcac cgcggagagttctgcgaggcctgccaggagggcatcgtgcactggaagcccagcgagaag ctgctggaggaggaggcgaccacctacaccttctcccgggcgcccagccccaccaagtcg caggaccagacgggctcaggctggaacttctgctctatcccctggtgcttgttttgggcc acggtcctgctgctgatcatctacctgcagttctctttccgtagctccgtataa >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_3|1121_aa MVTCSASAHTVELNNMFGQIQSPGYPDSYPSDSEVTWNITVPDGFRIKLYFMHFNLESSY LCEYDYVKIGYDYVERSILQESPQASWTAMEEQATLGQSASCCVIERDTRTQTPILAYSE ELENENELASGKCNESSFYAKSANSTGFRVMENKRKLAVLRKSMWVETEDQVLATFCGRE TTDTEQTPGQEVVLSPGSFMSITFRSDFSNEERFTGFDAHYMAVDVDECKEREDEELSCD HYCHNYIGGYYCSCRFGYILHTDNRTCRVECSDNLFTQRTGVITSPDFPNPYPKSSECLY TIELEEGFMVNLQFEDIFDIEDHPEVPCPYDYIKIKVGPKVLGPFCGEKAPEPISTQSHS VLILFHSDNSGENRGWRLSYRAAGNECPELQPPVHGKIEPSQAKYFFKDQVLVSCDTGYK VLKDNVEMDTFQIECLKDGTWSNKIPTCKNCRAPGELEHGLITFSTRNNLTTYKSEIKYS CQEPYYKMLNNNTECGQPSRSLPSLVKRIIGGRNAEPGLFPWQALIVVEDTSRVPNDKWF GSGALLSASWILTAAHVLRSQRRDTTVIPVSKEHVTVYLGLHDVRDKSGAVNSSAARVVL HPDFNIQNYNHDIALVQLQEPVPLGPHVMPVCLPRLEPEGPAPHMLGLVAGWGISNPNVT VDEIISSGTRTLSDVLQYVKLPVVPHAECKTSYESRSGNYSVTENMFCAGYYEGGKDTCL GDSGGAFVIFDDLSQRWVVQGLVSWGGPEECGSKQVYGVYTKVSNYVDWVWEQMGLPQSV VEPQVERDLPAAQRLIEHGVEEKGGNKDVIDSDISRHLEKQRGEVCGLPKFSRKLMARIF NGRPAQKGTTPWIAMLSHLNGQPFCGGSLLGSSWIVTAAHCLHQSLDPEDPTLRDSDLLS PSDFKIILGKHWRLRSDENEQHLGVKHTTLHPQYDPNTFENDVALVELLESPVLNAFVMP ICLPEGPQQEGALCSLTYLRTMQALQGGTVASFTNEKTKDQQGAMVIVSGWGKQFLQRFP ETLMEIEIPIVDHSTCQKAYAPLKKKVTRDMICAGEKEGGKDACAGDSGGPMVTLNRERG QWYLVGTVSWGDDCGKKDRYGVYSYIHHNKDWIQRVTGVRN >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_3|3366_bp atggtgacgtgctctgcttcagcccacaccgtggagctaaacaatatgtttggccagatc cagtcgcctggttatccagactcctatcccagtgattcagaggtgacttggaatatcact gtcccagatgggtttcggatcaagctttacttcatgcacttcaacttggaatcctcctac ctttgtgaatatgactatgtgaagattggatatgactatgtggagcgtagtatacttcaa gaaagcccacaggcctcctggacagctatggaagagcaggccacattgggtcagtctgca tcctgctgtgtaattgagagagacaccaggacccaaacaccaattctagcttattcagaa gaattagaaaatgaaaatgagcttgcttcaggaaagtgtaatgaaagctcattttatgcc aaaagtgcaaacagcacaggcttcagggtgatggaaaacaaaaggaagctcgcagtgctg aggaagagcatgtgggtagaaactgaggaccaggtgctggcaaccttctgtggcagggag accacagacacagagcagactcccggccaggaggtggtcctctcccctggctccttcatg tccatcactttccggtcagatttctccaatgaggagcgtttcacaggctttgatgcccac tacatggctgtggatgtggacgagtgcaaggagagggaggacgaggagctgtcctgtgac cactactgccacaactacattggcggctactactgctcctgccgcttcggctacatcctc cacacagacaacaggacctgccgagtggagtgcagtgacaacctcttcactcaaaggact ggggtgatcaccagccctgacttcccaaacccttaccccaagagctctgaatgcctgtat accatcgagctggaggagggtttcatggtcaacctgcagtttgaggacatatttgacatt gaggaccatcctgaggtgccctgcccctatgactacatcaagatcaaagttggtccaaaa gttttggggcctttctgtggagagaaagccccagaacccatcagcacccagagccacagt gtcctgatcctgttccatagtgacaactcgggagagaaccggggctggaggctctcatac agggctgcaggaaatgagtgcccagagctacagcctcctgtccatgggaaaatcgagccc tcccaagccaagtatttcttcaaagaccaagtgctcgtcagctgtgacacaggctacaaa gtgctgaaggataatgtggagatggacacattccagattgagtgtctgaaggatgggacg tggagtaacaagattcccacctgtaaaaactgtagagccccaggagagctggaacacggg ctgatcaccttctctacaaggaacaacctcaccacatacaagtctgagatcaaatactcc tgtcaggagccctattacaagatgctcaacaataacacagagtgtggtcagccctcccgc tccctgccaagcctggtcaagaggatcattgggggccgaaatgctgagcctggcctcttc ccgtggcaggccctgatagtggtggaggacacttcgagagtgccaaatgacaagtggttt gggagtggggccctgctctctgcgtcctggatcctcacagcagctcatgtgctgcgctcc cagcgtagagacaccacggtgataccagtctccaaggagcatgtcaccgtctacctgggc ttgcatgatgtgcgagacaaatcgggggcagtcaacagctcagctgcccgagtggtgctc cacccagacttcaacatccaaaactacaaccacgatatagctctggtgcagctgcaggag cctgtgcccctgggaccccacgttatgcctgtctgcctgccaaggcttgagcctgaaggc ccggccccccacatgctgggcctggtggccggctggggcatctccaatcccaatgtgaca gtggatgagatcatcagcagtggcacacggaccttgtcagatgtcctgcagtatgtcaag ttacccgtggtgcctcacgctgagtgcaaaactagctatgagtcccgctcgggcaattac agcgtcacggagaacatgttctgtgctggctactacgagggcggcaaagacacgtgcctt ggagatagcggtggggcctttgtcatctttgatgacttgagccagcgctgggtggtgcaa ggcctggtgtcctgggggggacctgaagaatgcggcagcaagcaggtctatggagtctac acaaaggtctccaattacgtggactgggtgtgggagcagatgggcttaccacaaagtgtt gtggagccccaggtggaacgggatttaccagcagctcagcgactgattgaacatggggtt gaagaaaagggaggcaacaaggatgtgattgattcagacattagcagacatctggaaaaa caaaggggagaagtgtgtgggctccccaagttctcccggaagctgatggccaggatcttc aatggacgcccagcccagaaaggcaccactccctggattgccatgctgtcacacctgaat gggcagcccttctgcggaggctcccttctaggctccagctggatcgtgaccgccgcacac tgcctccaccagtcactcgatccggaagatccgaccctacgtgattcagacttgctcagc ccttctgacttcaaaatcatcctgggcaagcattggaggctccggtcagatgaaaatgaa cagcatctcggcgtcaaacacaccactctccacccccagtatgatcccaacacattcgag aatgacgtggctctggtggagctgttggagagcccagtgctgaatgccttcgtgatgccc atctgtctgcctgagggaccccagcaggaaggtgctttatgttcattaacttatttacga accatgcaagccttgcaaggtggcactgtagcctcatttacaaacgagaaaactaaggat cagcagggagccatggtcatcgtcagcggctgggggaagcagttcttgcaaaggttccca gagaccctgatggagattgaaatcccgattgttgaccacagcacctgccagaaggcttat gccccgctgaagaagaaagtgaccagggacatgatctgtgctggggagaaggaaggggga aaggacgcctgtgcgggtgactctggaggccccatggtgaccctgaatagagaaagaggc cagtggtacctggtgggcactgtgtcctggggtgatgactgtgggaagaaggaccgctac ggagtatactcttacatccaccacaacaaggactggatccagagggtcaccggagtgagg aactga >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_4|202_aa MTAFSRPHRGDTEPPEAGSVKDWQQTALCSPEARCHVWCVCEDLSGRNHHLNQRAEYGRS AVINVVGIIRSIEGFGGTRKQRQGTIHALNRNLWLKASAVTLQVWLWNHLCQKHPRSFLL TKGVTGVLVQTDCMESLECPEPSRKIDSPKQAGPLAGSGSVSGQGAISGWATDEQQWPSG SNCSEDFRQENGALSYPKGSEI >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_4|609_bp atgactgctttttccagaccacatcgaggagacactgagcctcctgaggccggttctgtc aaggactggcagcagacagccctgtgtagcccagaagcccggtgccatgtgtggtgtgtc tgtgaggacctttctggaagaaatcatcatttgaaccagcgggctgagtatggaagatct gctgtcatcaatgtggtgggcatcattcgatccattgagggctttggtggaacaagaaag cagaggcaggggaccatccatgccctcaacagaaacctctggctgaaagcttctgcagtg actcttcaagtgtggctctggaatcatttgtgtcagaaacacccaaggagctttctgctt actaaaggtgtgactggagttctagtccagaccgactgtatggagtctctggagtgccct gagccttccaggaaaattgattcacccaaacaagcaggtccactggctgggtcaggctct gtgtctgggcaaggagccatcagtggatgggctacagatgagcagcagtggcccagtggt tctaactgctcagaggacttcaggcaagaaaatggggctctttcttatcccaagggctca gaaatataa >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_5|105_aa MAGCRSRALPHREAAKAQREIELSAGGPALLGDLAHPPQLLAQQAEGAGSGFGLPRKGLP QCSGWLKGSSSAARVGTEAEEAPRASEGCEGYQHAVTSHFAYYKK >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_5|318_bp atggcgggctgcaggtcccgagccctgccccacagagaggcagctaaggcccagcgagaa atcgagctcagtgccggtgggccggcactgctgggggacctggcgcaccctccgcagctg ctggcccagcaggctgagggagccggttctggctttggcctacccaggaaggggctccca cagtgcagtggctggctgaagggctcctccagcgcggccagagtgggcaccgaggccgag gaggctccgagagcgagcgagggctgcgagggctaccagcatgctgtcacctctcacttt gcctattataaaaaatga >gi568815595r:187135687_187386058|GENSCAN_predicted_peptide_6|235_aa MRVIEGKGFAQGLPDVNWIFNPGMRDAVCFSRCERLATLLTRGNVLQQRSWASAQVQILC HTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSNLVQHILKKYYGNGT RKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSCMTKPSKSLLPHLKTGNSSPGIG AVYLANQAKNQSAEAKEAKGSGYEKLGPSRDPDPLNICVFILLLVFIVVKCFTSE >gi568815595r:187135687_187386058|GENSCAN_predicted_CDS_6|708_bp atgagggttatagaagggaagggctttgcccaaggtcttcctgatgtaaactggatcttc aacccaggtatgagggatgcagtctgtttctccaggtgtgagagattagctacactactc accagaggcaacgtgttacagcagcgaagttgggcttccgcccaagtgcagattctgtgc cacacgtactgggagcactggacatcccagggtcaggtgcgtatgaggctctttggccaa aggtgccagaagtgctcctggtcccaatatgagatgcctgagttctcctcggatagcacc atgaggattctgagcaacctggtgcagcatatactgaagaaatactatggaaatggcacg aggaagtctccagaaatgccagtaatcctggaagtgtccctggaaggatcccatgacaca gccaattgtgaggcatgcactttgggcatctgtggacagggcttaaaaagctgcatgaca aagccgtccaaatccctactcccccacctaaagactgggaattcctcacctggaattggt gctgtgtacctcgcaaaccaagccaagaaccagtcagctgaggcaaaagaggctaagggg agtgggtatgagaaattagggcccagtcgagacccagatccactgaacatctgtgtcttt attttgctgcttgtatttattgtagtcaaatgctttacatcagaatga