GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:02:12 Sequence gi568815590r:108101888_108348702 : 246815 bp : 37.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10930 11099 170 2 2 88 89 99 0.705 8.74 1.02 Term + 20394 20564 171 2 0 62 45 130 0.402 2.94 1.03 PlyA + 22435 22440 6 1.05 2.00 Prom + 25172 25211 40 -3.75 2.01 Init + 26086 26407 322 0 1 -6 83 171 0.564 4.74 2.02 Intr + 27704 27963 260 0 2 20 81 128 0.525 1.56 2.03 Term + 28740 28991 252 2 0 55 53 131 0.500 0.95 2.04 PlyA + 29061 29066 6 -0.45 3.03 PlyA - 29094 29089 6 1.05 3.02 Term - 30228 29820 409 2 1 111 47 191 0.265 11.00 3.01 Init - 39797 39733 65 2 2 68 100 28 0.381 2.67 3.00 Prom - 43681 43642 40 -6.65 4.00 Prom + 45483 45522 40 -5.85 4.01 Sngl + 48496 48828 333 0 0 58 44 238 0.685 12.17 4.02 PlyA + 49459 49464 6 1.05 5.04 PlyA - 50384 50379 6 -0.45 5.03 Term - 51003 50888 116 1 2 7 40 133 0.521 -1.85 5.02 Intr - 54252 54168 85 0 1 80 111 57 0.940 5.67 5.01 Init - 56072 55722 351 2 0 79 86 115 0.689 7.61 5.00 Prom - 56431 56392 40 -4.95 6.00 Prom + 57597 57636 40 -5.95 6.01 Init + 65762 66113 352 1 1 59 58 169 0.555 8.37 6.02 Term + 67697 68130 434 0 2 43 41 205 0.558 5.77 6.03 PlyA + 68217 68222 6 1.05 7.00 Prom + 68227 68266 40 -4.95 7.01 Init + 68589 68939 351 2 0 58 86 101 0.093 4.11 7.02 Term + 91822 91959 138 0 0 98 47 83 0.045 2.18 7.03 PlyA + 92244 92249 6 1.05 8.00 Prom + 94058 94097 40 -4.45 8.01 Init + 95634 96033 400 2 1 73 -57 308 0.398 11.67 8.02 Term + 97550 97635 86 0 2 57 47 97 0.377 -0.66 8.03 PlyA + 98510 98515 6 1.05 9.17 PlyA - 99216 99211 6 1.05 9.16 Term - 100820 100770 51 2 0 96 41 16 0.507 -5.85 9.15 Intr - 101230 101096 135 1 0 102 94 107 0.902 12.54 9.14 Intr - 101616 101514 103 2 1 60 64 100 0.116 4.06 9.13 Intr - 112829 112720 110 2 2 60 60 45 0.109 -2.94 9.12 Intr - 114626 114525 102 2 0 92 89 29 0.835 2.85 9.11 Intr - 115573 115447 127 0 1 81 98 147 0.800 14.76 9.10 Intr - 123865 123825 41 1 2 52 93 16 0.489 -5.40 9.09 Intr - 127308 127183 126 0 0 71 82 83 0.948 5.96 9.08 Intr - 133215 133111 105 0 0 86 119 59 0.657 8.29 9.07 Intr - 133733 133641 93 2 0 43 105 45 0.359 0.94 9.06 Intr - 134316 134274 43 2 1 95 78 27 0.817 -0.28 9.05 Intr - 138188 138071 118 0 1 27 97 140 0.951 7.40 9.04 Intr - 140026 139912 115 1 1 78 115 95 0.702 10.30 9.03 Intr - 146902 146726 177 1 0 7 78 162 0.015 6.29 9.02 Intr - 166535 166359 177 2 0 90 93 161 0.969 15.99 9.01 Init - 193598 193428 171 2 0 54 95 84 0.233 5.19 9.00 Prom - 202525 202486 40 -5.05 10.04 PlyA - 202631 202626 6 1.05 10.03 Term - 224309 224235 75 2 0 105 52 45 0.105 -0.54 10.02 Intr - 237272 237049 224 0 2 47 39 198 0.295 7.72 10.01 Init - 238175 237962 214 2 1 45 38 182 0.830 7.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 81953 82050 98 1 2 95 56 51 0.857 2.43 S.002 Term + 82618 82732 115 1 1 41 51 127 0.855 1.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_1|113_aa XTKFSNCKPLALERLVFVYNDAPLRSKVKWNLESSIPSSLSAQTVVSALPPNSLMLQSGP SAAGLVEFAGGPLQTLFASPAVAAEQQRLLPVPSSGSFIPEGHLPYPSQSSPV >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_1|342_bp ntaaccaagttctctaactgcaaacccctggctttggaaaggctagtctttgtttacaat gatgcaccattacgcagcaaagtgaaatggaatttggaatcttccatcccttcatctctc agtgcacaaacagttgtatcagctttgcctccaaattcactcatgctgcagtcaggcccc tctgctgcaggtctggtggaatttgctggaggtccactccagacgctgtttgcctcaccc gcagtggctgcagaacagcaaagattgctgcctgttccttcctctggaagcttcattcca gaggggcacctgccatatcccagccagagctctcctgtatga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_2|277_aa MKKKRESNRKERKRKEKEGEEKRKEKKKGKKRKEKKSMESKLISGQSPWEFIRPEWVAVW DTKKPRPRNPPCPNFPHFQSGTDWTQLQKWKGPVQLEQDCHGSEWMGEKEREKRERERER ERKKERKKERKKERKKERKKAKEERKQAKEGRKEERGRKEVNRKKERKKERKKERKKERK KERKKERKKKKDLKLFTESSLGSKSSIIVMDRAMNKSYEQLTPPLSSNLAVSQNPPGNLP SMLCCLEKSTRILNPLYVPLILNILKLKPPFPTQALQ >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_2|834_bp atgaaaaagaaaagggaaagtaacagaaaggaaaggaaaagaaaagaaaaggaaggagaa gagaagagaaaagaaaagaaaaaaggaaagaaaagaaaagaaaagaagtccatggaaagt aaactcatctcaggtcagtctccctgggagttcataagaccagagtgggtagctgtatgg gatactaagaagcctaggcccagaaaccccccgtgcccaaacttcccacacttccagagt ggtacagactggacacagcttcagaaatggaaagggccagtccagctagaacaagattgt catggcagtgagtggatgggagaaaaagagagagagaagagagagagagaaagagagaga gagagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaa gcaaaggaagaaagaaagcaagcaaaggaaggaaggaaagaagaaagaggaaggaaggaa gtaaacagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaag aaagaaagaaagaaagaaagaaagaaaaagaaagacctcaagttatttactgagtcctca ctgggtagcaagtcttcgataatcgtcatggatagagctatgaacaagagctatgaacag ctcactccgcctttgagctcaaatctggcagtgtctcagaatcccccaggaaatcttcca agcatgctttgctgtctagagaagtccactaggattctgaatccactttatgttcctcta attctcaacatattaaagctgaagccaccttttccaacccaagcattacaatga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_3|157_aa MSRSGNGKWHDVTSSQHRRHQRMAQKENSYPWPYGKQTAPAGLSTLLPRVLPRIPTEAAR ELPSCADPQPAAAPGHEVVENSCGKRSILTRPFLVDDLETGRPLGKDKFVHVYLARKKTS HFIVALKAFKSQIEEGVGSTRCAGRWKSRPPFSIPTY >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_3|474_bp atgagcaggtcaggcaatggcaaatggcatgatgtgacctcgagtcaacatcggaggcat caaaggatggcccagaaggagaacagttatccctggccctatggcaagcagacggctcca gccggcctgagtaccctgctcccgcgagtcctcccgaggatccccaccgaagctgcgcgt gagctcccgagctgcgcagacccacagcccgcagcggcccctggccatgaggtggtagag aacagttgtgggaagcgcagcatcttaacgcggcccttcctggtcgacgaccttgagact gggcgtcccctgggcaaagacaagtttgtacatgtgtacttggctcgaaagaagacaagc catttcatcgtggccctcaaggccttcaagtctcagatagaggagggcgtggggagcacc agatgcgcaggcagatggaaatccaggccccctttcagcatcccaacatattga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_4|110_aa MELKNRTQELHNATTSISNQIDQAEERISELEDYLAEIRQADKIRGKRMKRNEQNLQELW DYVKRLNLQLIGVTERDGENRTKLKNKLQNIIQETFLNLTRQANIQIQEI >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_4|333_bp atggagctgaaaaacagaacacaagaacttcacaatgcaaccacaagtatcagtaaccaa atagaccaagcagaggaaagaatttcggagcttgaagactatcttgctgaaataagacag gcagacaagattagaggaaagagaatgaaaaggaatgaacaaaacctccaagaactatgg gattatgtaaaaagactgaacctacaactgattggggtaacagaaagagatggggagaac agaaccaagttgaaaaacaaacttcagaatatcatccaggagaccttcctcaacctaaca agacaggccaacattcaaattcaggaaatctag >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_5|183_aa MIVYLENPIFSAQNLLKLISNFSKVSGYKINVQKSQAFLYTSNRQTESQIMSELPFTIAS KRIKYLGIQLARDMKDLFKENYKPLLNKIKEDTNKWKNIPCSQIGRINIVKMAILPKEKI LSDVFAHFGEIEKYQGFYDKNLNITALHPRDEADLIVVDKLFDVLLDLVCQYFSKDFCID VHQ >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_5|552_bp atgattgtatatctagaaaaccccatcttctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac accagtaacagacaaacagagagccaaatcatgagtgaactcccattcactattgcttca aagagaataaaatacctaggaatccaacttgcaagggatatgaaggacctcttcaaggag aactacaaaccactgctcaacaaaataaaagaggacacaaacaaatggaagaatattcca tgctcacagataggaagaatcaatatcgtgaaaatggccatactgcccaaggaaaaaatc ttatcagatgtttttgcacattttggggaaattgaaaaatatcaaggtttttatgataag aacctcaacattacagccttgcatcccagggatgaagctgacttaatcgtagtggataaa ctttttgatgtgctgctggatttggtttgccaatattttagtaaggacttttgcatcgat gttcatcagtga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_6|261_aa MEDEMNEMKREEKFREKRIKRNEQSLQEIWDYVKRPNLHLIGVPESDGENGTKLENTLQD IIQENFPHPARQANIQIQEIQRTPQRYSLRRATPRHIIVRFTKVEMKEKMLRATREKEIQ TTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVARINSLPTK KSPGPDGFTAKFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKE NFRPISLMNIDAKILNKILAN >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_6|786_bp atggaagatgaaatgaatgaaatgaagcgagaagagaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttcccccatccagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactccttgagaagagcaacaccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcaaccagagagaaagaaatacaa actaccatcagagaatactacaaacacctctatgcaaataaactagaaaatctagaagaa atggataaattccttgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaagaatcaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagccaaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aactga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_7|162_aa MIVYQENPIVSAQSLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEFPFTIAS KRIKYLEIQLTRDVKDLFKENYKPLLNEIKEDTKKWKNIPCSQIGRINIVKMAILPKVSL TSNGMHLGNSWQRFKCKSVVRSRETLTGLCRDGPTALALLAK >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_7|489_bp atgattgtatatcaagaaaaccccattgtctcagcccaaagtctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaattcccattcacaatagcctca aagagaataaaatacctagaaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaagaaatggaagaacattcca tgctcacagataggaagaatcaatattgtgaaaatggccatactgcccaaggtctccctc acatccaatggcatgcacttgggaaacagctggcaaaggttcaagtgcaaatctgtggtt aggagcagagaaacactaactggcttatgcagagatggacccacagctctggccttgtta gcgaaatga >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_8|161_aa MPKTINSCCRKLCPDVVQNFTGFTTEPIKEIMKEVMDMAKKVGGAGFQDTDLGEIQELTD TIPQELTKHDLMEMFLNQCQMIRQCRSNSARTQIDVRHPTEGFRLFKIAFDFLHGTDPSD TGTENYTNGEKRMSITLSQQSPIVLAPGTGFVEDKFSTDGG >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_8|486_bp atgcccaaaacaataaattcctgctgcagaaaactgtgtccagatgttgtgcaaaacttc acaggatttacaacagaaccaatcaaggaaatcatgaaagaggttatggacatggcaaaa aaggtggggggtgcagggtttcaagatacagatcttggagaaattcaagagctaacagac accataccacaggaattaacaaaacacgacttgatggagatgtttctgaaccagtgccag atgataagacaatgcagaagcaacagtgccagaacacaaattgatgttagacatccgaca gaagggttccgattattcaagattgcttttgacttccttcatggcacagacccttctgat acaggcactgaaaactacacaaatggagaaaaaaggatgagtattaccttaagccagcag tccccaatcgttttggcaccaggcaccggtttcgtggaagacaagttttccacggatggg gggtag >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_9|597_aa MSRKKEIVDRVGSARNIVPQGLSLIGAERNTKIIWHPLLRSDSGEDSGTEMSEIHSQTEP ALACSISHYCDDGCIQMLNTPETLQCSAKDSKHFIPKECSIPGENRPPSDTGKTVKHMRS TGSVCSFVGATNKVLVSTDSLFFGKMAEYDLTTRIAHFLDRHLVFPLLEFLSVKEIYNEK ELLQGKLDLLSDTNMVDFAMDVYKNLYSDDIPHALREKRTTVVAQLKQLQAETEPIVKMF EDPETTRQMQSTRDGRMLFDYLADKHGHEALHIINYLRSPQLPGHGPVPEGHRADDQKFR QEYLDTLYRYAKFQYECGNYSGAAEYLYFFRVLVPATDRNALSSLWGKLASEILMQNWDA AMEDLTRLKETIDNNDEKPSFTHVVGKERYLNAIQTMCPHILRYLTTAVITNKDVRKRRQ VLKDLVKVIQQESYTYKDPITEFVECLYVNFDFDGAQKKLRECESVLVNDFFLVACLEDF IENARLFIFETFCRIHQCISINMLADKLNMTPEEAERWIVNLIRNARLDAKIDSKLGHVV MGNNAVSPYQQVIEKTKSLSFRSQMLAMNIEKKLNQNSRSEGFQHIILILADDNLHV >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_9|1794_bp atgtctaggaagaaagaaattgtggatagggttgggtcagccagaaacattgtaccacaa ggtttgagtctaattggagctgaaaggaatacaaagattatttggcaccccctactgaga tctgacagtggagaagacagtggcacagaaatgtctgagatccatagccagacagagcca gctctggcctgctctatctctcattactgtgatgacggctgtatccagatgctaaacact ccagagacactccaatgcagtgctaaagactcaaaacacttcatccccaaagaatgtagc attccaggagaaaacaggcctccttctgacactggtaaaactgtgaagcacatgcgcagt acaggatctgtctgttcgtttgtcggcgctaccaataaagttttagtgagcacagactcc cttttctttggcaagatggcggagtacgacttgactactcgcatcgcgcactttttggat cggcatctagtctttccgcttcttgaatttctctctgtaaaggagatatataatgaaaag gaattattacaaggtaaattggaccttcttagtgataccaacatggtagactttgctatg gatgtatacaaaaacctttattctgatgatattcctcatgctttgagagagaaaagaacc acagtggttgcacaactgaaacagcttcaggcagaaacagaaccaattgtgaagatgttt gaagatccagaaactacaaggcaaatgcagtcaaccagggatggtaggatgctctttgac tacctggcggacaagcatggtcatgaggctttgcatataattaattatttaaggagtccc caactcccaggtcatggaccagtaccggagggccacagagcagatgatcagaagtttagg caggaatatttagatacactctacagatatgcaaaattccagtacgaatgtgggaattac tcaggagcagcagaatatctttatttttttagagtgctggttccagcaacagatagaaat gctttaagttcactctggggaaagctggcctctgaaatcttaatgcagaattgggatgca gccatggaagaccttacacggttaaaagagaccatagataataatgatgaaaaacctagc ttcacacatgtagttggaaaagaaagatatcttaatgcaattcagacaatgtgtccacac attcttcgctatttgactacagcagtcataacaaacaaggatgttcgaaaacgtcggcag gttctaaaagatctagttaaagttattcaacaggagtcttacacatataaagacccaatt acagaatttgttgaatgtttatatgttaactttgactttgatggggctcagaaaaagctg agggaatgtgaatcagtgcttgtgaatgacttcttcttggtggcttgtcttgaggatttc attgaaaatgcccgtctcttcatatttgagactttctgtcgcatccaccagtgtatcagc attaacatgttggcagataaattgaacatgactccagaagaagctgaaaggtggattgta aatttgattagaaatgcaagactggatgccaagattgattctaaattaggtcatgtggtt atgggtaacaatgcagtctcaccctatcagcaagtgattgaaaagaccaaaagcctttcc tttagaagccagatgttggccatgaatattgagaagaaacttaatcagaatagcaggtca gagggttttcaacacatcattttgatactggcagatgataacttacatgtgtag >gi568815590r:108101888_108348702|GENSCAN_predicted_peptide_10|170_aa MEATLEQDNKTRLEQFGGFRRKEDRKMWESLELPRDLWNDFDQNADSDMDNEVQAEVVSD GDKELVRNWSKVWKGNVGLEPRYRVPTGALTSRVVRRGPPSFRPQKCRSTDSLHHEPGKA AGTQCQPVKDLPKAVGAHSLHQPALDSEKHVSISGYSSDSVQTVKTHNAS >gi568815590r:108101888_108348702|GENSCAN_predicted_CDS_10|513_bp atggaagcgactttggagcaggacaacaagacgaggttggaacagtttggagggttcaga agaaaagaagacagaaagatgtgggaaagtttggaacttcctagagacttgtggaatgat tttgaccaaaatgctgatagtgatatggacaatgaagtccaggctgaggtggtctcagat ggagataaggaacttgttagaaactggagtaaagtgtggaagggaaatgtggggctggag ccccgatacagagtccccactggggcactgactagcagagttgtgagaagaggtccacca tccttcagacctcagaaatgtagatccactgacagcttgcaccatgaacctggaaaagct gcaggcactcaatgccagcctgtgaaggatctgcccaaggctgtgggagcccactccttg catcagccagccctggattcagaaaaacatgtttccattagtggctactcctcagactca gtgcagacagtgaagacacataatgcaagctga