GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:00:38 Sequence gi568815582f:56609352_56784042 : 174691 bp : 46.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3497 3552 56 1 2 87 75 46 0.470 3.96 1.02 Intr + 7734 7970 237 0 0 83 46 129 0.820 4.93 1.03 Intr + 8037 8239 203 0 2 40 95 84 0.715 3.23 1.04 Term + 9238 9329 92 2 2 134 41 100 0.750 7.78 1.05 PlyA + 9447 9452 6 1.05 2.00 Prom + 14569 14608 40 -2.86 2.01 Init + 16501 16528 28 0 1 76 95 19 0.803 1.21 2.02 Intr + 17115 17223 109 1 1 130 47 83 0.926 7.74 2.03 Term + 17529 17649 121 0 1 130 36 105 0.954 7.45 2.04 PlyA + 17739 17744 6 1.05 3.00 Prom + 19389 19428 40 -7.36 3.01 Init + 22626 23157 532 2 1 95 62 115 0.641 4.62 3.02 Intr + 23989 24054 66 2 0 145 70 40 0.980 6.78 3.03 Term + 24400 24491 92 2 2 131 41 102 0.981 7.68 3.04 PlyA + 24609 24614 6 1.05 4.00 Prom + 25603 25642 40 -9.16 4.01 Init + 26467 26494 28 0 1 76 99 31 0.947 2.80 4.02 Intr + 27088 27196 109 2 1 139 53 80 0.783 8.94 4.03 Intr + 28765 28933 169 1 1 -28 47 150 0.391 -0.75 4.04 Intr + 29342 29415 74 1 2 66 95 86 0.860 5.30 4.05 Intr + 29913 29978 66 0 0 157 70 2 0.787 3.32 4.06 Term + 30508 30628 121 1 1 136 43 74 0.889 5.65 4.07 PlyA + 30717 30722 6 1.05 5.00 Prom + 32399 32438 40 -3.06 5.01 Init + 34019 34178 160 1 1 82 66 104 0.865 7.59 5.02 Intr + 34586 34657 72 0 0 54 75 72 0.717 1.88 5.03 Intr + 34760 34825 66 0 0 158 70 44 0.973 8.48 5.04 Intr + 39475 39540 66 2 0 130 70 56 0.731 6.88 5.05 Term + 40072 40163 92 2 2 133 44 48 0.733 2.78 5.06 PlyA + 40285 40290 6 1.05 6.00 Prom + 42499 42538 40 -1.46 6.01 Init + 42603 42630 28 2 1 46 110 24 0.642 0.20 6.02 Intr + 43220 43285 66 0 0 137 65 39 0.959 5.38 6.03 Term + 43623 43714 92 1 2 137 41 74 0.983 5.48 6.04 PlyA + 43833 43838 6 1.05 7.00 Prom + 46252 46291 40 -5.36 7.01 Init + 48708 48735 28 2 1 59 95 31 0.654 0.91 7.02 Intr + 49324 49412 89 2 2 125 2 71 0.473 1.89 7.03 Intr + 51071 51201 131 1 2 55 93 141 0.759 10.79 7.04 Term + 53820 53946 127 0 1 97 52 142 0.930 9.26 7.05 PlyA + 54794 54799 6 1.05 8.09 PlyA - 55714 55709 6 1.05 8.08 Term - 57619 57490 130 0 1 142 46 111 0.961 9.95 8.07 Intr - 58029 57918 112 1 1 111 53 81 0.552 6.34 8.06 Intr - 61060 60957 104 0 2 110 115 18 0.002 6.52 8.05 Intr - 66146 66017 130 0 1 50 3 115 0.000 -1.05 8.04 Intr - 72856 72742 115 1 1 126 89 14 0.009 5.32 8.03 Intr - 73279 72990 290 2 2 77 60 110 0.010 4.06 8.02 Intr - 82875 82799 77 2 2 93 106 19 0.080 3.36 8.01 Init - 88675 88587 89 1 2 69 37 60 0.033 -0.84 8.00 Prom - 92083 92044 40 -0.06 9.00 Prom + 94694 94733 40 -6.76 9.01 Init + 98057 98067 11 1 2 68 77 15 0.200 -2.19 9.02 Intr + 99414 99548 135 0 0 90 99 40 0.386 4.98 9.03 Intr + 103813 103920 108 1 0 78 78 84 0.464 5.80 9.04 Intr + 135534 135667 134 0 2 17 79 71 0.403 -0.61 9.05 Intr + 138883 139075 193 2 1 91 69 142 0.899 11.15 9.06 Intr + 149187 149304 118 0 1 81 110 23 0.071 4.27 9.07 Term + 162773 162835 63 1 0 57 55 77 0.007 -0.81 9.08 PlyA + 163342 163347 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 35359 35414 56 2 2 128 48 49 0.908 2.62 S.002 Init - 58642 58615 28 1 1 69 95 36 0.982 2.28 S.003 Intr + 61155 61263 109 1 1 104 53 82 0.944 5.64 S.004 Term + 61547 61676 130 2 1 129 47 65 0.933 4.15 S.005 Intr + 73814 73879 66 0 0 124 70 33 0.851 3.98 S.006 Term + 74607 74698 92 1 2 145 44 73 0.922 6.48 S.007 Term + 140595 140727 133 0 1 102 32 87 0.843 2.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_1|195_aa MGILSEVDVTVFHLKAVYSSGLPRDEAGVAHRLREWRNTADGVVGVGISRPPEAGFPERS GTEMASNHKEMGNMGKLRGQDGTDAADDRAHRPERASDPSTYPQEAQTQRGGCKRGAGPL RPVPSPPAKGAAGSRLQRAFQLPDCLFASPVISWLEMDPNCSCATGCCSCCPMGCAKCAQ GCVCKGASEKCSCCA >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_1|588_bp atgggaattctgtcagaagttgatgttacagtctttcatctgaaggcggtctacagctca ggcctacccagggacgaagctggggttgcacaccggctccgggaatggcgaaataccgca gatggggtggtgggggtgggaatatcgcgaccaccagaggctgggttcccggaacgctcg gggacggagatggccagcaaccacaaggaaatggggaatatggggaagttgcgcgggcag gacggcacggacgccgcggacgaccgggcgcacaggcctgagcgagcgagcgatccctcc acgtacccacaggaggcccagactcagcggggcgggtgcaagcgcggggcggggcctctg cgtccggtcccatctccgcctgcaaaaggagcagctggctccaggctccaacgtgccttc cagctgcctgactgcctcttcgcctctcccgtcatttcttggctcgaaatggaccccaac tgctcctgcgccactggctgctgctcctgctgccccatgggctgtgccaagtgtgcccag ggctgcgtctgcaaaggggcgtcggagaagtgcagctgctgtgcctga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_2|85_aa MDPNCSCATGGSCTCAGSCKCKECKCTSCKKSECGAISRNLGLWLRLLFLLPRGLCQVCP GLRLQRGIGEVQLLCLMWEQLFSQM >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_2|258_bp atggaccccaactgctcttgcgccactggtggctcctgcacgtgcgccggctcctgcaag tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggggccatctccaggaat ctggggctgtggctcaggctgctgttcctgctgccccgtgggctgtgccaagtgtgccca gggctgcgtctgcaaaggggcatcggagaagtgcagctgctgtgcctgatgtgggaacag ctcttctcccagatgtaa >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_3|229_aa MEIYLQRRTWTNVPPHPLRRREWTGERGRPVFPVLLCTEEPVRGTAVWTGTGKAGKEEKR KPHWWRVLCTRLAPYHTPPAPHTTDPGTGAGGLHRDSGTGPVENGGARGWGGDARDAKAG VPESAGRRVEGKGNFGETGKGGRDLGDTAYHPAHSPSRANPSQRGGRAAHSERSSGDGVS CACTGSCTCKECKCTSCKKSCCSCCPVGCAKCAHGCVCKGTLENCSCCA >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_3|690_bp atggagatttatctgcaaaggaggacctggacaaatgttcccccacatcctctcaggcga agagaatggacgggagagagaggccgaccagtgttccccgtgttgctgtgtacggaggag ccagtccgagggaccgcggtgtggacagggacaggcaaggcggggaaggaggagaaacga aagccacattggtggcgggtgctctgcacacgactcgctccctaccacacgccccccgct ccgcacacgaccgatccggggactggagcaggagggctgcaccgggactccgggacaggc ccagttgaaaacggcggggcgagggggtggggtggagacgcccgcgacgccaaggctggg gtcccggaaagcgcggggaggagggtggaaggcaaaggcaacttcggggaaactgggaaa ggcggccgggacctcggggacactgcgtaccacccggcgcacagcccctcccgcgcaaac ccgagccaaaggggcggtcgagcggcgcactcggagcggagctcaggggatggtgtctcc tgcgcctgcaccggctcctgcacgtgcaaagagtgcaaatgcacctcctgcaagaagagc tgctgctcctgctgccccgtgggctgtgccaagtgtgcccacggctgtgtctgcaaaggg acgttggagaactgcagctgctgtgcctga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_4|188_aa MDPNYSCTTGGSCTCAGSCKCKECKCTSCKKSECGAISRNLGLWLRDRQGDSEEKRKSHR WRLLCTQLARYRTLHALHYADPGTGAGGCGCTQTSGQAELKTPLNFLLGISNLTAARNGP QLLLRHWWLLHLHWLLQMQRVQMHLLQEELLLLLPHELCQVCPGLHLQRGIREVQLLCLM SGQPCSKI >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_4|567_bp atggaccccaactactcctgcaccactggtggctcctgcacgtgcgccggctcctgcaag tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggggccatctccaggaat ctggggctgtggttaagggacaggcaaggcgacagcgaggagaaacgaaaatcacatcgg tggcggttgctctgcacacaactcgctcgctaccgcacgctccacgctctgcactacgcc gatccggggacaggagcaggaggctgtggctgcactcagacttcgggacaggccgagctg aaaacccctctcaacttcttgcttgggatctccaacctcaccgcggctcgaaatggaccc caactgctcctgcgccactggtggctcctgcacctgcactggctcctgcaaatgcaaaga gtgcaaatgcacctcctgcaagaagagctgctgctcctgctgccccatgagctgtgccaa gtgtgcccagggctgcatctgcaaaggggcatcagagaagtgcagctgctgtgcctgatg tccggacagccctgctcgaagatatag >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_5|151_aa MECNLREIREGRDYSRDAAYRQEHSPSKASVSQRAAPRVRNSHVKEDPAPGPPVPPISEA RGLRLKTARLQVTLKAKGGSCTCASSCKCKEYKCTSCKKTGSCTYASFCKCKEYKCTSCK KNCCSCYPVGCAKCAQGCICKGASDKCSCCA >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_5|456_bp atggagtgcaatctccgggaaattcgggagggccgggattatagcagggacgccgcgtac cgccaggagcacagcccctccaaagcgagcgtgagccaaagggccgcccccagggtgcgc aacagccacgttaaggaggatcctgcgcccggcccgcctgtgcctccgatttctgaggcg agaggactgaggctgaaaactgcccggctgcaggtcaccctcaaggccaaaggtggctcc tgcacctgtgccagctcctgcaaatgcaaagagtacaaatgcacctcctgcaagaagact gggtcctgcacttatgccagcttctgcaaatgcaaagagtacaaatgcacctcttgcaag aagaactgctgctcctgctaccctgtgggctgtgccaagtgtgcccaaggctgcatttgc aaaggggcatcagataagtgcagctgctgtgcctga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_6|61_aa MDPNCSCTTGGSCACAGSCKCKECKCTSCKKCCCSCCPVGCAKCAQGCVCKGSSEKCRCC A >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_6|186_bp atggatcccaactgctcctgcaccacaggtggctcctgtgcctgcgccggctcctgcaag tgcaaagagtgcaaatgtacctcctgcaagaagtgctgctgctcttgctgccccgtgggc tgtgccaagtgtgcccagggctgtgtctgcaaaggctcatcagagaagtgccgctgctgt gcctga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_7|124_aa MDPNCSCAAGVSCTCAGSCKCKECKCTSCKKSECEAISMGSKVTGDVEESVISATQSHEK DWVQMKDVECADAVVMAYSAVGRWSRYPHPPPCTFVISFASPVSVCSSILVLAFFIGGSW EPRE >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_7|375_bp atggaccccaactgctcctgcgccgctggtgtctcctgcacctgcgctggttcctgcaag tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgtgaggccatctccatgggc agcaaagtgactggagatgtggaggaaagcgtcatctcggcaacgcagagtcatgagaag gactgggtgcagatgaaggatgttgagtgtgctgatgcagtggtgatggcatattctgca gttggaaggtggtcgcgttatcctcatcccccgccctgcaccttcgtcatcagctttgcc tctcccgtgtccgtgtgtagctccatcctcgtgctcgccttcttcattggcggaagctgg gagccgagagagtga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_8|348_aa MEAAYKPGGWPSSEPNHAGTLISDFWPPELDSAHVTHQTLKSHFGLLQTLNPCQERRVCT VAGKRASQGAELGVPYQASRSSWGPFRGKEKQEFPIKRKTRSGTDEKRGGAQKPAALFIV REEGRAQRPRLVLAPAPAESAPPAILGRVQQPACRETRNPGGLHTPSPRSTVCSFAFPRV PGSRPWIGGSVLRDFPGLPATPLQDGSFENEGCPPKRMTLGDMQTLRSIKERSGASRQSS FPTQQKKRRPISSPFMLGKPSTLRLYSKAGVSCTCASSCKCKECKCTSCKKSECGAISRN LGLWLRLLLLLPCGLCQVCPGLHLQRGIGEVQLLRLMSGQPCSQVQIE >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_8|1047_bp atggaagctgcctacaagccaggaggatggccttcctcagaacccaaccatgctggcacc ctgatctcggacttctggcctcctgaactggattcagcccatgtgacccaccagactctg aagagccactttggactgttacaaactttgaatccttgccaggaaagaagagtctgtact gtggctgggaaacgggcatcccaaggcgcggagctaggtgtcccttaccaggcgagcagg agcagttggggtccatttcgaggcaaggagaagcaggagttcccgatcaagaggaaaaca cgcagcgggacagatgaaaagcgtggtggagcccagaagccggcggctctctttatagtc cgggaagagggccgggcgcagaggccccgcctcgtccttgcacccgcccctgctgagtct gcaccgcccgcgatcctgggccgggtgcagcaacccgcgtgccgggaaactcggaatccc ggcggtctacacacccccagcccccgctccactgtgtgtagctttgcatttcccagagtc cctggatcacgtccctggatcggcgggagcgttctccgggactttccaggcctgccggcc accccactgcaggatggcagttttgaaaatgagggatgccctccaaagaggatgacttta ggtgacatgcagaccctgcggtcaatcaaggagagaagtggggccagccgtcagagctcc tttcccactcagcagaagaagagaagacccatctcctctccattcatgctgggaaaacca agcaccctccgtttgtattccaaagcaggtgtctcctgcacctgcgccagctcctgcaag tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggggccatctccaggaat ctggggctgtggctaaggctgctgctcctgctgccctgtgggctgtgccaagtgtgccca gggctgcatctgcaaaggggcatcggagaagtgcagctgctgcgcctgatgtcgggacag ccctgctcccaagtacaaatagagtga >gi568815582f:56609352_56784042|GENSCAN_predicted_peptide_9|253_aa MLASSLGLHMRDLLEEQKKPNMEGESLGLIQGALEQIPPTELSCCGASSNRNLTSVWDID IQPKPHFPDYLAAKSGHMTVLSDDFGCPFGKGVQEAFEKDRLSPTEEKTPPIHSKNEYLL CARHCCRSLGSASPMDTEGFGELLQQAEQLAAETEGISELPHVERNLQEIQQAGERLRSR TLTRTSQETADVKASVLLGSRGLDISHISQRLESLSAATTFEPLEPVKDTDIQTGCQQEL PLLMGVYRMPQRY >gi568815582f:56609352_56784042|GENSCAN_predicted_CDS_9|762_bp atgttggccagctccttagggctccacatgcgtgacttattagaggagcaaaagaagccg aatatggaaggggaaagccttggcctgatccagggtgctctagaacaaattcctcccact gaactgtcctgctgtggggcaagcagtaataggaacctcacaagtgtctgggacatagac atacagccaaaaccacatttcccagactatctggcagccaagagtggccacatgaccgtt ttgtccgatgattttgggtgcccatttggcaaaggagtgcaggaagcctttgagaaggac agactctcaccaacggaagaaaaaacacctcctattcattcaaaaaatgagtacctgcta tgtgccaggcactgttgtagaagcttgggatctgcatctccaatggatactgaggggttt ggtgagctccttcagcaagctgaacagcttgctgctgagactgagggcatctcagagctt ccccatgtggaacggaacttacaggagatccagcaggcgggagagcgcctgcgttcccgt accctaacacgcacgtcccaggagacggcagatgtcaaggcgtcagttctcctcgggtct cggggacttgacatatcccacatctcccagcgattggagagtctgagtgcagccaccacc tttgagcctcttgagcctgtgaaggacactgacattcagactggctgtcagcaggagctg ccattgctaatgggagtgtaccggatgcctcagcggtattga