GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:48:21

Sequence gi568815590f:20097407_20320399 : 222993 bp : 45.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7162   7248   87  0  0   59   73    55 0.171   1.27
 1.02 Term +  20229  20585  357  2  0  100   49   141 0.433   5.91
 1.03 PlyA +  22242  22247    6                               1.05

 2.00 Prom +  33387  33426   40                              -1.16
 2.01 Init +  35350  35379   30  0  0   57  106    47 0.511   2.29
 2.02 Term +  40095  40358  264  2  0   68   54   145 0.689   4.61
 2.03 PlyA +  40714  40719    6                               1.05

 3.12 PlyA -  41416  41411    6                               1.05
 3.11 Term -  48470  48357  114  2  0  110   43   139 0.999  10.17
 3.10 Intr -  49134  49060   75  0  0   65   70    59 0.736   1.51
 3.09 Intr -  49985  49852  134  0  2   91   61   225 0.525  20.46
 3.08 Intr -  50316  50197  120  1  0  101   77   186 0.992  19.27
 3.07 Intr -  50664  50601   64  0  1  132   82   -11 0.528   0.99
 3.06 Intr -  52321  52270   52  0  1  109   75     5 0.438   0.21
 3.05 Intr -  53338  53260   79  2  1   98   88    63 0.399   5.91
 3.04 Intr -  58678  58595   84  2  0   92   59    62 0.680   3.49
 3.03 Intr -  59676  59527  150  1  0   26   84    92 0.378   2.73
 3.02 Intr -  64727  64647   81  0  0   90   91    22 0.536   2.31
 3.01 Init -  67538  67463   76  2  1  117  106   100 0.991  15.11
 3.00 Prom -  67742  67703   40                              -7.36

 4.08 PlyA -  69090  69085    6                               1.05
 4.07 Term -  71547  71522   26  1  2   80   42    20 0.285  -5.01
 4.06 Intr -  74088  73999   90  1  0   72  107   112 0.631  11.47
 4.05 Intr -  75722  75630   93  0  0   85  106    75 0.950   8.94
 4.04 Intr -  77038  76955   84  2  0   77   92    41 0.893   3.19
 4.03 Intr -  81087  81029   59  2  2   97   83    29 0.971   1.83
 4.02 Intr -  82078  81715  364  2  1   94   70   278 0.969  20.84
 4.01 Init -  83558  83435  124  2  1   81   79   153 0.948  14.21
 4.00 Prom -  88448  88409   40                              -3.66

 5.00 Prom +  97116  97155   40                              -4.26
 5.01 Init + 100001 100136  136  1  1  101   70   164 0.961  15.20
 5.02 Intr + 107078 107133   56  0  2  126   85    18 0.965   4.00
 5.03 Intr + 112027 112125   99  0  0   93  111    40 0.992   7.11
 5.04 Intr + 112940 113033   94  1  1  104   27   104 0.957   5.44
 5.05 Intr + 113163 113240   78  1  0  109   69    89 0.986   8.62
 5.06 Intr + 113771 113910  140  0  2    5   78    88 0.906  -0.32
 5.07 Intr + 114246 114347  102  2  0   64   78    32 0.498   0.17
 5.08 Intr + 114696 114793   98  2  2   54  111    77 0.985   5.41
 5.09 Intr + 115376 115499  124  2  1   71  107   107 0.998  11.49
 5.10 Intr + 117412 117562  151  0  1   56  115    30 0.801   2.24
 5.11 Intr + 119007 119089   83  1  2   80   84    60 0.940   4.16
 5.12 Intr + 120747 120876  130  2  1  103   90   154 0.992  17.37
 5.13 Intr + 127383 127474   92  1  2    9  115    33 0.011  -2.19
 5.14 Term + 133346 133609  264  1  0   13   41   197 0.280   3.01
 5.15 PlyA + 142760 142765    6                               1.05

 6.06 PlyA - 142790 142785    6                               1.05
 6.05 Term - 152957 152316  642  2  0   94   55  1399 0.999 131.37
 6.04 Intr - 156179 155376  804  2  0  115   78  1212 0.992 114.55
 6.03 Intr - 157825 157431  395  2  2   90  103   477 0.019  43.77
 6.02 Intr - 169904 169761  144  0  0   72   78   102 0.845   7.85
 6.01 Init - 172470 172383   88  0  1   63   72   114 0.660   8.10
 6.00 Prom - 179796 179757   40                              -4.06

 7.03 PlyA - 181559 181554    6                               1.05
 7.02 Term - 185246 185137  110  0  2   53   43   110 0.248   1.67
 7.01 Init - 190351 190303   49  1  1   86   89    30 0.276   2.01
 7.00 Prom - 191192 191153   40                              -3.26

 8.04 PlyA - 191538 191533    6                               1.05
 8.03 Term - 195102 195040   63  0  0   83   48    54 0.278  -1.21
 8.02 Intr - 197004 196949   56  1  2   97   64    43 0.195   1.50
 8.01 Intr - 206061 205972   90  1  0  127   59    53 0.244   6.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 157775 157431  345  2  0  107  103   508 0.878  51.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_1|147_aa
SIQYLSPAVSDHICTQLLQDLSTSSGIFAEMPLEWEKLLLPQIAGEEREFLIFLIEAHGA
SVLEIQYTSVMALEKRTQDPQDPMARDDKSLWVYRHFPYVLRHVCMHLLAYFSILKASCY
VLVLKQPIIVKTNFCTKRMNRQLRKSE

>gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_1|444_bp
tctattcagtatctgagcccagcagtcagtgaccacatctgcacccagctcctgcaggac
ctcagcacctcctcaggaatctttgcagaaatgcctcttgaatgggagaagctgctgctc
ccccaaattgctggggaagaaagggagttcctgatctttcttattgaggctcatggagct
tctgtcctggagattcagtatacttcagtaatggctctggagaagagaactcaggatcct
caagacccaatggctcgtgatgataaaagcctttgggtctaccgacattttccctacgtg
ctcagacatgtctgcatgcatttattggcatatttttccatactcaaagcaagttgctat
gtgttagtactaaagcaacccataatagtaaaaacgaatttctgcaccaagagaatgaat
cggcagctgagaaagtcagagtga

>gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_2|97_aa
MTLAVAMAVWLSEELLEDILQQKMKMHQEQESEFRKLESSLGEQHKKTQPKAHAGGSENE
SRGHRAGNKTYRQMTCYDQKQIVRGLHKSVSGLENNG

>gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_2|294_bp
atgacactggcagtggctatggctgtctggctatctgaggaattattggaggacatactc
caacagaagatgaaaatgcaccaagaacaagaaagtgaattcaggaaactggagtccagt
ttaggagagcaacacaagaaaacccagcccaaagcccatgcaggtggctctgagaatgaa
tccagaggacacagggctggaaacaaaacgtaccgacagatgacctgctatgatcagaaa
caaatagtcaggggtcttcataaatcagttagtggacttgagaacaatgggtga

>gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_3|342_aa
MGVAILEPTLPIWMMQTMCSPKWQLVQCPSTVPAMAQVGPDVAWPPTLEGASGTSAYRIS
YAHRCSSPAPVFPPFLTTKKGVQAAGLWWSITSVPNIAFAIRGPVPVLKYWVISSKALAK
ELRLGDCIAQGLAFLPASVSYLIGTNLFGVLANKMGRWLCSLIGMLVVGTSLLCVPLAHN
IFGLIGPNAGLGLAIGMVDSSMMPIMGHLVDLRHTSVYGSVYAIADVAFCMGFAIGPSTG
GAIVKAIGFPWLMVITGVINIVYAPLCYYLRSPPAKEEKLHKHPEGLMGLGYRQRLLKGT
KEDMKAILSQDCPMETRMYATQKPTKEFPLGEDSDEEPDHEE

>gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_3|1029_bp
atgggggtggccatcctggagcccacactgcccatctggatgatgcagaccatgtgctcc
cccaagtggcagctggtgcagtgcccctcaactgtgccagccatggctcaagtgggccca gatgtggcctggcctcccactctggagggtgcaagtgggacatctgcttaccggattagt tacgctcaccgatgtagcagtcctgcacctgttttcccgcctttcttgaccacaaagaaa ggagtccaggctgctggattatggtggtccattaccagcgtgcccaacattgcctttgcg atcaggggcccagttccagtgttaaagtactgggtcatcagttctaaggccctcgccaag gagctgaggcttggagattgtattgcacagggtctagctttcttgcctgccagtgtgtcc tacctcattggcaccaacctctttggtgtgttggccaacaagatgggtcggtggctgtgt tccctaatcgggatgctggtagtaggtaccagcttgctctgtgttcctctggctcacaat atttttggtctcattggccccaatgcagggcttggccttgccataggcatggtggattct tctatgatgcccatcatggggcacctggtggatctacgccacacctcggtgtatgggagt gtctacgccatcgctgatgtggctttttgcatgggctttgctataggtccatccaccggt ggtgccattgtaaaggccatcggttttccctggctcatggtcatcactggggtcatcaac atcgtctatgctccactctgctactacctgcggagccccccggcaaaggaagagaagctt cacaaacatcctgagggcctgatgggtttgggataccgacagaggctgctcaagggaaca aaagaagacatgaaggctattctgagtcaggactgccccatggagacccggatgtatgca acccagaagcccacgaaggaatttcctctgggggaggacagtgatgaggagcctgaccat gaggagtag >gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_4|279_aa MLRTILDAPQRLLKEGRASRQLVLVVVFVALLLDNMLFTVVVPIVPTFLYDMEFKEVNSS LHLGHAGSSPHALASPAFSTIFSFFNNNTVAVEESVPSGIAWMNDTASTIPPPATEAISA HKNNCLQGTGFLEEEITRVGVLFASKAVMQLLVNPFVGPLTNRIGYHIPMFAGFVIMFLS TVMFAFSGTYTLLFVARTLQGIGSSFSSVAGLGMLASVYTDDHERGRAMGTALGGLALGL LVGAPFGSVMYEFVGKSAPFLILAFLALLDGDLLSIDDS >gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_4|840_bp atgctccggaccattctggatgctccccagcggttgctgaaggaggggagagcgtcccgg cagctggtgctggtggtggtattcgtcgctttgctcctggacaacatgctgtttactgtg gtggtgccaattgtgcccaccttcctatatgacatggagttcaaagaagtcaactcttct ctgcacctcggccatgccggaagttccccacatgccctcgcctctcctgccttttccacc atcttctccttcttcaacaacaacaccgtggctgttgaagaaagcgtacctagtggaata gcatggatgaatgacactgccagcaccatcccacctccagccactgaagccatctcagct cataaaaacaactgcttgcaaggcacaggtttcttggaggaagagattacccgggtcggg gttctgtttgcttcaaaggctgtgatgcaacttctggtcaacccattcgtgggccctctc accaacaggattggatatcatatccccatgtttgctggctttgttatcatgtttctctcc 
acagttatgtttgctttttctgggacctatactctactctttgtggcccgaacccttcaa ggcattggatcttcattttcatctgttgcaggtcttggaatgctggccagtgtctacact gatgaccatgagagaggacgagccatgggaactgctctggggggcctggccttggggttg ctggtgggagctccctttggaagtgtaatgtacgagtttgttgggaagtctgcacccttc ctcatcctggccttcctggcactactggatggagatttgttatccattgatgattcttga >gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_5|548_aa MALRAMRGIVNGAAPELPVPTGGPAVGAREQALAVSRNYLSQPRLTYKTVSGVNGPLVIL DHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGTSGIDAKKTSCEFTGDILR TPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINPQCRIYPEEMIQTGISAIDGM NSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKDVVDYSEENFAIVFAAMGVNMET ARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDM SSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDD ITHPIPDLTGYITEGQIYVDRQLHNRQYACYAIGKDVQAMKAVVGEEALTSDDLLYLEFL QKFERNFIAQVALKCGPSTKEASLVIYLLDVHCSDKTWKVQPCPIRERVPFPRGEAPGKA ALGITPRQQLLMPERGSPPSPAVAAPLDPEAQPPLLSLVTDNAYRLCHQLFQSKNYLSAQ CPKKRERA >gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_5|1647_bp atggcgctgcgggcgatgcgggggattgtcaacggggccgcacccgagctacccgtgccc accggtgggccggcggtgggagctcgggagcaggcgctggcagtcagtcggaactacctc tcccagcctcgcctcacatacaagacagtatctggagtcaatggtccactagtgatctta gatcatgttaagtttcccaggtatgctgaaattgtccatttgaccttaccggatggcaca aagagaagtgggcaagttctggaagttagtggttccaaggcagtagttcaggtatttgaa gggacttcaggtatagatgctaagaaaacgtcctgtgagtttactggggatattctccga acaccggtgtctgaggatatgcttggtcgggtattcaatggatcgggaaaacccattgac agaggtcctgttgtactggccgaagacttccttgatatcatgggtcagccaatcaaccct caatgtcgaatctacccagaggaaatgattcagactggcatttcggccatcgatgggatg aacagtattgctagggggcagaaaattcctatcttctctgctgctgggctaccacacaat gagattgcagctcagatctgtcgccaggctggtttggtaaagaaatccaaagatgtagta gactacagtgaggaaaattttgcaattgtatttgctgctatgggtgtaaacatggaaact gcccggttcttcaaatctgactttgaagaaaatggctcaatggacaatgtctgcctcttt ttgaacttggctaatgacccaaccattgagcgaattatcactcctcgcctggctctaacc acagctgaatttctggcgtaccaatgtgagaaacatgtattggttattctaacagacatg agttcttatgctgaagcacttcgagaggtttcagcagccagggaagaggtacctggtcga 
cgaggttttccaggttacatgtatacagatttagccacgatatatgaacgcgctgggcga gtggaagggagaaacggctcgattactcaaatccctattctaaccatgcctaatgatgat atcactcaccccatcccagacttgactggctacattacagaggggcagatctatgtggac agacagctgcacaacagacagtatgcgtgctatgctattggaaaggatgtgcaagccatg aaagctgtcgttggagaagaagcccttacctcagatgatcttctctacttggaatttctg cagaagtttgagaggaacttcattgctcaggtcgcactcaagtgtggccctagcaccaag gaagcttcattagtgatttacctactagatgttcattgttcagataaaacttggaaagtg cagccctgtcccatcagggagcgagtgcccttcccccgaggagaagctccagggaaagca gccctgggcatcacccccaggcagcagctgctcatgcccgagagagggagcccaccttcc ccagctgtggctgctcccttagaccccgaggcccagccaccactgctcagtttggtcact gacaatgcctacagattgtgtcaccagttattccagagcaagaattacctctcagctcag tgcccgaaaaaacgagagagggcttga >gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_6|690_aa MEDPGLESEEDHSVDLGTVKEDQNFGERGAPNMETRSTLLGSQHPGDKEAGGKEHKDDAR NQGTRQPRQHPPPGCLRAFPRPCPALPHPQPRVTMGSVSSLISGHSFHSKHCRASQYKLR KSSHLKKLNRYSDGLLRFGFSQDSGHGKSSSKMGKSEDFFYIKVSQKARGSHHPDYTALS SGDLGGQAGVDFDPSTPPKLMPFSNQLEMGSEKGAVRPTAFKPVLPRSGAILHSSPESAS HQLHPAPPDKPKEQELKPGLCSGALSDSGRNSMSSLPTHSTSSSYQLDPLVTPVGPTSRF GGSAHNITQGIVLQDSNMMSLKALSFSDGGSKLGHSNKADKGPSCVRSPISTDECSIQEL EQKLLEREGALQKLQRSFEEKELASSLAYEERPRRCRDELEGPEPKGGNKLKQASQKSQR AQQVLHLQVLQLQQEKRQLRQELESLMKEQDLLETKLRSYEREKTSFGPALEETQWEVCQ KSGEISLLKQQLKESQTEVNAKASEILGLKAQLKDTRGKLEGLELRTQDLEGALRTKGLE LEVCENELQRKKNEAELLREKVNLLEQELQELRAQAALARDMGPPTFPEDVPALQRELER LRAELREERQGHDQMSSGFQHERLVWKEEKEKVIQYQKQLQQSYVAMYQRNQRLEKALQQ LARGDSAGEPLEVDLEGADIPYEDIIATEI >gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_6|2073_bp atggaagaccctggtctagagagtgaagaggaccacagtgtggacctggggacagtgaag gaagatcagaattttggagagaggggagcccccaacatggagaccaggtctacactcttg ggctctcagcaccctggggacaaagaagcaggaggaaaagaacacaaggacgatgccaga aaccaggggacccgacagcccagacagcatcctccaccagggtgcctgagagcctttcca agaccctgcccggccctgccccatcctcagccccgagtcaccatgggcagcgtcagtagc ctcatctccggccacagcttccacagcaagcactgccgggcttcgcagtacaagctgcgc aagtcctcccacctcaagaagctcaaccggtattccgacgggctgctgaggtttggcttc 
tcccaggactccggtcacggcaagtccagctccaaaatgggcaagagcgaagacttcttc tacatcaaggtcagccagaaagcccggggctcccatcacccagattacacggcactgtcc agcggggatttagggggccaggctggggtggactttgacccgtccacaccccccaagctc atgcccttctccaatcagctagaaatgggctccgagaagggtgcagtgaggcccacagcc ttcaagcctgtgctgccacggtcaggagccatcctgcactcctccccggagagtgccagc caccagctgcaccccgcccctccagacaagcccaaggagcaggagctgaagcctggcctg tgctctggggcgctgtcagactccggccggaactccatgtccagcctgcccacacacagc accagcagcagctaccagctggacccgctggtcacacccgtgggacccacaagccgtttt gggggctccgcccacaacatcacccagggcatcgtcctccaggacagcaacatgatgagc ctgaaggctctgtccttctccgacggaggtagcaagctgggccactcgaacaaggcagac aagggcccctcgtgtgtccgctcccccatctccacggacgagtgcagcatccaggagctg gagcagaagctgttggagagggagggcgccctccagaagctgcagcgcagctttgaggag aaggagcttgcctccagcctggcctacgaggagcggccgcggcgctgcagggacgagctg gagggcccggagcccaaaggcggcaacaagctcaagcaggcctcgcagaagagccagcgc gcgcagcaggtcctgcacctgcaggtactgcagcttcagcaggagaagcggcagctccgg caggagctcgagagcctcatgaaggagcaggacctgctggagaccaagctcaggtcctac gagagggagaagaccagcttcggccccgcgctggaggagacccagtgggaggtgtgccag aagtcaggcgagatctccctcctgaagcagcagctgaaggagtcccagacggaggtgaac gccaaggctagcgagatcctgggtctcaaggcacagctgaaggacacgcggggcaagctg gagggcctggagctgaggacccaggacctggagggcgccctgcgcaccaagggcctggag ctggaggtctgtgagaatgagctgcagcgcaagaagaacgaggcggagctgctgcgggag aaggtgaacctgctggagcaggagctgcaggagctgcgggcccaggccgccctggcccgc gacatggggccgcccaccttccccgaggacgtccctgccctgcagcgggagctggagcgg ctgcgggccgagctgcgggaggagcggcaaggccatgaccagatgtcctcgggcttccag catgagcggctcgtgtggaaggaggagaaggagaaggtgattcagtaccagaaacagctg cagcagagctacgtggccatgtaccagcggaaccagcgcctggagaaggccctgcagcag ctggcacgtggggacagcgccggggagcccttggaggttgacctggaaggggctgacatc ccctacgaggacatcatagccactgagatctga >gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_7|52_aa MGLHHFGQAGLELLISGQPLTTENYLAFVNNAKVEKPKHNLNAKGKTTVQSN >gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_7|159_bp atggggttgcaccattttggtcaggctggtctcgaactcctgatctcaggacagcccctt acaacagagaattacctagcctttgtcaataatgccaaggtggagaagcccaagcacaac 
ttaaatgcgaaaggaaaaactacagttcaaagcaattag >gi568815590f:20097407_20320399|GENSCAN_predicted_peptide_8|69_aa XGVRLFPAPMLILAGTSVLGDTGVGGEEVGGPQLALLKGCAAGWWLVGQGSVTAKAVLVA VANSRYFQN >gi568815590f:20097407_20320399|GENSCAN_predicted_CDS_8|210_bp nacggggttcgtctattcccagcaccgatgctgatcctggcggggacgtcagtgttaggt gacactggtgtcggaggtgaagaagtgggaggcccgcagctggccctgctcaaaggctgt gcagccggctggtggctggtggggcagggttctgtgacagcaaaggctgtgctggtcgca gttgctaacagtcggtactttcaaaactga