GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:13:04 Sequence gi568815590r:20149725_20355181 : 205457 bp : 45.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 1020 942 79 1 1 98 88 63 0.179 5.91 1.04 Intr - 6360 6277 84 1 0 92 59 62 0.265 3.49 1.03 Intr - 7358 7209 150 0 0 26 84 92 0.367 2.73 1.02 Intr - 12409 12329 81 2 0 90 91 22 0.527 2.31 1.01 Init - 15220 15145 76 1 1 117 106 100 0.991 15.11 1.00 Prom - 15424 15385 40 -7.36 2.08 PlyA - 16772 16767 6 1.05 2.07 Term - 19229 19204 26 0 2 80 42 20 0.285 -5.01 2.06 Intr - 21770 21681 90 0 0 72 107 112 0.631 11.47 2.05 Intr - 23404 23312 93 2 0 85 106 75 0.950 8.94 2.04 Intr - 24720 24637 84 1 0 77 92 41 0.893 3.19 2.03 Intr - 28769 28711 59 1 2 97 83 29 0.971 1.83 2.02 Intr - 29760 29397 364 1 1 94 70 278 0.969 20.84 2.01 Init - 31240 31117 124 1 1 81 79 153 0.948 14.21 2.00 Prom - 36130 36091 40 -3.66 3.00 Prom + 44798 44837 40 -4.26 3.01 Init + 47683 47818 136 0 1 101 70 164 0.961 15.20 3.02 Intr + 54760 54815 56 2 2 126 85 18 0.965 4.00 3.03 Intr + 59709 59807 99 2 0 93 111 40 0.992 7.11 3.04 Intr + 60622 60715 94 0 1 104 27 104 0.957 5.44 3.05 Intr + 60845 60922 78 0 0 109 69 89 0.986 8.62 3.06 Intr + 61453 61592 140 2 2 5 78 88 0.906 -0.32 3.07 Intr + 61928 62029 102 1 0 64 78 32 0.498 0.17 3.08 Intr + 62378 62475 98 1 2 54 111 77 0.985 5.41 3.09 Intr + 63058 63181 124 1 1 71 107 107 0.998 11.49 3.10 Intr + 65094 65244 151 2 1 56 115 30 0.801 2.24 3.11 Intr + 66689 66771 83 0 2 80 84 60 0.940 4.16 3.12 Intr + 68429 68558 130 1 1 103 90 154 0.992 17.37 3.13 Intr + 75065 75156 92 0 2 9 115 33 0.011 -2.19 3.14 Term + 81028 81291 264 0 0 13 41 197 0.280 3.01 3.15 PlyA + 90442 90447 6 1.05 4.06 PlyA - 90472 90467 6 1.05 4.05 Term - 100639 99998 642 1 0 94 55 1399 0.999 131.37 4.04 Intr - 103861 103058 804 1 0 115 78 1212 0.992 114.55 4.03 Intr - 105507 105113 395 1 2 90 103 477 0.019 43.77 4.02 Intr - 117586 117443 144 2 0 72 78 102 0.845 7.85 4.01 Init - 120152 120065 88 2 1 63 72 114 0.660 8.10 4.00 Prom - 127478 127439 40 -4.06 5.03 PlyA - 129241 129236 6 1.05 5.02 Term - 132928 132819 110 2 2 53 43 110 0.247 1.67 5.01 Init - 138033 137985 49 0 1 86 89 30 0.276 2.01 5.00 Prom - 138874 138835 40 -3.26 6.05 PlyA - 139220 139215 6 1.05 6.04 Term - 142784 142722 63 2 0 83 48 54 0.276 -1.21 6.03 Intr - 144686 144631 56 0 2 97 64 43 0.194 1.50 6.02 Intr - 153743 153654 90 0 0 127 59 53 0.241 6.27 6.01 Init - 178114 178069 46 1 1 93 62 62 0.594 4.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 105457 105113 345 1 0 107 103 508 0.878 51.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_1|157_aa MGVAILEPTLPIWMMQTMCSPKWQLVQCPSTVPAMAQVGPDVAWPPTLEGASGTSAYRIS YAHRCSSPAPVFPPFLTTKKGVQAAGLWWSITSVPNIAFAIRGPVPVLKYWVISSKALAK ELRLGDCIAQGLAFLPASVSYLIGTNLFGVLANKMGR >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_1|471_bp atgggggtggccatcctggagcccacactgcccatctggatgatgcagaccatgtgctcc cccaagtggcagctggtgcagtgcccctcaactgtgccagccatggctcaagtgggccca gatgtggcctggcctcccactctggagggtgcaagtgggacatctgcttaccggattagt tacgctcaccgatgtagcagtcctgcacctgttttcccgcctttcttgaccacaaagaaa ggagtccaggctgctggattatggtggtccattaccagcgtgcccaacattgcctttgcg atcaggggcccagttccagtgttaaagtactgggtcatcagttctaaggccctcgccaag gagctgaggcttggagattgtattgcacagggtctagctttcttgcctgccagtgtgtcc tacctcattggcaccaacctctttggtgtgttggccaacaagatgggtcgn >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_2|279_aa MLRTILDAPQRLLKEGRASRQLVLVVVFVALLLDNMLFTVVVPIVPTFLYDMEFKEVNSS LHLGHAGSSPHALASPAFSTIFSFFNNNTVAVEESVPSGIAWMNDTASTIPPPATEAISA HKNNCLQGTGFLEEEITRVGVLFASKAVMQLLVNPFVGPLTNRIGYHIPMFAGFVIMFLS TVMFAFSGTYTLLFVARTLQGIGSSFSSVAGLGMLASVYTDDHERGRAMGTALGGLALGL LVGAPFGSVMYEFVGKSAPFLILAFLALLDGDLLSIDDS >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_2|840_bp atgctccggaccattctggatgctccccagcggttgctgaaggaggggagagcgtcccgg cagctggtgctggtggtggtattcgtcgctttgctcctggacaacatgctgtttactgtg gtggtgccaattgtgcccaccttcctatatgacatggagttcaaagaagtcaactcttct ctgcacctcggccatgccggaagttccccacatgccctcgcctctcctgccttttccacc atcttctccttcttcaacaacaacaccgtggctgttgaagaaagcgtacctagtggaata gcatggatgaatgacactgccagcaccatcccacctccagccactgaagccatctcagct cataaaaacaactgcttgcaaggcacaggtttcttggaggaagagattacccgggtcggg gttctgtttgcttcaaaggctgtgatgcaacttctggtcaacccattcgtgggccctctc accaacaggattggatatcatatccccatgtttgctggctttgttatcatgtttctctcc acagttatgtttgctttttctgggacctatactctactctttgtggcccgaacccttcaa ggcattggatcttcattttcatctgttgcaggtcttggaatgctggccagtgtctacact gatgaccatgagagaggacgagccatgggaactgctctggggggcctggccttggggttg ctggtgggagctccctttggaagtgtaatgtacgagtttgttgggaagtctgcacccttc ctcatcctggccttcctggcactactggatggagatttgttatccattgatgattcttga >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_3|548_aa MALRAMRGIVNGAAPELPVPTGGPAVGAREQALAVSRNYLSQPRLTYKTVSGVNGPLVIL DHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGTSGIDAKKTSCEFTGDILR TPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINPQCRIYPEEMIQTGISAIDGM NSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKDVVDYSEENFAIVFAAMGVNMET ARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDM SSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDD ITHPIPDLTGYITEGQIYVDRQLHNRQYACYAIGKDVQAMKAVVGEEALTSDDLLYLEFL QKFERNFIAQVALKCGPSTKEASLVIYLLDVHCSDKTWKVQPCPIRERVPFPRGEAPGKA ALGITPRQQLLMPERGSPPSPAVAAPLDPEAQPPLLSLVTDNAYRLCHQLFQSKNYLSAQ CPKKRERA >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_3|1647_bp atggcgctgcgggcgatgcgggggattgtcaacggggccgcacccgagctacccgtgccc accggtgggccggcggtgggagctcgggagcaggcgctggcagtcagtcggaactacctc tcccagcctcgcctcacatacaagacagtatctggagtcaatggtccactagtgatctta gatcatgttaagtttcccaggtatgctgaaattgtccatttgaccttaccggatggcaca aagagaagtgggcaagttctggaagttagtggttccaaggcagtagttcaggtatttgaa gggacttcaggtatagatgctaagaaaacgtcctgtgagtttactggggatattctccga acaccggtgtctgaggatatgcttggtcgggtattcaatggatcgggaaaacccattgac agaggtcctgttgtactggccgaagacttccttgatatcatgggtcagccaatcaaccct caatgtcgaatctacccagaggaaatgattcagactggcatttcggccatcgatgggatg aacagtattgctagggggcagaaaattcctatcttctctgctgctgggctaccacacaat gagattgcagctcagatctgtcgccaggctggtttggtaaagaaatccaaagatgtagta gactacagtgaggaaaattttgcaattgtatttgctgctatgggtgtaaacatggaaact gcccggttcttcaaatctgactttgaagaaaatggctcaatggacaatgtctgcctcttt ttgaacttggctaatgacccaaccattgagcgaattatcactcctcgcctggctctaacc acagctgaatttctggcgtaccaatgtgagaaacatgtattggttattctaacagacatg agttcttatgctgaagcacttcgagaggtttcagcagccagggaagaggtacctggtcga cgaggttttccaggttacatgtatacagatttagccacgatatatgaacgcgctgggcga gtggaagggagaaacggctcgattactcaaatccctattctaaccatgcctaatgatgat atcactcaccccatcccagacttgactggctacattacagaggggcagatctatgtggac agacagctgcacaacagacagtatgcgtgctatgctattggaaaggatgtgcaagccatg aaagctgtcgttggagaagaagcccttacctcagatgatcttctctacttggaatttctg cagaagtttgagaggaacttcattgctcaggtcgcactcaagtgtggccctagcaccaag gaagcttcattagtgatttacctactagatgttcattgttcagataaaacttggaaagtg cagccctgtcccatcagggagcgagtgcccttcccccgaggagaagctccagggaaagca gccctgggcatcacccccaggcagcagctgctcatgcccgagagagggagcccaccttcc ccagctgtggctgctcccttagaccccgaggcccagccaccactgctcagtttggtcact gacaatgcctacagattgtgtcaccagttattccagagcaagaattacctctcagctcag tgcccgaaaaaacgagagagggcttga >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_4|690_aa MEDPGLESEEDHSVDLGTVKEDQNFGERGAPNMETRSTLLGSQHPGDKEAGGKEHKDDAR NQGTRQPRQHPPPGCLRAFPRPCPALPHPQPRVTMGSVSSLISGHSFHSKHCRASQYKLR KSSHLKKLNRYSDGLLRFGFSQDSGHGKSSSKMGKSEDFFYIKVSQKARGSHHPDYTALS SGDLGGQAGVDFDPSTPPKLMPFSNQLEMGSEKGAVRPTAFKPVLPRSGAILHSSPESAS HQLHPAPPDKPKEQELKPGLCSGALSDSGRNSMSSLPTHSTSSSYQLDPLVTPVGPTSRF GGSAHNITQGIVLQDSNMMSLKALSFSDGGSKLGHSNKADKGPSCVRSPISTDECSIQEL EQKLLEREGALQKLQRSFEEKELASSLAYEERPRRCRDELEGPEPKGGNKLKQASQKSQR AQQVLHLQVLQLQQEKRQLRQELESLMKEQDLLETKLRSYEREKTSFGPALEETQWEVCQ KSGEISLLKQQLKESQTEVNAKASEILGLKAQLKDTRGKLEGLELRTQDLEGALRTKGLE LEVCENELQRKKNEAELLREKVNLLEQELQELRAQAALARDMGPPTFPEDVPALQRELER LRAELREERQGHDQMSSGFQHERLVWKEEKEKVIQYQKQLQQSYVAMYQRNQRLEKALQQ LARGDSAGEPLEVDLEGADIPYEDIIATEI >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_4|2073_bp atggaagaccctggtctagagagtgaagaggaccacagtgtggacctggggacagtgaag gaagatcagaattttggagagaggggagcccccaacatggagaccaggtctacactcttg ggctctcagcaccctggggacaaagaagcaggaggaaaagaacacaaggacgatgccaga aaccaggggacccgacagcccagacagcatcctccaccagggtgcctgagagcctttcca agaccctgcccggccctgccccatcctcagccccgagtcaccatgggcagcgtcagtagc ctcatctccggccacagcttccacagcaagcactgccgggcttcgcagtacaagctgcgc aagtcctcccacctcaagaagctcaaccggtattccgacgggctgctgaggtttggcttc tcccaggactccggtcacggcaagtccagctccaaaatgggcaagagcgaagacttcttc tacatcaaggtcagccagaaagcccggggctcccatcacccagattacacggcactgtcc agcggggatttagggggccaggctggggtggactttgacccgtccacaccccccaagctc atgcccttctccaatcagctagaaatgggctccgagaagggtgcagtgaggcccacagcc ttcaagcctgtgctgccacggtcaggagccatcctgcactcctccccggagagtgccagc caccagctgcaccccgcccctccagacaagcccaaggagcaggagctgaagcctggcctg tgctctggggcgctgtcagactccggccggaactccatgtccagcctgcccacacacagc accagcagcagctaccagctggacccgctggtcacacccgtgggacccacaagccgtttt gggggctccgcccacaacatcacccagggcatcgtcctccaggacagcaacatgatgagc ctgaaggctctgtccttctccgacggaggtagcaagctgggccactcgaacaaggcagac aagggcccctcgtgtgtccgctcccccatctccacggacgagtgcagcatccaggagctg gagcagaagctgttggagagggagggcgccctccagaagctgcagcgcagctttgaggag aaggagcttgcctccagcctggcctacgaggagcggccgcggcgctgcagggacgagctg gagggcccggagcccaaaggcggcaacaagctcaagcaggcctcgcagaagagccagcgc gcgcagcaggtcctgcacctgcaggtactgcagcttcagcaggagaagcggcagctccgg caggagctcgagagcctcatgaaggagcaggacctgctggagaccaagctcaggtcctac gagagggagaagaccagcttcggccccgcgctggaggagacccagtgggaggtgtgccag aagtcaggcgagatctccctcctgaagcagcagctgaaggagtcccagacggaggtgaac gccaaggctagcgagatcctgggtctcaaggcacagctgaaggacacgcggggcaagctg gagggcctggagctgaggacccaggacctggagggcgccctgcgcaccaagggcctggag ctggaggtctgtgagaatgagctgcagcgcaagaagaacgaggcggagctgctgcgggag aaggtgaacctgctggagcaggagctgcaggagctgcgggcccaggccgccctggcccgc gacatggggccgcccaccttccccgaggacgtccctgccctgcagcgggagctggagcgg ctgcgggccgagctgcgggaggagcggcaaggccatgaccagatgtcctcgggcttccag catgagcggctcgtgtggaaggaggagaaggagaaggtgattcagtaccagaaacagctg cagcagagctacgtggccatgtaccagcggaaccagcgcctggagaaggccctgcagcag ctggcacgtggggacagcgccggggagcccttggaggttgacctggaaggggctgacatc ccctacgaggacatcatagccactgagatctga >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_5|52_aa MGLHHFGQAGLELLISGQPLTTENYLAFVNNAKVEKPKHNLNAKGKTTVQSN >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_5|159_bp atggggttgcaccattttggtcaggctggtctcgaactcctgatctcaggacagcccctt acaacagagaattacctagcctttgtcaataatgccaaggtggagaagcccaagcacaac ttaaatgcgaaaggaaaaactacagttcaaagcaattag >gi568815590r:20149725_20355181|GENSCAN_predicted_peptide_6|84_aa MVGNDNKKGPPVTRADGVRLFPAPMLILAGTSVLGDTGVGGEEVGGPQLALLKGCAAGWW LVGQGSVTAKAVLVAVANSRYFQN >gi568815590r:20149725_20355181|GENSCAN_predicted_CDS_6|255_bp atggtgggcaacgacaataagaagggaccaccagtgacaagggcagacggggttcgtcta ttcccagcaccgatgctgatcctggcggggacgtcagtgttaggtgacactggtgtcgga ggtgaagaagtgggaggcccgcagctggccctgctcaaaggctgtgcagccggctggtgg ctggtggggcagggttctgtgacagcaaaggctgtgctggtcgcagttgctaacagtcgg tactttcaaaactga