GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:50:36 Sequence gi568815590r:20049676_20280964 : 231289 bp : 44.34% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5016 5177 162 2 0 66 43 117 0.240 4.90 1.02 Intr + 5800 6022 223 0 1 52 31 115 0.055 -0.10 1.03 Intr + 24235 24345 111 2 0 20 74 93 0.031 1.45 1.04 Intr + 42276 42409 134 1 2 81 113 32 0.025 5.36 1.05 Intr + 54893 54979 87 1 0 59 73 55 0.096 1.27 1.06 Term + 67960 68316 357 0 0 100 49 141 0.373 5.91 1.07 PlyA + 69973 69978 6 1.05 2.00 Prom + 81118 81157 40 -1.16 2.01 Init + 83081 83110 30 1 0 57 106 47 0.511 2.29 2.02 Term + 87826 88089 264 0 0 68 54 145 0.689 4.61 2.03 PlyA + 88445 88450 6 1.05 3.12 PlyA - 89147 89142 6 1.05 3.11 Term - 96201 96088 114 0 0 110 43 139 0.999 10.17 3.10 Intr - 96865 96791 75 1 0 65 70 59 0.736 1.51 3.09 Intr - 97716 97583 134 1 2 91 61 225 0.525 20.46 3.08 Intr - 98047 97928 120 2 0 101 77 186 0.992 19.27 3.07 Intr - 98395 98332 64 1 1 132 82 -11 0.528 0.99 3.06 Intr - 100052 100001 52 1 1 109 75 5 0.438 0.21 3.05 Intr - 101069 100991 79 0 1 98 88 63 0.399 5.91 3.04 Intr - 106409 106326 84 0 0 92 59 62 0.680 3.49 3.03 Intr - 107407 107258 150 2 0 26 84 92 0.378 2.73 3.02 Intr - 112458 112378 81 1 0 90 91 22 0.536 2.31 3.01 Init - 115269 115194 76 0 1 117 106 100 0.991 15.11 3.00 Prom - 115473 115434 40 -7.36 4.08 PlyA - 116821 116816 6 1.05 4.07 Term - 119278 119253 26 2 2 80 42 20 0.285 -5.01 4.06 Intr - 121819 121730 90 2 0 72 107 112 0.631 11.47 4.05 Intr - 123453 123361 93 1 0 85 106 75 0.950 8.94 4.04 Intr - 124769 124686 84 0 0 77 92 41 0.893 3.19 4.03 Intr - 128818 128760 59 0 2 97 83 29 0.971 1.83 4.02 Intr - 129809 129446 364 0 1 94 70 278 0.969 20.84 4.01 Init - 131289 131166 124 0 1 81 79 153 0.948 14.21 4.00 Prom - 136179 136140 40 -3.66 5.00 Prom + 144847 144886 40 -4.26 5.01 Init + 147732 147867 136 2 1 101 70 164 0.961 15.20 5.02 Intr + 154809 154864 56 1 2 126 85 18 0.965 4.00 5.03 Intr + 159758 159856 99 1 0 93 111 40 0.992 7.11 5.04 Intr + 160671 160764 94 2 1 104 27 104 0.957 5.44 5.05 Intr + 160894 160971 78 2 0 109 69 89 0.986 8.62 5.06 Intr + 161502 161641 140 1 2 5 78 88 0.906 -0.32 5.07 Intr + 161977 162078 102 0 0 64 78 32 0.498 0.17 5.08 Intr + 162427 162524 98 0 2 54 111 77 0.985 5.41 5.09 Intr + 163107 163230 124 0 1 71 107 107 0.998 11.49 5.10 Intr + 165143 165293 151 1 1 56 115 30 0.801 2.24 5.11 Intr + 166738 166820 83 2 2 80 84 60 0.940 4.16 5.12 Intr + 168478 168607 130 0 1 103 90 154 0.992 17.37 5.13 Intr + 175114 175205 92 2 2 9 115 33 0.011 -2.19 5.14 Term + 181077 181340 264 2 0 13 41 197 0.280 3.01 5.15 PlyA + 190491 190496 6 1.05 6.06 PlyA - 190521 190516 6 1.05 6.05 Term - 200688 200047 642 0 0 94 55 1399 0.999 131.37 6.04 Intr - 203910 203107 804 0 0 115 78 1212 0.992 114.55 6.03 Intr - 205556 205162 395 0 2 90 103 477 0.019 43.77 6.02 Intr - 217635 217492 144 1 0 72 78 102 0.847 7.85 6.01 Init - 220201 220114 88 1 1 63 72 114 0.662 8.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 205506 205162 345 0 0 107 103 508 0.878 51.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_1|357_aa MLEEPFSLPLHRGSPSLGWSRPEPVPSACGEVWRERHEWELGLHTMLVGQCEFWVCKCTN QHSVSSSGFVNTPIHTVYLANLVGTWRTFVSSSGIVNAPISTLSKRTNQLSIKQTNRLSV KWTNQQDVELRDQRESKIEGLEEVKGNLIEKGQGILSEELGSKVRKSSNSILPFNQAKNL EIVLLFLFLSYPTSIQPGNLVDSLFEYIQKSIQYLSPAVSDHICTQLLQDLSTSSGIFAE MPLEWEKLLLPQIAGEEREFLIFLIEAHGASVLEIQYTSVMALEKRTQDPQDPMARDDKS LWVYRHFPYVLRHVCMHLLAYFSILKASCYVLVLKQPIIVKTNFCTKRMNRQLRKSE >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_1|1074_bp atgcttgaggagcccttcagcctgccgctgcaccgcgggagcccttctctgggctggtcg aggccggagccggttccctcagcttgcggggaggtgtggagggagaggcacgagtgggaa ctggggctgcacacgatgcttgtgggccagtgcgagttctgggtttgtaaatgcaccaat cagcactctgtgtctagctcagggtttgtaaatacaccaatccacactgtgtacctagct aatctagtggggacgtggagaacttttgtgtctagctcagggattgtaaacgcaccaatc agcaccctgtcaaaacggaccaatcagctctccataaaacagaccaatcggctctctgta aaatggaccaatcagcaggatgtggaactacgtgatcaaagggaatccaagattgaaggg ttagaggaagtgaaaggaaatttaattgagaaaggacagggcatccttagtgaagaattg ggatccaaagtcagaaaaagcagtaactccattcttccgtttaatcaggctaaaaaccta gaaattgtgctgctttttctctttctctcatatcccacatccatccaaccaggaaatctt gttgactctctctttgaatatatccagaagtctattcagtatctgagcccagcagtcagt gaccacatctgcacccagctcctgcaggacctcagcacctcctcaggaatctttgcagaa atgcctcttgaatgggagaagctgctgctcccccaaattgctggggaagaaagggagttc ctgatctttcttattgaggctcatggagcttctgtcctggagattcagtatacttcagta atggctctggagaagagaactcaggatcctcaagacccaatggctcgtgatgataaaagc ctttgggtctaccgacattttccctacgtgctcagacatgtctgcatgcatttattggca tatttttccatactcaaagcaagttgctatgtgttagtactaaagcaacccataatagta aaaacgaatttctgcaccaagagaatgaatcggcagctgagaaagtcagagtga >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_2|97_aa MTLAVAMAVWLSEELLEDILQQKMKMHQEQESEFRKLESSLGEQHKKTQPKAHAGGSENE SRGHRAGNKTYRQMTCYDQKQIVRGLHKSVSGLENNG >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_2|294_bp atgacactggcagtggctatggctgtctggctatctgaggaattattggaggacatactc caacagaagatgaaaatgcaccaagaacaagaaagtgaattcaggaaactggagtccagt ttaggagagcaacacaagaaaacccagcccaaagcccatgcaggtggctctgagaatgaa tccagaggacacagggctggaaacaaaacgtaccgacagatgacctgctatgatcagaaa caaatagtcaggggtcttcataaatcagttagtggacttgagaacaatgggtga >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_3|342_aa MGVAILEPTLPIWMMQTMCSPKWQLVQCPSTVPAMAQVGPDVAWPPTLEGASGTSAYRIS YAHRCSSPAPVFPPFLTTKKGVQAAGLWWSITSVPNIAFAIRGPVPVLKYWVISSKALAK ELRLGDCIAQGLAFLPASVSYLIGTNLFGVLANKMGRWLCSLIGMLVVGTSLLCVPLAHN IFGLIGPNAGLGLAIGMVDSSMMPIMGHLVDLRHTSVYGSVYAIADVAFCMGFAIGPSTG GAIVKAIGFPWLMVITGVINIVYAPLCYYLRSPPAKEEKLHKHPEGLMGLGYRQRLLKGT KEDMKAILSQDCPMETRMYATQKPTKEFPLGEDSDEEPDHEE >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_3|1029_bp atgggggtggccatcctggagcccacactgcccatctggatgatgcagaccatgtgctcc cccaagtggcagctggtgcagtgcccctcaactgtgccagccatggctcaagtgggccca gatgtggcctggcctcccactctggagggtgcaagtgggacatctgcttaccggattagt tacgctcaccgatgtagcagtcctgcacctgttttcccgcctttcttgaccacaaagaaa ggagtccaggctgctggattatggtggtccattaccagcgtgcccaacattgcctttgcg atcaggggcccagttccagtgttaaagtactgggtcatcagttctaaggccctcgccaag gagctgaggcttggagattgtattgcacagggtctagctttcttgcctgccagtgtgtcc tacctcattggcaccaacctctttggtgtgttggccaacaagatgggtcggtggctgtgt tccctaatcgggatgctggtagtaggtaccagcttgctctgtgttcctctggctcacaat atttttggtctcattggccccaatgcagggcttggccttgccataggcatggtggattct tctatgatgcccatcatggggcacctggtggatctacgccacacctcggtgtatgggagt gtctacgccatcgctgatgtggctttttgcatgggctttgctataggtccatccaccggt ggtgccattgtaaaggccatcggttttccctggctcatggtcatcactggggtcatcaac atcgtctatgctccactctgctactacctgcggagccccccggcaaaggaagagaagctt cacaaacatcctgagggcctgatgggtttgggataccgacagaggctgctcaagggaaca aaagaagacatgaaggctattctgagtcaggactgccccatggagacccggatgtatgca acccagaagcccacgaaggaatttcctctgggggaggacagtgatgaggagcctgaccat gaggagtag >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_4|279_aa MLRTILDAPQRLLKEGRASRQLVLVVVFVALLLDNMLFTVVVPIVPTFLYDMEFKEVNSS LHLGHAGSSPHALASPAFSTIFSFFNNNTVAVEESVPSGIAWMNDTASTIPPPATEAISA HKNNCLQGTGFLEEEITRVGVLFASKAVMQLLVNPFVGPLTNRIGYHIPMFAGFVIMFLS TVMFAFSGTYTLLFVARTLQGIGSSFSSVAGLGMLASVYTDDHERGRAMGTALGGLALGL LVGAPFGSVMYEFVGKSAPFLILAFLALLDGDLLSIDDS >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_4|840_bp atgctccggaccattctggatgctccccagcggttgctgaaggaggggagagcgtcccgg cagctggtgctggtggtggtattcgtcgctttgctcctggacaacatgctgtttactgtg gtggtgccaattgtgcccaccttcctatatgacatggagttcaaagaagtcaactcttct ctgcacctcggccatgccggaagttccccacatgccctcgcctctcctgccttttccacc atcttctccttcttcaacaacaacaccgtggctgttgaagaaagcgtacctagtggaata gcatggatgaatgacactgccagcaccatcccacctccagccactgaagccatctcagct cataaaaacaactgcttgcaaggcacaggtttcttggaggaagagattacccgggtcggg gttctgtttgcttcaaaggctgtgatgcaacttctggtcaacccattcgtgggccctctc accaacaggattggatatcatatccccatgtttgctggctttgttatcatgtttctctcc acagttatgtttgctttttctgggacctatactctactctttgtggcccgaacccttcaa ggcattggatcttcattttcatctgttgcaggtcttggaatgctggccagtgtctacact gatgaccatgagagaggacgagccatgggaactgctctggggggcctggccttggggttg ctggtgggagctccctttggaagtgtaatgtacgagtttgttgggaagtctgcacccttc ctcatcctggccttcctggcactactggatggagatttgttatccattgatgattcttga >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_5|548_aa MALRAMRGIVNGAAPELPVPTGGPAVGAREQALAVSRNYLSQPRLTYKTVSGVNGPLVIL DHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGTSGIDAKKTSCEFTGDILR TPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINPQCRIYPEEMIQTGISAIDGM NSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKDVVDYSEENFAIVFAAMGVNMET ARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPRLALTTAEFLAYQCEKHVLVILTDM SSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVEGRNGSITQIPILTMPNDD ITHPIPDLTGYITEGQIYVDRQLHNRQYACYAIGKDVQAMKAVVGEEALTSDDLLYLEFL QKFERNFIAQVALKCGPSTKEASLVIYLLDVHCSDKTWKVQPCPIRERVPFPRGEAPGKA ALGITPRQQLLMPERGSPPSPAVAAPLDPEAQPPLLSLVTDNAYRLCHQLFQSKNYLSAQ CPKKRERA >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_5|1647_bp atggcgctgcgggcgatgcgggggattgtcaacggggccgcacccgagctacccgtgccc accggtgggccggcggtgggagctcgggagcaggcgctggcagtcagtcggaactacctc tcccagcctcgcctcacatacaagacagtatctggagtcaatggtccactagtgatctta gatcatgttaagtttcccaggtatgctgaaattgtccatttgaccttaccggatggcaca aagagaagtgggcaagttctggaagttagtggttccaaggcagtagttcaggtatttgaa gggacttcaggtatagatgctaagaaaacgtcctgtgagtttactggggatattctccga acaccggtgtctgaggatatgcttggtcgggtattcaatggatcgggaaaacccattgac agaggtcctgttgtactggccgaagacttccttgatatcatgggtcagccaatcaaccct caatgtcgaatctacccagaggaaatgattcagactggcatttcggccatcgatgggatg aacagtattgctagggggcagaaaattcctatcttctctgctgctgggctaccacacaat gagattgcagctcagatctgtcgccaggctggtttggtaaagaaatccaaagatgtagta gactacagtgaggaaaattttgcaattgtatttgctgctatgggtgtaaacatggaaact gcccggttcttcaaatctgactttgaagaaaatggctcaatggacaatgtctgcctcttt ttgaacttggctaatgacccaaccattgagcgaattatcactcctcgcctggctctaacc acagctgaatttctggcgtaccaatgtgagaaacatgtattggttattctaacagacatg agttcttatgctgaagcacttcgagaggtttcagcagccagggaagaggtacctggtcga cgaggttttccaggttacatgtatacagatttagccacgatatatgaacgcgctgggcga gtggaagggagaaacggctcgattactcaaatccctattctaaccatgcctaatgatgat atcactcaccccatcccagacttgactggctacattacagaggggcagatctatgtggac agacagctgcacaacagacagtatgcgtgctatgctattggaaaggatgtgcaagccatg aaagctgtcgttggagaagaagcccttacctcagatgatcttctctacttggaatttctg cagaagtttgagaggaacttcattgctcaggtcgcactcaagtgtggccctagcaccaag gaagcttcattagtgatttacctactagatgttcattgttcagataaaacttggaaagtg cagccctgtcccatcagggagcgagtgcccttcccccgaggagaagctccagggaaagca gccctgggcatcacccccaggcagcagctgctcatgcccgagagagggagcccaccttcc ccagctgtggctgctcccttagaccccgaggcccagccaccactgctcagtttggtcact gacaatgcctacagattgtgtcaccagttattccagagcaagaattacctctcagctcag tgcccgaaaaaacgagagagggcttga >gi568815590r:20049676_20280964|GENSCAN_predicted_peptide_6|690_aa MEDPGLESEEDHSVDLGTVKEDQNFGERGAPNMETRSTLLGSQHPGDKEAGGKEHKDDAR NQGTRQPRQHPPPGCLRAFPRPCPALPHPQPRVTMGSVSSLISGHSFHSKHCRASQYKLR KSSHLKKLNRYSDGLLRFGFSQDSGHGKSSSKMGKSEDFFYIKVSQKARGSHHPDYTALS SGDLGGQAGVDFDPSTPPKLMPFSNQLEMGSEKGAVRPTAFKPVLPRSGAILHSSPESAS HQLHPAPPDKPKEQELKPGLCSGALSDSGRNSMSSLPTHSTSSSYQLDPLVTPVGPTSRF GGSAHNITQGIVLQDSNMMSLKALSFSDGGSKLGHSNKADKGPSCVRSPISTDECSIQEL EQKLLEREGALQKLQRSFEEKELASSLAYEERPRRCRDELEGPEPKGGNKLKQASQKSQR AQQVLHLQVLQLQQEKRQLRQELESLMKEQDLLETKLRSYEREKTSFGPALEETQWEVCQ KSGEISLLKQQLKESQTEVNAKASEILGLKAQLKDTRGKLEGLELRTQDLEGALRTKGLE LEVCENELQRKKNEAELLREKVNLLEQELQELRAQAALARDMGPPTFPEDVPALQRELER LRAELREERQGHDQMSSGFQHERLVWKEEKEKVIQYQKQLQQSYVAMYQRNQRLEKALQQ LARGDSAGEPLEVDLEGADIPYEDIIATEI >gi568815590r:20049676_20280964|GENSCAN_predicted_CDS_6|2073_bp atggaagaccctggtctagagagtgaagaggaccacagtgtggacctggggacagtgaag gaagatcagaattttggagagaggggagcccccaacatggagaccaggtctacactcttg ggctctcagcaccctggggacaaagaagcaggaggaaaagaacacaaggacgatgccaga aaccaggggacccgacagcccagacagcatcctccaccagggtgcctgagagcctttcca agaccctgcccggccctgccccatcctcagccccgagtcaccatgggcagcgtcagtagc ctcatctccggccacagcttccacagcaagcactgccgggcttcgcagtacaagctgcgc aagtcctcccacctcaagaagctcaaccggtattccgacgggctgctgaggtttggcttc tcccaggactccggtcacggcaagtccagctccaaaatgggcaagagcgaagacttcttc tacatcaaggtcagccagaaagcccggggctcccatcacccagattacacggcactgtcc agcggggatttagggggccaggctggggtggactttgacccgtccacaccccccaagctc atgcccttctccaatcagctagaaatgggctccgagaagggtgcagtgaggcccacagcc ttcaagcctgtgctgccacggtcaggagccatcctgcactcctccccggagagtgccagc caccagctgcaccccgcccctccagacaagcccaaggagcaggagctgaagcctggcctg tgctctggggcgctgtcagactccggccggaactccatgtccagcctgcccacacacagc accagcagcagctaccagctggacccgctggtcacacccgtgggacccacaagccgtttt gggggctccgcccacaacatcacccagggcatcgtcctccaggacagcaacatgatgagc ctgaaggctctgtccttctccgacggaggtagcaagctgggccactcgaacaaggcagac aagggcccctcgtgtgtccgctcccccatctccacggacgagtgcagcatccaggagctg gagcagaagctgttggagagggagggcgccctccagaagctgcagcgcagctttgaggag aaggagcttgcctccagcctggcctacgaggagcggccgcggcgctgcagggacgagctg gagggcccggagcccaaaggcggcaacaagctcaagcaggcctcgcagaagagccagcgc gcgcagcaggtcctgcacctgcaggtactgcagcttcagcaggagaagcggcagctccgg caggagctcgagagcctcatgaaggagcaggacctgctggagaccaagctcaggtcctac gagagggagaagaccagcttcggccccgcgctggaggagacccagtgggaggtgtgccag aagtcaggcgagatctccctcctgaagcagcagctgaaggagtcccagacggaggtgaac gccaaggctagcgagatcctgggtctcaaggcacagctgaaggacacgcggggcaagctg gagggcctggagctgaggacccaggacctggagggcgccctgcgcaccaagggcctggag ctggaggtctgtgagaatgagctgcagcgcaagaagaacgaggcggagctgctgcgggag aaggtgaacctgctggagcaggagctgcaggagctgcgggcccaggccgccctggcccgc gacatggggccgcccaccttccccgaggacgtccctgccctgcagcgggagctggagcgg ctgcgggccgagctgcgggaggagcggcaaggccatgaccagatgtcctcgggcttccag catgagcggctcgtgtggaaggaggagaaggagaaggtgattcagtaccagaaacagctg cagcagagctacgtggccatgtaccagcggaaccagcgcctggagaaggccctgcagcag ctggcacgtggggacagcgccggggagcccttggaggttgacctggaaggggctgacatc ccctacgaggacatcatagccactgagatctga