GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:56:33

Sequence gi568815584f:55129217_55329678 : 200462 bp : 40.54% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +     44     84   41  1  2   46   89   104 0.439   2.40
 1.02 Intr +    239    431  193  2  1   64   63   207 0.257  14.37
 1.03 Intr +   8829   9152  324  2  0  114   85   163 0.770  13.65
 1.04 Term +  15900  16055  156  2  0   73   33   146 0.969   4.55
 1.05 PlyA +  16177  16182    6                                1.05

 2.15 PlyA -  16234  16229    6                                1.05
 2.14 Term -  19326  19135  192  0  0   40   38    98 0.289  -3.56
 2.13 Intr -  21632  21583   50  0  2  105   89    24 0.666   1.68
 2.12 Intr -  22725  22479  247  0  1  105   94   170 0.998  15.41
 2.11 Intr -  23431  23374   58  0  1  109  103    89 0.953  10.47
 2.10 Intr -  25590  25401  190  1  1   79   68   185 0.910  13.32
 2.09 Intr -  29525  29306  220  2  1   87   92   167 0.998  13.95
 2.08 Intr -  33859  33755  105  1  0   57  110    23 0.616   0.89
 2.07 Intr -  40343  40183  161  0  2   87   91   178 0.841  16.79
 2.06 Intr -  51562  51440  123  2  0   94   81    73 0.957   6.84
 2.05 Intr -  52081  51997   85  1  1   41   97   135 0.999   8.17
 2.04 Intr -  54537  54344  194  1  2   72   94   115 0.904   8.79
 2.03 Intr -  59964  59726  239  2  2   89  108   147 0.129  13.14
 2.02 Intr -  69798  69632  167  0  2  109   71    59 0.091   4.14
 2.01 Init -  82453  82394   60  1  0   61  100    36 0.153   1.57
 2.00 Prom -  85277  85238   40                               -6.15

 3.00 Prom +  85913  85952   40                               -2.85
 3.01 Sngl + 100001 100465  465  1  0   99   38   405 0.994  32.49
 3.02 PlyA + 100730 100735    6                                1.05

 4.03 PlyA - 102017 102012    6                                1.05
 4.02 Term - 105786 105331  456  0  0   10   34   899 0.701  70.84
 4.01 Init - 111639 111412  228  0  0   50   39   218 0.864  11.52
 4.00 Prom - 118114 118075   40                               -5.55

 5.00 Prom + 121745 121784   40                               -5.75
 5.01 Sngl + 125477 125929  453  1  0   61   32   237 0.676  11.35
 5.02 PlyA + 127381 127386    6                                1.05

 6.05 PlyA - 128116 128111    6                                1.05
 6.04 Term - 133940 133875   66  2  0   81   39    79 0.320  -0.74
 6.03 Intr - 135833 135646  188  0  2   86   47   149 0.419   9.09
 6.02 Intr - 142944 142792  153  1  0   52   37   254 0.043  15.82
 6.01 Init - 155753 155705   49  2  1   69  116    17 0.482   3.76
 6.00 Prom - 156825 156786   40                               -3.35

 7.00 Prom + 158512 158551   40                              -10.55
 7.01 Init + 158651 158874  224  1  2   90   60   136 0.261   9.08
 7.02 Term + 169412 170027  616  2  1  -81   49   943 0.215  66.45
 7.03 PlyA + 170257 170262    6                                1.05

 8.00 Prom + 175191 175230   40                               -1.65
 8.01 Init + 178097 178184   88  1  1   38  113    76 0.520   5.95
 8.02 Term + 180084 180175   92  1  2   74   38    50 0.429  -4.50
 8.03 PlyA + 180212 180217    6                                1.05

 9.03 PlyA - 180477 180472    6                                1.05
 9.02 Term - 183560 183408  153  2  0   54   54   182 0.447   8.34
 9.01 Init - 184251 184042  210  0  0   63   57   250 0.704  18.23
 9.00 Prom - 189469 189430   40                               -2.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  12884  13013  130  0  1  100   64    94 0.821   7.03
S.002 Intr +  13368  13533  166  0  1   80  106    53 0.996   5.34
S.003 Init -  40749  40701   49  0  1   76   99    36 0.874   2.66
S.004 Intr -  50483  50413   71  0  2   62   75   102 0.874   4.28
S.005 Init -  59963  59726  238  2  1   60  108   147 0.838  12.12
S.006 Term - 142944 142787  158  1  2   52   49   264 0.892  16.01

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_1|237_aa
HLLASSRPEPANERSGAAGSDLGPGQSPLIIEGAGVRGRLAAPYETHTRPRGGTGHLLRS
LVGFAAVAPPPPALCGPRLHDALSGSGNPNPQGWPGAWGNQPAGAGGYPGASYPGAYPGQ
APPGAYPGQAPPGAYPGAPGAYPGAPAPGVYPGPPSGPGAYPSSGQPSATGAYPATGPYG
APAGPLIQVLVEPDHFKVAVNDAHLLQYNHRVKKLNEISKLGISGDIDLTSASYTMI

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_1|714_bp
cacctcctcgccagcagccgtccggagccagccaacgagcggagcggggcggcgggcagc
gatctgggcccggggcagtcgcctttgattatcgagggcgctggcgttcggggaaggttg
gcagcaccttacgagacccacacacgtccccggggcggcacgggccaccttctgcggagc
ctcgtgggcttcgccgccgtcgcacctccgccgcctgcgctctgcggccccagactccat
gatgcgttatctgggtctggaaacccaaaccctcaaggatggcctggcgcatgggggaac
cagcctgctggggcagggggctacccaggggcttcctatcctggggcctaccccgggcag
gcacccccaggggcttatcctggacaggcacctccaggcgcctaccctggagcacctgga
gcttatcccggagcacctgcacctggagtctacccagggccacccagcggccctggggcc
tacccatcttctggacagccaagtgccaccggagcctaccctgccactggcccctatggc
gcccctgctgggccactgatacaagtactggttgaacctgaccacttcaaggttgcagtg
aatgatgctcacttgttgcagtacaatcatcgggttaaaaaactcaatgaaatcagcaaa
ctgggaatttctggtgacatagacctcaccagtgcttcatataccatgatataa

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_2|696_aa
MRIGLMGSLMPVAPSLWEAKLEAQGCCPALSTPLSRDCIIIAKQAYNLLRLPSTYIMAQR
NYSFSELFPETRCHDKMSSSHFASRHRKDISTEMIRTKIAHRKSLSQKENRHKEYERNRH
FGLKDVNIPTLEGRILVELDETSQGLVPEKTNVKPRAMKTILGDQRKQMLQKYKEEKQLQ
KLKEQREKAKRGIFKVGRYRPDMPCFLLSNQNAVKAEPKKIDNESDVRAIRPGPRQTSEK
KVSDKEKKVVQPVMPTSLRMTRSATQAAKQVPRTVSSTTARKPVTRAANAKDLIRTAVGQ
TRLLMKERFKQFEGLVDDCEYKRGIKETTCTDLDGFWDMVSFQIEDVIHKFNNLIKLEES
GWQVNNNMNHNMNKNVFRKKVVSGIASKPKQDDAGRIAARNRLAAIKNAMRERIRQEECA
ETAVSVIPKEVDKIVFDAGFFRVESPVKLFSGLSVSSEGPSQRLGTPKSVNKAVSQSRNE
MGIPQQTTSPENAGPQNTKSEHVKKTLFLSIPESRSSIEDAQCPGLPDLIEENHVVNKTD
LKVDCLSSERMSLPLLAGGVADDINTNKKEGISDVVEGMELNSSITSQDVLMSSPEKNTA
SQNSILEEGETKISQSELFDNKSLTTECHLLDSNVEELIGETLDGYFTDFSGFVLQPGLN
CSNPFTQLERRHQEHARHISFGGNLITFSPLQPGEF

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_2|2091_bp
atgagaataggcctgatgggctcgctcatgcctgtagccccatcactctgggaggccaag
cttgaagctcagggctgctgtccagccctgtccactcccctgtcaagggattgtatcatc
attgcgaaacaagcatataatcttctacgacttccaagtacatacataatggcacaaagg
aattatagtttttcagagctcttccctgaaaccagatgccatgacaagatgtcttcatca
cattttgccagtcgacacaggaaggatataagtactgaaatgattagaactaaaattgct
cataggaaatcactgtctcagaaagaaaatagacataaggaatacgaacgaaatagacac
tttggtttgaaagatgtaaacattccaaccttggaaggtagaattcttgttgaattagat
gagacatctcaagggcttgttccagaaaagaccaatgttaagccaagggcaatgaaaact
attctaggtgatcaacgaaaacagatgctccaaaaatacaaagaagaaaagcaacttcaa
aaattgaaagagcagagagagaaagctaaacgaggaatatttaaagtgggtcgttataga
cctgatatgccttgttttcttttatcaaaccagaatgctgtgaaagctgagccaaaaaag
attgataacgagagtgatgttcgagcaatccgacctggtccaagacaaacttctgaaaag
aaagtgtcagacaaagagaaaaaagttgtgcagcctgtaatgcccacgtcgttgagaatg
actcgatcagctactcaagcagcaaagcaggttcccagaacagtctcatctaccacagca
agaaagccagtcacaagagctgctaatgctaaagatcttattcgcacagcagttggtcaa
acaagactccttatgaaggaaaggtttaaacagtttgaaggactggttgatgattgtgaa
tataaacgaggtataaaggagactacctgtacagatctggatggattttgggatatggtt
agttttcagatagaagatgtaatccacaaattcaacaatctgatcaaacttgaggaatct
gggtggcaagtcaataataatatgaatcataatatgaacaaaaatgtctttaggaaaaaa
gttgtctcaggtatagcaagtaaaccaaaacaggatgatgctggaagaattgcagcgaga
aatcgcctagctgccataaaaaatgcaatgagagagagaattaggcaggaagaatgtgct
gaaacagcagtttctgtgataccaaaggaagttgataaaatagtgttcgatgctggattt
ttcagagttgaaagtcctgttaaattattctcaggactttctgtctcttctgaaggccct
tctcaaagacttggaacacctaagtctgtcaacaaagctgtatctcagagtagaaatgag
atgggcattccacaacaaactacatcaccagaaaatgccggtcctcagaatacgaaaagt
gaacatgtgaagaagactttgtttttgagtattcctgaaagcaggagcagcatagaagat
gctcagtgtcctggattaccagatttaattgaagaaaatcatgttgtaaataagacagac
ttgaaggtggattgtttatccagtgagagaatgagtttgcctcttcttgctggtggagta
gcagatgatattaatactaacaaaaaagaaggaatttcagatgttgtggaaggaatggaa
ctgaattcttcaattacatcacaggatgttttgatgagtagccctgaaaaaaatacagct
tcacaaaatagcatcttagaagaaggggaaactaaaatttctcagtcagaactatttgat
aataaaagtctcactactgaatgccaccttcttgattcaaatgttgaagaattaattggt
gaaactcttgatggatactttactgacttctctggatttgttctccagccaggtctaaac
tgcagtaatccatttactcagctggagaggagacatcaagaacatgccagacacatttct
tttggtggtaacctgattactttttcacctctacaaccaggagaattttga

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_3|154_aa
MAASRRLMKELEEIRKCGMKNFCNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEINFPAE
YPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPKHP
LRADLAEEYSKDRKKFCKNAEEFTKKYGEKRPVD

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_3|465_bp
atggcggccagcaggaggctgatgaaggagcttgaagaaatccgcaaatgtgggatgaaa
aacttctgtaacatccaggttgatgaagctaatttattgacttggcaagggcttattgtt
cctgacaaccctccatatgataagggggccttcagaatagaaatcaactttccagcagag
tacccattcaaaccaccgaagatcacatttaaaacaaagatctatcacccaaacatcgac
gaaaaggggcaggtctgtctgccagtaattagtgctgaaaactggaagccagcaaccaaa
accgaccaagtaatccagtccctcatagcactggtgaatgacccccagcccaagcacccg
cttcgggctgacctagctgaagaatactctaaggaccgtaaaaaattctgtaagaatgct
gaagagtttacaaagaaatatggggaaaagcgacctgtggactaa

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_4|227_aa
MNTLVSYDPVPEPKIVDAALRACRRLNDFASAVGILEAVKDKAGPHKEIYPYVIQELRPT
LNELGISTPEDLGLDKSETLSRKKRKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEL
FQKKKRKKKKRKKKHYSKRRRRRGRRSTIPEEEEEQEEEEEEEEEEEEEALFQKKKKKRK
KKKKKHYSRRRRRKKKKKKKKKKKKNYSKRETLSLSSQRSQTDSFIV

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_4|684_bp
atgaacacacttgttagctatgatccggttccagagcccaaaattgttgatgctgctttg
cgggcatgcagacggttaaacgattttgctagtgcagttggcatcctagaggctgttaag
gacaaagcaggacctcataaggaaatctacccctatgtcatacaggaactcagaccaact
ttaaatgaactgggaatctccactccggaggatctgggccttgacaagagcgagactctg
tctcgaaaaaagagaaaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaggaagaggaagaggaagaggaagaggaagaggaagaagaagaagaagaggaagaacta
ttccaaaagaagaagaggaagaagaagaagaggaagaagaagcactattccaaaagaaga
agaagaagaggaagaagaagcactattccagaagaagaagaagagcaagaagaagaagag
gaagaagaagaagaggaggaagaagaagcactattccaaaagaagaagaagaagaggaag
aagaagaagaagaagcactattccagaagaagaagaagaaagaagaagaagaagaagaag
aagaagaagaagaagaactattccaaaagagaaacattaagtttaagtagtcaaagaagt
caaacagattcctttattgtgtaa

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_5|150_aa
MHTFICLEKNPYIPNGIFGRGPRGSLGNPLDDQIPREKGTFNCTVMEIQGSASCMNHMLP
LLGQDPRERLPSSGGQLHLNVNLLQEQVRPCLGVLHTNRHLGSSFGLPSYMTREPLVPVD
SSLILLSGLRCYRGDRRAPGRTMLPFHPLL

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_5|453_bp
atgcatacattcatatgtctggaaaagaatccttatattccaaatgggatatttggcaga
ggcccaagaggttcactagggaatcctctggatgaccagatccccagggaaaaaggaaca
ttcaactgtactgtcatggagatccagggctcagcatcttgtatgaatcatatgctgccc
ttgttgggtcaggatcccagagaaaggctcccttcatcaggaggccaactccatctcaac
gtgaatcttctacaggagcaagtgagaccttgtcttggtgttcttcacactaacagacac
ttgggatcttcttttggcttaccttcctacatgacccgggagccgctggttccagttgat
tcttctctcatcctcctgtcagggttgagatgttatagaggggacagaagagcaccagga
cgtactatgttgcctttccatcccttgctgtaa

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_6|151_aa
MVKIHKTKQDNCSKNQVTSQDPSSDVTDTRTHSPMTSSWTEDEPQSTCASPEAGEASEPP
SRLVVEEGAAHKINELVFPKNNDSWVIQTSLMAFRSLIGLDGIQKFDRPRDLTFHVATFL
METLNHVQRQNPDLGYPICASGEVGPEPSCE

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_6|456_bp
atggtaaagattcacaaaacaaaacaagacaactgcagtaagaaccaagtgacatcccag
gacccatcttctgacgtcacagacacacggacccacagccctatgacttcatcgtggact
gaggacgaaccgcagtcgacttgcgccagcccagaggctggggaggcgtccgagcctccg
tcccgcctcgtcgttgaagaaggggctgcccacaagattaatgaacttgtttttccaaag
aacaatgattcttgggtcattcagacctccttgatggcattcagaagtttgatcggcctt
gatggcattcagaagtttgatcggcctagggatctcaccttccacgtagctacctttctc
atggaaacactcaatcatgttcaaaggcagaatcctgatctgggctaccctatctgtgct
tctggtgaggtgggcccagaaccatcgtgtgaataa

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_7|279_aa
MHLHHVGNKHFFCCVNNVIFIVVCLCFDCDLSHEVGCGIFHLWHRVGAQKVLDFGPFRIS
DFWIRNSQPVLSARRLARRALFGAGGGKAGKGGLTLQEAIQRLRDTEEMLSKKQEFLEKK
IEQRHGTKNKPAALQALKRKKRYEKQLAQIDGTLSTIEFQQQALENANTNTEVLKNMGSA
AKAKKAAHDNMDIDKVDELMQDIADQQELGEEISTAISKPVGFGEKSDEDELMAELEELE
QEEPDKNLLEVSGPETVPLPNVPSIALPSKPAKKRKTTT

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_7|840_bp
atgcatttgcatcatgtcggcaataagcactttttttgctgtgttaacaatgtcatcttc
attgttgtgtgcctgtgttttgactgtgacctgtcacatgaggttgggtgtggaattttc
cacttgtggcatcgtgttggagctcaaaaagttttggattttggaccatttcggatttcg
gatttttggattaggaattctcaacctgtgctatcagccaggaggctggcgcggcgagcg
ctgttcggggctggagggggtaaggccggcaagggcggcctgaccctccaggaggccatc
cagcggctgcgggacacggaggagatgttaagcaagaaacaggagttcctggagaagaaa
atcgagcagaggcacggcaccaaaaacaagcccgcggccctccaggcactgaagcgtaag
aagaggtatgagaagcagctggcgcagatcgacggtacattatcaaccatcgagttccag
cagcaggccctggagaatgccaacaccaataccgaggtgctcaagaacatgggctctgca
gccaaggccaagaaggcggcccacgacaacatggacatcgataaagttgatgagttaatg
caggacattgctgaccagcaagaacttggggaggagatttcaacagcaatttcgaaacct
gtagggtttggagaaaagtctgacgaggatgagctcatggcggaattagaagaactagaa
caggaggaaccagacaagaatttgctggaagtcagtggccccgaaacagtccctctacca
aatgttccctctatagccctaccatcaaaacctgccaagaagaggaagacgacgacatga

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_8|59_aa
MCLHICHCNFKRTSLNSFRGGKQNNVREEGVWRCVDGGDVSTAMSQVSRTMGYDCVTFL

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_8|180_bp
atgtgtttgcatatttgtcactgcaactttaagcggacctctttgaattcttttcgtgga
ggcaagcaaaacaatgtgcgggaagaaggagtgtggcggtgtgtggatgggggagatgtc
tctacagcaatgtctcaggtttccagaacgatgggctatgattgtgtcaccttcctctag

>gi568815584f:55129217_55329678|GENSCAN_predicted_peptide_9|120_aa
MWKQFWNWVTGKGCNNLEDSEEDMKMWESLELPRNLLNGSDQNADSDMVNEVQDEVVSDE
EEDLLRTGIKSQDLVPRIPTGVKMEQRTAQAITSEGASPKPWQLTHGVGPAGARKSRIEV

>gi568815584f:55129217_55329678|GENSCAN_predicted_CDS_9|363_bp
atgtggaagcaattttggaactgggtaacaggcaaaggttgcaacaatttggaggactca
gaagaagacatgaagatgtgggaaagtttggaacttcctagaaacttgctgaatggctct
gaccaaaatgctgatagtgatatggtcaatgaagtccaggatgaggtggtctcagatgaa
gaggaggacttattgagaactggaataaagtctcaagacttggtgccccgcatccccact
ggggtgaaaatggagcaacgtacagctcaggccattacttcagagggtgcaagccccaag
ccttggcagcttacacatggtgttgggcctgcaggtgcacgaaagtcaagaattgaggtt
tga