GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:20:03

Sequence gi568815575r:111201690_111510398 : 308709 bp : 38.96% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2778   2886  109  2  1   79  -57   182 0.063   3.23
 1.02 Intr +  12244  12326   83  2  2   51   67    63 0.273  -1.06
 1.03 Intr +  14732  14869  138  1  0   70   67   157 0.802  11.54
 1.04 Term +  18669  18758   90  2  0  113   47    34 0.609  -1.46
 1.05 PlyA +  18805  18810    6                                1.05

 2.04 PlyA -  18931  18926    6                                1.05
 2.03 Term -  21553  21423  131  2  2   97   42    60 0.268  -0.34
 2.02 Intr -  29649  29545  105  1  0   51  106    38 0.065   1.17
 2.01 Init -  41713  41491  223  1  1   66   39   174 0.172   9.06
 2.00 Prom -  41957  41918   40                               -6.65

 3.13 PlyA -  42234  42229    6                                1.05
 3.12 Term -  45070  44888  183  1  0  144   35   216 0.989  18.46
 3.11 Intr -  45815  45679  137  0  2   68  110   102 0.998   9.77
 3.10 Intr -  46303  46182  122  0  2   91   69   163 0.999  13.92
 3.09 Intr -  47082  46880  203  0  2   97   31   144 0.976   6.76
 3.08 Intr -  47368  47246  123  1  0  101   93   148 0.999  16.46
 3.07 Intr -  49414  49228  187  0  1   95   36   238 0.999  18.07
 3.06 Intr -  49597  49520   78  0  0   81   73   123 0.996   7.85
 3.05 Intr -  50053  49860  194  1  2   86   82   247 0.983  21.27
 3.04 Intr -  50810  50618  193  1  1   56   68   219 0.635  15.27
 3.03 Intr -  51527  51319  209  2  2   91   78   111 0.999   7.35
 3.02 Intr -  52714  52583  132  1  0   83   62    80 0.965   4.82
 3.01 Init -  62247  62083  165  0  0   61   93   135 0.973  10.98
 3.00 Prom -  66485  66446   40                               -5.35

 4.04 PlyA -  66786  66781    6                                1.05
 4.03 Term -  68751  68589  163  2  1   13   33   272 0.440  10.63
 4.02 Intr -  69264  69092  173  0  2   70   69    78 0.287   1.92
 4.01 Init -  71402  71316   87  2  0   70   86    53 0.605   3.99
 4.00 Prom -  81697  81658   40                               -3.75

 5.10 PlyA -  81967  81962    6                                1.05
 5.09 Term -  95672  95537  136  1  1   58   47   119 0.036   1.31
 5.08 Intr - 100054  99941  114  0  0  111   42    68 0.028   3.14
 5.07 Intr - 110730 110586  145  1  1   77   26    62 0.001  -2.68
 5.06 Intr - 129352 129215  138  2  0   95  113   181 0.956  20.81
 5.05 Intr - 131464 131362  103  1  1   97  103   129 0.999  14.13
 5.04 Intr - 138763 138618  146  2  2   74   53   103 0.005   4.48
 5.03 Intr - 151129 151021  109  1  1  115   30    59 0.013   1.74
 5.02 Intr - 167307 167272   36  0  0   87   84    42 0.045   1.24
 5.01 Init - 170487 170398   90  0  0   84   70    60 0.040   4.34
 5.00 Prom - 178118 178079   40                               -3.85

 6.07 PlyA - 178194 178189    6                                1.05
 6.06 Term - 184713 184544  170  1  2   81   53    98 0.754   2.66
 6.05 Intr - 194424 194292  133  0  1    2  100    98 0.865   1.70
 6.04 Intr - 199641 199301  341  1  2   88   96   425 0.911  37.57
 6.03 Intr - 208731 208346  386  2  2  104   86   575 0.916  52.47
 6.02 Intr - 209457 209420   38  0  2  114   36    63 0.597  -0.16
 6.01 Init - 215023 214883  141  1  0   81   44   119 0.485   6.98
 6.00 Prom - 225626 225587   40                               -1.65

 7.00 Prom + 228822 228861   40                               -7.95
 7.01 Sngl + 233120 233335  216  1  0   68   43   350 0.871  23.42
 7.02 PlyA + 233954 233959    6                                1.05

 8.05 PlyA - 234553 234548    6                                1.05
 8.04 Term - 242857 242721  137  2  2   77   41    75 0.531  -1.10
 8.03 Intr - 243194 243093  102  0  0   91  100    89 0.941   9.63
 8.02 Intr - 266397 266314   84  1  0   42   93    71 0.443   1.87
 8.01 Init - 269479 269272  208  1  1  110   50   138 0.848  11.48
 8.00 Prom - 270941 270902   40                               -5.65

 9.07 PlyA - 271467 271462    6                                1.05
 9.06 Term - 274670 274459  212  0  2   24   37   215 0.750   6.47
 9.05 Intr - 276730 276579  152  0  2  115   54    81 0.547   6.29
 9.04 Intr - 281189 281095   95  2  2   38   33    95 0.045  -3.16
 9.03 Intr - 293084 293030   55  1  1  103   61    64 0.102   3.26
 9.02 Intr - 305880 305774  107  0  2   16   89    81 0.079  -0.91
 9.01 Intr - 307513 307364  150  1  0   36   81   123 0.434   5.84

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 246361 246325   37  1  1   80   78    36 0.829   2.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_1|139_aa
MECDDRHLRTEMPMHVFYPMAFEICVFANEELFYQEIKTGFVTDLYVGMEVLIAATCISC
SSMKALYLIATNGTPELQNPERLSAVFRDFLNRCLEMDVDRRGSAKELLQHPFLKLAKPL
SSLTPLIIAAKEAIKNSSR

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_1|420_bp
atggagtgtgatgatcggcatttgcgcactgaaatgcccatgcacgtgttttatcctatg
gcttttgaaatctgtgtttttgccaatgaagaattattttaccaagaaatcaaaactgga
tttgtcactgacttgtatgttggcatggaggtcctaatagctgccacctgcatttcctgt
tcatcaatgaaggcattgtatctgatagccactaatggaactccagagctccagaatcct
gagagactgtcagctgtattccgtgactttttaaatcgctgtcttgagatggatgtggat
aggcgaggatctgccaaggagcttttgcagcatccatttttaaaattagccaagcctctc
tccagcctgactcctctgattatcgctgcaaaggaagcaattaagaacagcagccgctaa

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_2|152_aa
MKTLEENLGNTIQGIGMGKDFMTKTPKAMATKAKIDKRDLIKLNSVFTAKETIISMNRQP
TELEKFFAIYLSDKGRKMSHPITVGHSVYSSQQEIKEKSPGHHPYVCQQRHFGINIERCA
LVISGHTSNIKDLDRLTCVKMSLKIQGKRINC

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_2|459_bp
atgaaaaccctagaagaaaacctaggcaataccattcagggcataggcatgggcaaagat
ttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaacgggatcta
attaaactaaacagcgtcttcacagcaaaagaaactatcatcagcatgaacaggcaacct
acagaattggagaaattttttgcaatctatctatctgacaaaggaagaaaaatgagtcac
cctatcacagtgggccacagtgtatactctagtcaacaggaaattaaagaaaagtcacca
ggccaccatccatatgtttgccagcaaagacattttggaatcaacattgaaagatgtgcc
ctcgttatctcagggcacactagcaacatcaaagatttggatagactcacctgtgtcaag
atgagcttaaaaatccagggaaagagaataaactgttaa

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_3|641_aa
MGPPLKLFKNQKYQELKQECIKDSRLFCDPTFLPENDSLFYNRLLPGKVVWKRPQDICDD
PHLIVGNISNHQLTQGRLGHKPMVSAFSCLAVQESHWTKTIPNHKEQEWDPQKTEKYAGI
FHFRFWHFGEWTEVVIDDLLPTINGDLVFSFSTSMNEFWNALLEKAYAKLLGCYEALDGL
TITDIIVDFTGTLAETVDMQKGRYTELVEEKYKLFGELYKTFTKGGLICCSIESPNQEEQ
EVETDWGLLKGHTYTMTDIRKIRLGERLVEVFSAEKVYMVRLRNPLGRQEWSGPWSEISE
EWQQLTASDRKNLGLVMSDDGEFWMSLEDFCRNFHKLNVCRNVNNPIFGRKELESVLGCW
TVDDDPLMNRSGGCYNNRDTFLQNPQYIFTVPEDGHKVIMSLQQKDLRTYRRMGRPDNYI
IGFELFKVEMNRKFRLHHLYIQERAGTSTYIDTRTVFLSKYLKKGNYVLVPTMFQHGRTS
EFLLRIFSEVPVQLRELTLDMPKMSCWNLARGYPKVVTQITVHSAEDLEKKYANETVNPY
LVIKCGKEEVRSPVQKNTVHAIFDTQAIFYRRTTDIPIIVQVWNSRKFCDQFLGQVTLDA
DPSDCRDLKSLYLRKKGGPTAKVKQGHISFKVISSDDLTEL

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_3|1926_bp
atgggtcctcctctgaagctcttcaaaaaccagaaataccaggaactgaagcaggaatgc
atcaaagacagcagacttttctgtgatccaacatttctgcctgagaatgattctcttttc
tacaaccgactgcttcctggaaaggtggtgtggaaacgtccccaggacatctgtgatgac
ccccatctgattgtgggcaacattagcaaccaccagctgacccaagggagactggggcac
aagccaatggtttctgcattttcctgtttggctgttcaggagtctcattggacaaagaca
attcccaaccataaggaacaggaatgggaccctcaaaaaacagaaaaatacgctgggata
tttcactttcgtttctggcattttggagaatggactgaagtggtgattgatgacttgttg
cccaccattaacggagatctggtcttctctttctccacttccatgaatgagttttggaat
gctctgctggaaaaagcttatgcaaagctgctaggctgttatgaggccctggatggtttg
accatcactgatattattgtggacttcacgggcacattggctgaaactgttgacatgcag
aaaggaagatacactgagcttgttgaggagaagtacaagctattcggagaactgtacaaa
acatttaccaaaggtggtctgatctgctgttccattgagtctcccaatcaggaggagcaa
gaagttgaaactgattggggtctgctgaagggccatacctataccatgactgatattcgc
aaaattcgtcttggagagagacttgtggaagtcttcagtgctgagaaggtgtatatggtt
cgcctgagaaaccccttgggaagacaggaatggagtggcccctggagtgaaatttctgaa
gagtggcagcaactgactgcatcagatcgcaagaacctggggcttgttatgtctgatgat
ggagagttttggatgagcttggaggacttttgccgcaactttcacaaactgaatgtctgc
cgcaatgtgaacaaccctatttttggccgaaaggagctggaatcggtgttgggatgctgg
actgtggatgatgatcccctgatgaaccgctcaggaggctgctataacaaccgtgatacc
ttcctgcagaatccccagtacatcttcactgtgcctgaggatgggcacaaggtcattatg
tcactgcagcagaaggacctgcgcacttaccgccgaatgggaagacctgacaattacatc
attggctttgagctcttcaaggtggagatgaaccgcaaattccgcctccaccacctctac
atccaggagcgtgctgggacttccacctatattgacacccgcacagtgtttctgagcaag
tacctgaagaagggcaactatgtgcttgtcccaaccatgttccagcatggtcgcaccagc
gagtttctcctgagaatcttctctgaagtgcctgtccagctcagggaactgactctggac
atgcccaaaatgtcctgctggaacctggctcgtggctacccgaaagtagttactcagatc
actgttcacagtgctgaggacctggagaagaagtatgccaatgaaactgtaaacccatat
ttggtcatcaaatgtggaaaggaggaagtccgttctcctgtccagaagaatacagttcat
gccatttttgacacccaggccattttctacagaaggaccactgacattcctattatagta
caggtctggaacagccgaaaattctgtgatcagttcttggggcaggttactctggatgct
gaccccagcgactgccgtgatctgaagtctctgtacctgcgtaagaagggtggtccaact
gccaaagtcaagcaaggccacatcagcttcaaggttatttccagcgatgatctcactgag
ctctaa

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_4|140_aa
MAFILGKSVECTVRQIVCEALERERKSLGSLPEVLITITDLPTRAMSDQGHVCSLAHINF
ANTSPFFSACPLSSNVHVCDINPGTENSSGNGSSSSSSSSSSSRAPGITQVSREGIRKLT
LEFYFLDNLASKKDEETSTL

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_4|423_bp
atggcatttatccttggcaaaagtgtagaatgcactgttaggcagatcgtttgtgaggct
ttagaaagggagcggaaatcactagggtcacttcctgaagtcctcatcactatcacagat
ctgcctaccagggccatgtctgaccagggccatgtctgcagcctagcccatattaacttt
gccaacacttcaccatttttctctgcatgtcctctctctagtaatgtgcatgtctgtgat
attaaccctgggacagaaaacagcagcggcaacggcagcagcagcagcagcagcagcagc
agcagcagcagggctcctgggataactcaggtgagtagagagggaattcgcaaacttacc
ctggagttttatttcctggataacttagcgtctaagaaagatgaagaaacttcaactttg
tag

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_5|338_aa
MMAYLIAGISQSHDGNGIVKNTIDTDACTKPSRQPIVEPCDHLSLTVTGGATLVLADSSL
SCFFPGAGTFLTEIQKAAELPGRKAKSAGLQRLQTPLLLQVQAHGDQSSAPEPLTRVNKV
PAGNPHPVTCLHDFFGDDDVFIACGPEKFRYAQDDFSLDENECRVMKGNPSATAGPKASP
TPQKTSAKSPGPMRRSKSPADSGNDQDALLPKGFQKVSKANEVTKPCFGTWVEHLSPQFR
DIAKGVFRALYVTRHQTCTCLCPWMTRTRLVIPCKGGESAQSPEYKSKPIIVVGTREQDA
YIPKMSSRTLMGMIPKITPPQKRLCQETSPDRNTGTVV

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_5|1017_bp
atgatggcttatcttattgctggtatttctcaaagtcatgatggaaatggcattgttaag
aacacaatagatacagatgcttgtaccaagccaagcagacagcctattgtggaaccttgt
gatcatctttcactgacagtcacaggaggagcaactcttgtgctagctgatagttcattg
agttgtttcttccctggggctggcacctttctcacagagattcaaaaggcagcagaacta
ccaggaagaaaggctaagtctgctggtctacagagactgcagacccccttactcctacag
gttcaggcccacggagatcagagttctgcccctgagcctctgactagagttaataaagtt
cctgcagggaatccccacccagtaacttgtctccatgatttctttggtgatgatgatgtg
tttattgcctgtggtcctgaaaaatttcgctatgctcaggatgatttttctctggatgaa
aatgaatgccgagtcatgaagggaaacccatcagccacagctggcccaaaggcatcccca
acacctcagaagacttcagccaagagccctggtcctatgcgccgaagcaagtctccagct
gactcaggtaacgaccaagacgcacttcttccaaagggtttccaaaaggtctccaaagct
aatgaggtaaccaaaccatgctttgggacatgggttgaacacttgtctccccagttcaga
gacatagcaaagggggtgttcagggcactttatgtcaccaggcaccagacctgtacctgc
ctctgtccttggatgactcggactcgcttggtgattccatgtaaaggaggggagagtgct
cagagtccagagtacaaatccaagcctatcattgtagtagggacaagagaacaggatgcc
tatatccccaaaatgagctccaggacactgatgggaatgatcccaaagatcaccccacct
cagaaacgtctgtgccaagagacttccccagatagaaacactgggacagtggtttga

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_6|402_aa
MAKGEVGAGTSHGKSWSKSERTRERENKRAREQESERTRERENKRARLISYECRIAAPAW
SLRFHQNMELDFGHFDERDKTSRNMRGSRMNGLPSPTHSAHCSFYRTRTLQALSNEKKAK
KVRFYRNGDRYFKGIVYAVSSDRFRSFDALLADLTRSLSDNINLPQGVRYIYTIDGSRKI
GSMDELEEGESYVCSSDNFFKKVEYTKNVNPNWSVNVKTSANMKAPQSLASSNSAQAREN
KDFVRPKLVTIIRSGVKPRKAVRVLLNKKTAHSFEQVLTDITEAIKLETGVVKKLYTLDG
KQYSRKDLPFPRCHTKELGAVSTKAIAGNTARQIGFNLGNVECTYQDIFLAWSGAGTWTP
EEITVTEQAKKIHTHIPTQLNSAGCGSVHRAVWRLATPQGPC

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_6|1209_bp
atggcaaaaggcgaagtgggagcaggcacttcacatggcaaaagctggagcaagagcgag
agaacaagagagcgagagaacaagagagcgagagaacaagagagcgagagaacaagagag
cgagagaacaagagagcgagacttatttcttatgaatgtcggatagctgcaccagcttgg
tctctgaggttccaccaaaatatggaacttgattttggacactttgacgaaagagataag
acatccaggaacatgcgaggctcccggatgaatgggttgcctagccccactcacagcgcc
cactgtagcttctaccgaaccagaaccttgcaggcactgagtaatgagaagaaagccaag
aaggtacgtttctaccgcaatggggaccgctacttcaaggggattgtgtacgctgtgtcc
tctgaccgttttcgcagctttgacgccttgctggctgacctgacgcgatctctgtctgac
aacatcaacctgcctcagggagtgcgttacatttacaccattgatggatccaggaagatc
ggaagcatggatgaactggaggaaggggaaagctatgtctgttcctcagacaacttcttt
aaaaaggtggagtacaccaagaatgtcaatcccaactggtctgtcaacgtaaaaacatct
gccaatatgaaagccccccagtccttggctagcagcaacagtgcacaggccagggagaac
aaggactttgtgcgccccaagctggttaccatcatccgcagtggggtgaagcctcggaag
gctgtgcgtgtgcttctgaacaagaagacagcccactcttttgagcaagtcctcactgat
atcacagaagccatcaaactggagaccggggttgtcaaaaaactctacactctggatgga
aaacagtattctaggaaagacttgccctttccacgttgccatacaaaagaactgggtgct
gtaagcacaaaggcaattgcaggaaacacagcaagacagattggatttaaccttggaaat
gttgaatgcacttaccaagatatttttctggcttggtccggggctggcacttggactcca
gaggaaataactgttactgagcaagctaaaaaaattcacacacacattcccactcagctt
aacagcgctggctgtggttcagttcaccgagcggtatggaggttggctactccacaggga
ccttgctga

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_7|71_aa
MWNNTAADNKQPCERKAAKLKEKYDKDIVVHGAKGKPDAAKKKAVGAEKSKKKNEEEKDE
EDDVEEEDDHE

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_7|216_bp
atgtggaataacactgctgcagacaataagcagccatgtgaaaggaaggctgcaaagctg
aaggaaaaatatgacaaggatatcgttgtacacggagctaaaggaaagcctgatgcagca
aaaaagaaagccgttggggctgagaaaagcaagaaaaagaatgaagaggagaaagatgaa
gaggatgatgttgaggaggaagatgatcatgaataa

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_8|176_aa
MGVKPRKATGVELPKVLGAHILHKCGLDVGHEIKGDYFEALRFSDLPVGFQTCMGLISPF
FWLISPFQNELLSARSQPTQRDDSAKLTGTQKAHWQHCKKYRSQYSKKLSASYVQGRHST
VTRDPRRSESQVLLLVTDNHKGHQQKSFSEGKHRLPELLVAERKVLIDLFFWVTKG

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_8|531_bp
atgggggttaagccccgcaaagccacaggggtggagctgcccaaggtcttgggagcccac
atcttgcacaagtgtggcctggatgtgggccatgaaatcaaaggggattactttgaagct
ttaaggtttagtgaccttcctgttggctttcagacttgcatggggcttatatcccctttc
ttttggctgatttctccctttcagaatgagcttctaagtgccagatctcagccaactcaa
cgagatgactctgcaaagctaacaggaacccagaaagcccattggcagcactgcaagaag
tatcgaagccagtactcaaaaaaattgagtgccagctatgtacaaggaaggcatagtact
gtcaccagagaccctcgaaggtcggaatcacaagtcttgcttcttgtcactgacaatcat
aaaggacaccagcagaaaagcttctcagagggaaaacacaggttaccagaactcttggtg
gctgagagaaaagttctgattgatttatttttctgggtaactaaaggataa

>gi568815575r:111201690_111510398|GENSCAN_predicted_peptide_9|256_aa
VSQVTVEGSSVTEAFEEPTEEPKTQRVGSHSTKEEMTNPLPLFPTPISTLTFIRNGNIWL
RLYTVTDPNTVPIFIIHIDDLGTLGNASLYTFQVKVKDRMMSLRASYIRVCRLFEVVTIL
PVTGDFTTRMQPKQCGPPYSLRNSNTEIRPTNNPMMASTRSSERKSYISHSKSKARNNLS
EKGMMKDIIKNVCGSWEEVKISTLTGVWKKLIPTLTDDFEGFKASGEEVTADVEIARELE
LEFEYEDVTEFLQSYD

>gi568815575r:111201690_111510398|GENSCAN_predicted_CDS_9|771_bp
gtctcccaagttacagtagaggggtcttcagtgacagaggcctttgaagaacccacagaa
gagccaaagacccagagagtgggaagccattccaccaaggaagaaatgaccaatccctta
cctctcttccctactcctatttccaccctgacatttataaggaatggcaatatctggctt
cgactatatactgtgactgatccaaatactgttcccatcttcatcattcacattgatgac
cttgggaccttgggcaatgcatccctctacacatttcaggtgaaggtaaaggaccgaatg
atgtccctgagggcctcatacatccgagtgtgtaggctttttgaagttgtgaccatactt
cctgtcacaggtgactttacaactagaatgcagccaaaacagtgtgggcctccctattcc
ctgagaaacagcaatactgaaattaggccaactaataaccctatgatggcctctacgcgt
tcaagtgaaaggaagtcctacatctctcactcgaaatcaaaagctagaaataaccttagt
gagaaaggcatgatgaaagatatcattaagaatgtctgtggctcatgggaggaggtcaaa
atatcaacattaacaggagtttggaagaagttaattccaaccctcacggatgactttgaa
ggattcaaggcttcaggtgaggaagtaactgcagatgtggaaatagcaagagaactagaa
ttagaatttgagtatgaagatgtgactgaatttctgcaatcttatgattaa