GENSCAN 1.0	Date run:  7-Nov-116	Time: 18:03:57

Sequence gi568815597r:51819930_52077117 : 257188 bp : 42.81% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -   3857   3735  123  0  0   89   88    25 0.212   2.24
 1.08 Intr -   7940   7867   74  1  2   54   83   101 0.711   4.33
 1.07 Intr -  14241  14088  154  1  1   77   61   213 0.754  15.71
 1.06 Intr -  16283  16202   82  2  1   53   61    53 0.934  -2.61
 1.05 Intr -  16509  16440   70  2  1  116  105    78 0.970  10.57
 1.04 Intr -  18217  18135   83  1  2   41   97    36 0.611  -2.68
 1.03 Intr -  20585  20297  289  1  1  127   80   634 0.984  62.83
 1.02 Intr -  37809  37612  198  2  0   36   78   165 0.257   7.84
 1.01 Init -  58539  58346  194  0  2   78   93   273 0.868  25.29
 1.00 Prom -  60978  60939   40                              -8.35

 2.00 Prom +  65083  65122   40                              -5.85
 2.01 Init +  74287  74601  315  0  0   51    1   150 0.073  -0.02
 2.02 Intr +  81988  82076   89  0  2   62   89   116 0.912   6.95
 2.03 Term +  90739  90925  187  1  1   99   36   155 0.976   7.38
 2.04 PlyA +  92108  92113    6                               1.05

 3.13 PlyA -  92471  92466    6                               1.05
 3.12 Term - 100185  99998  188  1  2   95   48   222 0.994  15.57
 3.11 Intr - 113513 113389  125  1  2  114   76   148 0.854  15.51
 3.10 Intr - 117483 117365  119  0  2   75   64   245 0.964  19.14
 3.09 Intr - 138153 138115   39  0  0   72  116    15 0.054   0.20
 3.08 Intr - 139122 139012  111  0  0   64   87    70 0.001   4.16
 3.07 Intr - 157342 156835  508  0  1   37   64   490 0.008  33.55
 3.06 Intr - 169163 169039  125  2  2   72   90    89 0.407   5.96
 3.05 Intr - 169529 169353  177  2  0   19   59   133 0.603   2.69
 3.04 Intr - 170682 170623   60  0  0   68   91    67 0.379   3.01
 3.03 Intr - 178109 178020   90  2  0   81   85    85 0.481   6.77
 3.02 Intr - 181441 181374   68  2  2   64   84    34 0.476  -1.69
 3.01 Init - 182596 182518   79  1  1   93   37    59 0.660   2.57
 3.00 Prom - 183217 183178   40                              -8.25

 4.00 Prom + 185090 185129   40                              -7.65
 4.01 Init + 186683 186905  223  1  1   65   59   182 0.601  11.77
 4.02 Intr + 187224 187256   33  1  0   76   64    83 0.387   1.98
 4.03 Term + 198649 198896  248  2  2  -31   43   172 0.008  -4.23
 4.04 PlyA + 198979 198984    6                               1.05

 5.06 PlyA - 200221 200216    6                               1.05
 5.05 Term - 201083 201004   80  0  2   89   38   109 0.927   2.95
 5.04 Intr - 203645 203562   84  0  0   81   75    93 0.949   6.17
 5.03 Intr - 204650 204581   70  2  1   58   82    98 0.834   4.04
 5.02 Intr - 207419 207346   74  0  2   80   94    69 0.949   4.91
 5.01 Init - 208688 208649   40  2  1   56  115    -1 0.699  -0.19
 5.00 Prom - 209794 209755   40                              -4.45

 6.03 PlyA - 210012 210007    6                               1.05
 6.02 Term - 214024 212768 1257  1  0   12   47  1271 0.556 105.99
 6.01 Init - 219335 219213  123  2  0   74   68    63 0.464   3.12
 6.00 Prom - 227628 227589   40                             -10.05

 7.00 Prom + 228233 228272   40                              -4.25
 7.01 Init + 235103 235440  338  1  2   81   36   123 0.074   2.41
 7.02 Intr + 239906 239972   67  2  1  115  113    77 0.897  10.89
 7.03 Intr + 244896 245009  114  2  0  104   64   120 0.466  10.92
 7.04 Term + 251855 251890   36  1  0  109   48    32 0.168  -2.34
 7.05 PlyA + 255293 255298    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  58891  59409  519  0  0   82   48   234 0.925  14.59
S.002 Init + 140142 140213   72  2  0   75   97    42 0.895   5.00
S.003 Intr + 140695 140855  161  0  2   76   89   102 0.940   6.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_1|423_aa
MPGRNKAKSTCSCPDLQPNGQDLGENSRVARLGADESEEEGRRGSLSNAGDPEIVKSPSD
PKQYRLLTESKSVNYQIQSNHGSSPVSVVTSKPSSDQKFAQTRFQTEICRASESQRELTT
ISVASRKQWTSYIKLQNGLQALLISDLSNMEGKTGNTTDDEEEEEVEEEEEDDDEDSGAE
IEDDDEEGFDDEDEFDDEHDDDLDTEDNELEELEERAEARKKTTEKQWNLSSLKTRNCFD
SEAMGSLVKGKIPRSSAPVQHLAGWQAEEQQGETDTVLSAAALCVGVGSFADPDDLPGLA
HFLEHMVFMGSLKYPDENGFDAFLKKHGGSDNASTDCERTVFQFDVQRKYFKEALDRWAQ
FFIHPLMIRDAIDREVEAVDSGNAETLKHEPRKNNIDTHARLREFWMRYYSSHYMTLVVQ
SKX
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_1|1269_bp
atgcctggaaggaacaaggcgaagtctacctgcagctgccctgacctgcagcccaatgga
caggatctgggcgagaacagccgggttgcccgtctaggagcggatgaatctgaggaagag
ggacggagggggtctctcagtaatgctggggaccctgagatcgtcaagtctcccagcgac
cccaagcaataccgcctcttaactgaatccaagtcagttaattatcagatccaatccaat
catggatccagtccagtttctgttgtgacttccaaacccagttcggatcagaagtttgct
caaactcggttccaaacagaaatctgtcgagcttcagaatctcagcgagaacttaccacg
atatcagttgcttccagaaagcaatggacatcatacatcaaattacagaatggcttgcag
gcacttctgatttcagacctaagtaatatggaaggtaaaacaggaaatacaacagatgat
gaagaagaagaggaggtggaggaagaagaagaagatgatgatgaagattctggagctgaa
atagaagatgacgatgaagagggttttgatgatgaagatgagtttgatgatgaacatgat
gatgatcttgatactgaggataatgaattggaagaattagaagagagagcagaagctaga
aaaaaaactactgaaaaacagtggaacctaagtagtttgaagacgagaaattgttttgat
tcagaagcaatgggttcactagtgaaaggaaagatcccacgatcgtctgcacctgttcag
catttggcaggatggcaagcggaggagcagcagggtgaaactgacacagttctgtctgca
gcggctctttgtgttggagttgggagtttcgctgatccagatgacctgccggggctggca
cactttttggagcacatggtattcatgggtagtttgaaatatccagatgagaatggattt
gatgccttcctgaagaagcatgggggtagtgataatgcctcaactgattgtgaacgcact
gtctttcagtttgatgtccagaggaagtacttcaaggaagctcttgatagatgggcgcag
ttcttcatccacccactaatgatcagagatgcaattgaccgtgaagttgaagctgttgat
agtggaaatgctgagacgctcaagcatgagccaagaaagaataatattgatacacatgct
agattgagagaattctggatgcgttactactcttctcattacatgactttagtggttcaa
tccaaagnn
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_2|196_aa
MTKTPKAIATKAKIDKWDLIKLKSFCTAKETIIRLNRQPTEWEKILPIYPSDKSLISRIY
KELKQIYKKKTNNPIKKWAKDMNRHFAKEDIYVANKPMKISSSSLRLRQQGAILEAENSP
HQTPVVMEPFQPPERSYRKQPQNPKASAPREPVETIWSKLLKPRDKNDLTQAAQPLNSPL
LTPETDAFESERRQES
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_2|591_bp
atgactaaaacaccaaaagcaattgcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactatcatcaggctaaacaggcaacctaca
gaatgggagaaaattttgccaatctatccatctgacaaaagtctaatatccagaatctat
aaggaacttaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag
gatatgaacagacactttgcaaaagaagacatttatgtggccaacaaacctatgaaaata
agctcatcatcactgaggttgcggcaacaaggtgccatcttggaagcagagaatagccct
caccagacaccagttgtgatggagcctttccagcctccagaacgatcttacagaaaacag
ccacagaatcccaaggcatcagcaccaagagagcctgtagaaactatctggtctaagctc
ctgaagcccagagataagaatgacttaacccaagctgcacagcctcttaacagtccactg
ttaaccccagaaacagatgcatttgaatctgaaaggagacaggaatcctaa
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_3|562_aa
MVTGQSRESTVPISDTRGSMRLTFAEEVIQKGVWSVVDDVRPRKNFVPRIPPLLTVCALA
MEQCPCCRFEPLSPRKQQLVLPSRRPSGSEPVVPHSESAESKEGTQALDLVTRAPGITEN
KRIVDRKSHPTLFCSPSMEATLGRAGQQPLEAKRDVEEVGVHLFCAYKQRKTMEAVKGPI
YRSGKPWCPVGAREEAFWGRCSDGVILSSVHFLSGAGVRSECVNGGVTIRRMVTQGDTEG
LFLSDSRSLLQMASVTDGKTGVKDASDQNFDYMFKLLIIGNSSVGKTSFLFRYADDTFTP
AFVSTVGIDFKVKTVYRHEKRVKLQIWVSPGNLGQDFPEKRLACQYKGLEVGPAWNFWAL
QHQALMLFQTACSLRAGTGSDSGLQLQHQAQGLANSKGLTKTGKMKVLQLEHGEEEIRKD
TAGQERYRTITTAYYRGAMGFILMYDITNEESFNAVQDWATQIKTYSWDNAQVILVGNKC
DMEEERVVPTEKGQLLAEQLGFDFFEASAKENISVRQAFERLVDAICDKMSDSLDTDPSM
LGSSKNTRLSDTPPLLQQNCSC
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_3|1689_bp
atggttactggtcagtccagagagagtacagttccaatttcggacacaagagggagtatg
agacttacctttgccgaagaggtcatccagaaaggggtctggtcagtggtggatgatgtg
agaccaaggaagaactttgtgcctaggattccacccctgctgacagtgtgtgccctggcg
atggagcagtgtccttgttgcagatttgaaccactttcacctcgtaaacagcagctggtc
ctgccgtcccgccgaccgtccgggagcgaacccgtcgtcccgcactcggagtccgcggag
tccaaggaggggacccaggcactggaccttgttacacgggcaccggggatcacagaaaat
aagagaatcgtggacagaaaaagtcatccaactctgttctgtagtccaagcatggaagca
actttggggagggcagggcagcagcctttggaagctaagagagatgtggaggaggtgggt
gtccacctgttctgtgcctataaacagaggaagacaatggaagccgtgaaaggccctatc
tatagaagtggaaagccctggtgccctgtgggagccagagaagaagcattttgggggagg
tgtagtgatggggtgattctgagtagtgtgcatttcctcagtggtgctggagtaaggagt
gagtgtgtgaatggtggggtgaccattagaaggatggtgactcagggtgacaccgaaggt
ctgttcctgagtgactctaggtctctcttgcagatggcttcagtgacagatggtaaaact
ggagtcaaagatgcctctgaccagaattttgactacatgtttaaactgcttatcattggc
aacagcagtgttggcaagacctccttcctcttccgctatgctgatgacacgttcacccca
gccttcgttagcaccgtgggcatcgacttcaaggtgaagacagtctaccgtcacgagaag
cgggtgaaactgcagatctgggtgagtcccgggaatcttgggcaggattttcctgagaag
cggctggcctgccagtacaagggcttagaggtggggccagcttggaacttctgggcctta
cagcatcaggcattaatgctgtttcagactgcatgctccctgagggcagggaccgggtct
gactcgggtctacagctccagcaccaggcacagggccttgccaatagtaagggtttaaca
aagactggcaaaatgaaagttcttcagctagaacatggggaggaagagataaggaaggac
acagctgggcaggagcggtaccggaccatcacaacagcctattaccgtggggccatgggc
ttcattctgatgtatgacatcaccaatgaagagtccttcaatgctgtccaagactgggct
actcagatcaagacctactcctgggacaatgcacaagttattctggtggggaacaagtgt
gacatggaggaagagagggttgttcccactgagaagggccagctccttgcagagcagctt
gggtttgatttctttgaagccagtgcaaaggagaacatcagtgtaaggcaggcctttgag
cgcctggtggatgccatttgtgacaagatgtctgattcgctggacacagacccgtcgatg
ctgggctcctccaagaacacgcgtctctcggacaccccaccgctgctgcagcagaactgc
tcatgctag
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_4|167_aa
MASWEGKDLTVPQPDTRKGSVLRRISKRGRNASCSSDKRKASVSCRSLGNGMSRYKTRLY
APSTEIGKSRLRAGVSRSTLRETPTDTQDRGLASALEENGHLMTAPDVQLGYFNVKWSRA
MIDVLGKGYFILMRKIREDSLEEVAQNGYEGRTERTYFRLRKQDERR
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_4|504_bp
atggcctcgtgggaagggaaagacctgaccgtcccccagcccgacacccgtaaagggtct
gtgctgaggaggattagtaaaagaggaaggaatgcctcttgcagttcagacaagaggaag
gcatctgtctcctgccggtccctgggcaatggaatgtctcggtataaaacccgattgtat
gctccatctactgagatagggaaaagccgccttagggctggagtctctcgttccacctta
cgagaaacacccacagatacacaggacagaggactagcttctgccctcgaggagaatggc
cacctgatgacagccccagatgtgcaattgggttacttcaatgtaaagtggtccagggct
atgatagatgtattggggaaaggatattttatcctcatgagaaagatcagggaagactct
ctggaagaagtggctcaaaatgggtatgaaggaagaacagaaagaacatattttaggctg
aggaaacaagatgaacgaagatag
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_5|115_aa
MVIIHKSWCGACKALKPKFAESTEISELSHNFVMVNLEDEEEPKDEDFSPDGGYIPRILF
LDPSGKVHPEIINENGNPSYKYFYVSAEQVVQGMKEAQERLTGDAFRKKHLEDEL
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_5|348_bp
atggtgattattcataaatcctggtgtggagcttgcaaagctctaaagcccaaatttgca
gaatctacggaaatttcagaactctcccataattttgttatggtaaatcttgaggatgaa
gaggaacccaaagatgaagatttcagccctgacgggggttatattccacgaatccttttt
ctggatcccagtggcaaggtgcatcctgaaatcatcaatgagaatggaaaccccagctac
aagtatttttatgtcagtgccgagcaagttgttcaggggatgaaggaagctcaggaaagg
ctgacgggtgatgccttcagaaagaaacatcttgaagatgaattgtaa
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_6|459_aa
MKPIQGAIGDAGSVPKKQRMLMTLQEKIELLDMYCRLRSTQSLYLLSEKAEPGPSWRRTP
VSIETRGSVKVKGDRKSSLHFRLTSETRTTRKLAQRGCQWSLPERMPLVVFCGLPYSGKS
RRAEELRVALAAEGRAVYVVDDAAVLGAEDPAVYGDSAREKALRGALRASVERRLSRHDV
VILDSLNYIKGFRYELYCLARAARTPLCLVYCVRPGGPIAGPQVAGANENPGRNVSVSWR
PRAEEDGRAQAAGSSVLRELHTADSVVNGSAQADVPKELEREESGAAESPALVTPDSEKS
AKHGSGAFYSPELLEALTLRFEAPDSRNRWDRPLFTLVGLEEPLPLAGIRSALFENRAPP
PHQSTQSQPLASGSFLHQLDQVTSQVLAGLMEAQKSAVPGDLLTLPGTTEHLRFTRPLTM
AELSRLRRQFISYTKMHPNNENLPQLANMFLQYLSQSLH
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_6|1380_bp
atgaaacccatacaaggtgccattggtgatgctggaagtgttcccaagaagcagagaatg
ctcatgacattacaagaaaaaattgaattgcttgatatgtactgtagactgaggtctaca
cagtctctgtatctcctttctgagaaagctgaacctgggcctagctggcgtagaactcct
gtttccatagaaacaagagggtctgtaaaagtcaagggagacaggaagagcagtttgcac
ttccggcttacgtcggagacgcgtacaacccggaagttggcgcagcgcggttgccaatgg
tcgctccctgagaggatgccgctcgtggtgttttgcgggctgccgtacagcggcaagagc
cggcgtgctgaagagttgcgcgtggcgctggctgccgagggccgcgcggtgtacgtggtg
gacgacgcagctgtcctgggcgcagaggacccagcggtgtacggcgattctgcccgtgag
aaggcattgcgtggagctctgcgagcctccgtggaacggcgcctgagtcgccacgacgtg
gtcatcctggactcgcttaactacatcaaaggtttccgttacgagctctactgcctggca
cgggcggcgcgcaccccgctctgcctggtctactgcgtacggcccggcggcccgatcgcg
ggacctcaggtggcgggcgcgaacgagaaccctggccggaacgtcagtgtgagttggcgg
ccacgcgctgaggaggacgggagagcccaggcggcgggcagcagcgtcctcagggaactg
catactgcggactctgtagtaaatggaagtgcccaggccgacgtacccaaggaactggag
cgagaagaatccggggctgcggagtctccagctcttgtgactccggattcagagaaatct
gcaaagcatgggtccggtgccttttactctcccgaactcctggaggccctaacgctgcgc
tttgaggctcccgattctcggaatcgctgggaccggcctttattcactttggtgggccta
gaggagccgttgcccctggcggggatccgctctgccctgtttgagaaccgggccccacca
ccccatcagtctacgcagtcccagcccctcgcctccggcagctttctgcaccagttggac
caggtcacgagtcaagtactggccggattgatggaagcgcagaagagcgctgtccccggg
gacttgctcacgcttcctggtaccacagagcacttgcggtttacccggcccttgaccatg
gcagaactgagtcgccttcgtcgccagtttatttcgtacactaaaatgcatcccaacaat
gagaacttgccgcaactggccaacatgtttcttcagtatttgagccagagcctgcactga
>gi568815597r:51819930_52077117|GENSCAN_predicted_peptide_7|184_aa
MTRSRKLKPSKQVAPRRGRVSMAVGARGHGAERTQGRSPSRRSTQFAERRSAQHLYFPDF
FFSNSEGSDVRRMWVVPPGSGLGWSELKGADHVEGYESCSRARVAFRGFWVSRFTNSMNQ
EKLAKLQAQVRIGGKGTARRKKKVVHRTATADDKKLQSSLKKLAVNNIAGIEELEPQVPI
DRQL
>gi568815597r:51819930_52077117|GENSCAN_predicted_CDS_7|555_bp
atgacgaggagcaggaaactgaagcccagcaaacaggtggccccgagacgaggccgcgtc
tccatggcagtaggtgcgcggggccacggggctgagcggacgcagggccggagtcccagc
agacggtccacacagttcgccgagcgccgctcagcacaacacctctacttcccagatttt
tttttttcaaactctgaaggaagtgatgttagacggatgtgggtggtgcctccggggtcc
ggtttgggatggagtgagcttaaaggtgcagatcacgtagaagggtacgagagctgttct
cgagcgcgagttgcttttcgtggattctgggtgtccagatttaccaacagcatgaatcaa
gaaaagttagccaaacttcaggctcaggtccggatagggggcaagggtacagctcgcaga
aagaagaaggtggtacatagaacagccacagctgatgacaaaaagcttcagagttctcta
aaaaaactggctgtgaataatatagctggtattgaagagctagagcctcaggtgcccata
gatagacagctctga