GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:21:36 Sequence gi568815596f:216399269_216601401 : 202133 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15456 15620 165 1 0 84 63 94 0.168 6.43 1.02 Intr + 16989 17039 51 1 0 91 80 86 0.905 6.98 1.03 Intr + 21031 21114 84 2 0 91 45 108 0.716 6.59 1.04 Intr + 29328 29514 187 1 1 111 58 153 0.965 13.35 1.05 Intr + 33450 33600 151 0 1 80 38 239 0.809 18.26 1.06 Intr + 36070 36228 159 0 0 112 50 151 0.999 13.88 1.07 Intr + 39113 39217 105 1 0 101 113 62 0.971 10.41 1.08 Intr + 47750 47890 141 1 0 101 79 199 0.995 20.85 1.09 Intr + 51578 51796 219 1 0 109 74 282 0.964 27.40 1.10 Intr + 56294 56456 163 1 1 70 71 51 0.438 1.15 1.11 Term + 56541 56587 47 1 2 73 38 72 0.324 -1.93 1.12 PlyA + 56888 56893 6 1.05 2.00 Prom + 57876 57915 40 -4.96 2.01 Sngl + 59978 60928 951 1 0 75 44 267 0.699 17.61 2.02 PlyA + 61565 61570 6 1.05 3.00 Prom + 62781 62820 40 -3.96 3.01 Init + 64937 64945 9 1 0 54 103 0 0.285 -1.41 3.02 Intr + 65329 65399 71 0 2 77 103 78 0.976 6.18 3.03 Intr + 68676 68778 103 0 1 49 105 46 0.960 2.58 3.04 Intr + 76001 76183 183 1 0 118 78 356 0.999 37.58 3.05 Intr + 77841 77941 101 2 2 61 52 167 0.961 9.31 3.06 Intr + 78935 79031 97 2 1 88 90 67 0.971 6.91 3.07 Term + 83470 83709 240 0 0 45 42 271 0.988 14.23 3.08 PlyA + 83728 83733 6 1.05 4.00 Prom + 84241 84280 40 -7.76 4.01 Init + 99607 99609 3 0 0 105 101 0 0.771 3.00 4.02 Intr + 100002 100134 133 2 1 87 69 178 0.834 16.02 4.03 Intr + 100681 100763 83 2 2 71 89 105 0.722 8.26 4.04 Intr + 110177 110336 160 1 1 74 75 48 0.185 1.66 4.05 Term + 114256 114347 92 2 2 52 49 74 0.205 -2.22 4.06 PlyA + 114811 114816 6 1.05 5.00 Prom + 117752 117791 40 -5.06 5.01 Init + 121448 121506 59 1 2 58 111 39 0.176 3.98 5.02 Intr + 121633 121683 51 1 0 122 100 24 0.939 5.02 5.03 Intr + 122785 122896 112 1 1 84 94 29 0.847 3.48 5.04 Term + 143457 143621 165 2 0 36 42 100 0.059 -1.88 5.05 PlyA + 144123 144128 6 1.05 6.06 PlyA - 146896 146891 6 1.05 6.05 Term - 159277 159158 120 1 0 29 54 124 0.259 1.57 6.04 Intr - 160358 160217 142 1 1 85 69 79 0.306 6.06 6.03 Intr - 190029 189944 86 0 2 86 110 54 0.166 6.02 6.02 Intr - 191815 191706 110 2 2 95 1 90 0.118 1.00 6.01 Intr - 195013 194860 154 1 1 10 69 139 0.456 3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_1|490_aa XEQRKKIEENRQKALARRAEKLLAEQHQRTSSGTSIAGNPFQAKQGPSQNFPRESYPDTK TWNFSMNDYSALMKAAQSLPTVNLQPLEWAYGSSESPSTSIAKVRCLPQVQLDPLPTTLT LAFASQLKKTSLSLTPDVPEADLSEVDPKLVSNLMPFQRAGVNFAIAKGGRLLLADDMGL GKTIQAICIAAFYRKEWPLLVVVPSSVRFTWEQAFLRWLPSLSPDCINVVVTGKDRLTAG LINIVSFDLLSKLEKQLKTPFKVVIIDWILYTYVATSFQDESHFLKNSRTARCRAAMPVL KVAKRVILLSGTPAMSRPAELYTQIIAVKPTFFPQFHAFGLRYCDAKRMPWGWDYSGSSN LGELKLLLEEAVMLRRLKSDVLSQLPAKQRKIVVIAPGRINARTRAALDAAAKEMTTKDK TPPLLIPSQTGSGVVLQQTPTDLQLRVLTVRRKTNKQKGRAHQNPICTSPSSKSKGMQLL TSNGTKLDGE >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_1|1473_bp naggagcagaggaaaaagattgaagagaatcgacaaaaggctctggcccgcagagctgag aagttattggcagaacagcatcagaggactagctcgggcacctccattgctggcaaccca ttccaggccaagcaaggcccatcccaaaatttcccaagggagtcttatcctgacaccaag acgtggaacttcagcatgaatgactatagtgccctgatgaaagcagcccagagcctcccc acggtcaacctgcagcctctggaatgggcctatggcagcagcgagtcaccctccaccagc attgcaaaggtgcgctgcctcccacaagttcagctggaccctctgcccacgactctcacc ctggcgtttgcttctcagctcaagaagacatctctcagtctcacgccagatgtcccagag gcagacctttctgaagtggaccccaagctcgtgtctaatctgatgccctttcagagagct ggagtcaattttgccatagccaaaggaggccgcctgctgctcgctgacgacatgggcctg gggaagaccatccaagccatctgcatcgcagccttttaccggaaggagtggccgctcctg gtggtggtgccatcctccgtgcgcttcacctgggagcaggccttccttcggtggctgcca tctctgagcccagattgcatcaacgtcgtggtgactgggaaggaccgcctgacagctggc ctgatcaacattgtcagctttgaccttcttagcaagttggaaaaacagctaaaaacccct tttaaagttgtcatcattgattggatcttgtacacttatgtggctacttcttttcaggat gaatctcacttcctcaaaaacagtaggactgcccgctgtcgagcagctatgccggtccta aaggttgccaagagggtgatcctgttgtcgggcacaccagccatgtcccggcccgcagag ctctacacgcagatcatcgcagtcaagccaactttcttcccccagtttcatgcctttgga cttcgctactgtgatgccaaacggatgccttgggggtgggactactcaggttcctccaac ctgggagagctgaagctcctgctggaggaagcagtcatgctgcggcgcctcaagtccgac gtcctttcccagctgcctgccaagcagcgcaagatagtggtgattgccccaggacggatc aatgccaggaccagagctgccctggatgctgcagccaaggaaatgaccaccaaggacaaa actcctccgctgctgataccgagccaaacagggtctggagtggtcctccagcaaactcca acagacctgcagctgagagtcctgactgttagaaggaaaactaacaaacagaaaggacgt gcacaccaaaaccccatctgtacgtcaccatcatcaaagagcaaaggaatgcagctcctc accagcaatggaacaaagctggacggagaatga >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_2|316_aa MAILPRVIYRFNAVPIKLPMTFFTELEKITLKFIWNQKRAHIAKSILSQKNKAGVITLPD FKLYYKATVTKTAWYWYQNRDLDQWNRTEPSETIPHIYNYLILDKPDKNKKWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRKPTEWEKSFAIYSSDKGLISRIY KELKQICKKKTNNPINKRAKDMNRHFSKEDIYSANRHMKKCSSSLAIREMQIKTTMRYHL TPVRMAIIKKSGNNRC >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_2|951_bp atggccatactgcccagggtaatttatagattcaatgccgtccccatcaagctaccaatg actttcttcacagaattggaaaaaattactttaaagttcatatggaaccaaaaaagagcc cacattgccaagtcaatcctaagccaaaagaacaaagctggagtcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatctagaccaatggaacagaacagagccctcagaaacaataccacacatctacaactat ctgatccttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaacaaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggaaacctaca gaatgggagaaaagttttgcaatctactcatctgacaaagggctaatatccagaatctac aaagaactcaaacaaatttgcaagaaaaaaacgaacaaccccatcaacaagcgggcaaag gatatgaacagacacttctcaaaagaagacatttattcagccaacagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctag >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_3|267_aa MSHKQQQKDALILFFNRTAEAKIPSVIEYILDLLESGREKFLVFAHHKVVLDAITQELER KHVQHIRIDGSTSSAEREDLCQQFQLSERHAVAVLSITAANMGLTFSSADLVVFAELFWN PGVLIQAEDRVHRIGQTSSVGIHYLVAKGTADDYLWPLIQEKIKVLAEAGLSETNFSEMT ESTDYLYKDPKQQKIYDLFQKSFEKEGSDMELLEAAESFDPGSASGTSGSSSQNMGDTLD ESSLTASPQKKRRFEFFDNWDSFTSPL >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_3|804_bp atgtcacataaacagcagcagaaagatgccctcattctcttcttcaacagaacagctgaa gctaaaatcccatctgtcattgaatatatcttggacctactggaaagtggaagagagaag tttttagtatttgcacaccataaggtggtcctggacgcaattacgcaagagcttgagaga aagcacgtgcagcacatccgcatcgatggctccacctcatcagctgagcgggaggacctg tgccagcagttccaactgtcggagaggcatgctgtggccgtgctgtccatcaccgctgcc aatatgggcctcaccttctcctcggctgacctggtggtgtttgctgagctgttttggaac ccaggggtgctgatccaggctgaggaccgcgtgcaccgcattggacagaccagctccgtg ggcattcactacctcgtggcaaagggcacagctgatgactacctttggcccctgattcaa gagaagattaaagttctggcagaagccgggctttctgagaccaatttttcagaaatgaca gaatccactgattacctctacaaggacccaaagcagcagaagatctacgacctattccag aagtcctttgagaaagaaggaagtgatatggagctcctggaagcagcagagtcctttgac ccaggaagtgcttcaggaacatctggaagtagttcccagaacatgggagacaccctggat gaaagctcattgacagccagtccacagaagaaaaggagatttgaattttttgataactgg gacagctttacgtctcccctgtaa >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_4|156_aa MAKRTKKVGIVGKYGTRYGASLRKMVKKIEISQHAKYTCSFCGKVNQDEETSCGDLALWF LHEDSGWRCLDVQNCEDSHYQSLIEKQLTKVSHLGMVANWEFGLISEGMNAFLVPHVILR GIFGIASLCFRYLAISKENIQKEAPKKEKNKYVYKT >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_4|471_bp atggccaaacgtaccaagaaagtcgggatcgtcggtaaatacgggacccgctatggggcc tccctccggaaaatggtgaagaaaattgaaatcagccagcacgccaagtacacttgctct ttctgtggcaaagtaaaccaagatgaagagacgagctgtggggatctggcactgtggttc ctgcatgaagacagtggctggcggtgcctggacgtacaaaattgtgaggatagtcattac cagtctctgattgagaagcagcttacaaaggtatcgcatcttggtatggtagctaattgg gaatttgggttgattagtgaagggatgaatgcttttttggttccacatgtgatacttagg ggcatctttggaattgcatctctgtgcttcagataccttgccatatcaaaagaaaatatc cagaaagaagcaccaaaaaaggaaaagaacaaatacgtgtacaagacatag >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_5|128_aa MPKWNQISMNSANGRRGKGRLYTLVVWTGGELDIAGRTKVDEVHGQPFGSDSDHFCLHFM NKASHIAYGSSTVWQLPACKVLPRLLIGFLQYEGKVIILTLDENTDSEKLSDLLKVTQLA SSRAKAGT >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_5|387_bp atgccaaaatggaaccaaatctccatgaactctgctaatggaagacgaggaaaaggcagg ttgtacacactagtagtgtggacaggaggagagttggatattgctggaaggaccaaggta gatgaagtacacggccagccctttggaagtgactcagatcacttctgcctacatttcatg aacaaagcaagccacatagcttatgggagttcaacagtatggcaacttccagcttgcaag gtgctacctcgtcttctcattggattcttacaatatgaaggcaaagttatcattctcact ttagatgaaaacactgactcagagaagttaagtgacttgctcaaagtcacccagctggct agcagcagggccaaggccggaacctag >gi568815596f:216399269_216601401|GENSCAN_predicted_peptide_6|203_aa VKVRQKINSQGKYDEDKKRLVDLENEVAVMILPWHYQSDLDGNQVTKEGGMVRGHKKPVT PSPGTVPTSHTDRCFTPPLLGSTSYSPWSFGQLAPEIFQWPIREQAGRAVTPAMEVRIED FINKVLSSSLVVWILKPALKEEGEGFRGNAPRADPQDGFLQQMQPPLIGCLLQEKHQEVR IAMGGTEDGGPLICGIGNGLWVT >gi568815596f:216399269_216601401|GENSCAN_predicted_CDS_6|612_bp gttaaagtgaggcaaaagataaacagtcaaggaaaatatgatgaggataagaaaaggcta gtggatttggaaaatgaggtggcagtgatgatcttgccatggcactaccagagtgacctg gatggaaaccaggtcacaaaggaaggaggaatggtacgtggccataagaaacccgtgaca ccctcccctggaactgtacccacgtcccacacggacagatgcttcactcctccacttctt ggcagtacttcctacagcccctggagctttggccagctagccccagaaatcttccagtgg cccatcagagagcaagctggcagagctgtcactcccgccatggaagtcaggatagaggac ttcatcaacaaggttctttcatcttccttagttgtttggatccttaagcctgccttgaaa gaagagggagaaggcttcaggggaaatgctccccgcgctgacccacaggatggcttcctt caacaaatgcagccacctttgattggctgcctgctacaagagaaacatcaggaggtacgt attgccatggggggaactgaagatggaggacccctcatttgtggaattgggaacgggctt tgggtcacatga