GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:16:52 Sequence gi568815582r:51037115_51241930 : 204816 bp : 44.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 107 238 132 1 0 72 103 32 0.528 3.94 1.02 Intr + 841 942 102 0 0 14 100 70 0.475 1.17 1.03 Term + 9936 10142 207 2 0 18 36 367 0.859 21.84 1.04 PlyA + 15259 15264 6 1.05 2.06 PlyA - 16648 16643 6 1.05 2.05 Term - 21402 21380 23 1 2 129 40 21 0.221 -0.13 2.04 Intr - 21625 21471 155 0 2 90 70 4 0.236 -1.48 2.03 Intr - 25298 25132 167 2 2 116 59 80 0.561 6.66 2.02 Intr - 28076 27928 149 0 2 77 97 38 0.374 3.55 2.01 Init - 41115 41097 19 0 1 90 87 3 0.080 0.64 2.00 Prom - 49063 49024 40 -3.86 3.00 Prom + 53648 53687 40 -2.46 3.01 Init + 78863 79000 138 1 0 55 7 280 0.426 14.74 3.02 Intr + 83431 83480 50 0 2 47 89 75 0.142 0.98 3.03 Intr + 88877 88929 53 2 2 97 58 31 0.329 -0.45 3.04 Term + 94821 94969 149 1 2 87 45 152 0.833 8.86 3.05 PlyA + 96471 96476 6 1.05 4.08 PlyA - 99586 99581 6 1.05 4.07 Term - 100438 99998 441 1 0 94 42 626 0.991 53.86 4.06 Intr - 105031 101574 3458 2 2 76 77 2687 0.598 253.79 4.05 Intr - 116898 116644 255 1 0 1 60 168 0.002 2.72 4.04 Intr - 135435 135304 132 1 0 63 71 80 0.545 4.42 4.03 Intr - 140546 140522 25 2 1 67 107 10 0.055 -1.60 4.02 Intr - 148479 148289 191 1 2 84 99 58 0.652 5.80 4.01 Init - 156086 156011 76 2 1 91 107 28 0.918 6.34 4.00 Prom - 158390 158351 40 -4.96 5.05 PlyA - 159013 159008 6 1.05 5.04 Term - 160538 160519 20 0 2 109 36 40 0.680 -0.62 5.03 Intr - 160990 160877 114 2 0 111 93 65 0.697 9.72 5.02 Intr - 177093 176923 171 1 0 9 92 91 0.022 1.51 5.01 Init - 189467 189455 13 2 1 114 98 20 0.563 4.90 5.00 Prom - 196865 196826 40 -1.46 6.04 PlyA - 197432 197427 6 1.05 6.03 Term - 202884 202096 789 0 0 9 44 737 0.972 54.72 6.02 Intr - 203244 202914 331 2 1 72 -78 261 0.467 3.53 6.01 Init - 204086 204013 74 2 2 92 39 42 0.506 0.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_1|146_aa GHPLPISREAEKLMCREVVGKEAFPEVTDRHIGCDPLPLEFTMQGCFHLLAVTNNADMKS NAQLHVVFVSYSSKQIYKPGRQERDSVSKNKKKKEKEKEKEKEKEKEKEKKKEEEEEEEE EEEEEEEEEEEEAEAAAAAGHSGSCL >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_1|441_bp gggcatccactgcccatcagcagagaggccgagaagctgatgtgcagggaagtggtggga aaggaagcctttccagaagtgactgaccgtcacattgggtgcgatccactgccactagaa ttcaccatgcagggctgtttccaccttttggctgttacaaataatgctgatatgaagtcc aatgcacaacttcacgtggtttttgtcagttactcttcaaaacaaatctacaagcctggg cgacaagagcgagactctgtctcaaaaaataagaagaagaaggagaaggagaaggagaag gagaaggagaaggagaaggagaaggagaagaagaaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagcagaagcagcagcagcagctggg cacagtggctcatgcctgtaa >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_2|170_aa MAKRGQGTWMKLEAIILSKLTQEQKTKHQMFSFISGSRIMRTHGHREGNIIHRGLLMENL QPREASVQMHTENLAKAPTDVLLLLLGIIIIAILYKLQHPSPAPGGYSLTCWACCPMSNF PLSAPAGLCLKFCWLSRAHCRPKGWGVNQQPPSAAFDECVTGAVPCGIKL >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_2|513_bp atggctaaaaggggccaagggacatggatgaagctggaagccatcattctcagcaaacta acacaggaacagaaaaccaaacaccaaatgttctcattcataagtgggagtcgaataatg agaacacatggacacagggaggggaacatcatacaccggggcctgttgatggaaaattta cagcccagagaagccagtgttcaaatgcacactgagaatctagcaaaagcgcccacagac gtcctcctcctcctccttggcatcatcatcattgcaatcctctataaattgcagcaccca tcaccagcccctggtggctactcactcacctgctgggcatgctgtccaatgtccaacttc ccactgtcagcacctgcaggtctttgtcttaagttttgctggctgtccagggctcactgc aggccaaaaggctggggagttaatcagcagcctccctcagcagcctttgacgagtgtgtg acgggagctgtcccttgtggaattaagctctag >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_3|129_aa MLLLLLLLLLLQLQALALVPLEQNLSPRPRVKSAAPTQQPVTCLLRQCLHESMDPIEDEF QNHISHKKFCRKSRVLSEPKDVSIYRMFPGHWLKAIKSAVKVLPSTYTVLQLSCENINEL PNCVDPKPG >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_3|390_bp atgctgctgctgctcctgttgctgctgctgctacagctccaggcccttgctctggtgcca ctggagcaaaatctctccccaagaccccgggtgaagagtgctgctccaacacaacaacca gtcacctgcttgctgaggcaatgtttacatgaaagtatggacccaatcgaggatgaattc cagaatcacatctcccacaagaaattctgcaggaaatccagggtcctgtctgaacccaaa gatgtttctatctatcgaatgttccctggtcattggttgaaggccatcaagtcggcggtg aaagtcttacccagtacttacacagtacttcagcttagctgcgaaaatatcaatgagctg ccgaactgtgtggaccccaagccaggctga >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_4|1525_aa MGKKLSMAVVLSDVKLSLREVGPLRDTILDNYSLLILLVLAAPGIVFLMLAGAPSQLHHS LRFSGGKALPALIRRELLPGPRPLESLWEGEEHNTQRGPGCISTVLASSRGLSRIQSLRC KPTHPVCHTYLLTVYVTVSSADQVTNTCEVIPGSFLLRTRRAPGEKLCGTASSARFLAAC GAREARTFKKPRLLEKDAGGSKHGHIPGKECASAHKRVRVPGTVWIGDTEKGQPSRPTKS KDAHVCGRCCAEFFELSDLLLHKKNCTKNQLVLIVNENPASPPETFSPSPPPDNPDEQMN DTVNKTDQVDCSDLSEHNGLDREESMEVEAPVANKSGSGTSSGSHSSTAPSSSSSSSSSS GGGGSSSTGTSAITTSLPQLGDLTTLGNFSVINSNVIIENLQSTKVAVAQFSQEARCGGA SGGKLAVPALMEQLLALQQQQIHQLQLIEQIRHQILLLASQNADLPTSSSPSQGTLRTSA NPLSTLSSHLSQQLAAAAGLAQSLASQSASISGVKQLPPIQLPQSSSGNTIIPSNSGSSP NMNILAAAVTTPSSEKVASSAGASHVSNPAVSSSSSPAFAISSLLSPASNPLLPQQASAN SVFPSPLPNIGTTAEDLNSLSALAQQRKSKPPNVTAFEAKSTSDEAFFKHKCRFCAKVFG SDSALQIHLRSHTGERPFKCNICGNRFSTKGNLKVHFQRHKEKYPHIQMNPYPVPEHLDN IPTSTGIPYGMSIPPEKPVTSWLDTKPVLPTLTTSVGLPLPPTLPSLIPFIKTEEPAPIP ISHSATSPPGSVKSDSGGPESATRNLGGLPEEAEGSTLPPSGGKSEESGMVTNSVPTASS SVLSSPAADCGPAGSATTFTNPLLPLMSEQFKAKFPFGGLLDSAQASETSKLQQLVENID KKATDPNECIICHRVLSCQSALKMHYRTHTGERPFKCKICGRAFTTKGNLKTHYSVHRAM PPLRVQHSCPICQKKFTNAVVLQQHIRMHMGGQIPNTPVPDSYSESMESDTGSFDEKNFD DLDNFSDENMEDCPEGSIPDTPKSADASQDSLSSSPLPLEMSSIAALENQMKMINAGLAE QLQASLKSVENGSIEGDVLTNDSSSVGGDMESQSAGSPAISESTSSMQALSPSNSTQEFH KSPSIEEKPQRAVPSEFANGLSPTPVNGGALDLTSSHAEKIIKEDSLGILFPFRDRGKFK NTACDICGKTFACQSALDIHYRSHTKERPFICTVCNRGFSTKGNLKQHMLTHQMRDLPSQ LFEPSSNLGPNQNSAVIPANSLSSLIKTEVNGFVHVSPQDSKDTPTSHVPSGPLSSSATS PVLLPALPRRTPKQHYCNTCGKTFSSSSALQIHERTHTGEKPFACTICGRAFTTKGNLKV HMGTHMWNSTPARRGRRLSVDGPMTFLGGNPVKFPEMFQKDLAARSGSGDPSSFWNQYAA ALSNGLAMKANEISVIQNGGIPPIPGSLGSGNSSPVSGLTGNLERLQNSEPNAPLAGLEK MASSENGTNFRFTRFVEDSKEIVTS >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_4|4578_bp atggggaagaagctgagcatggctgtggtcttgagtgatgttaagcttagcctgagggag gtggggcctctgagagacacaattcttgataattatagtttattgattctcctggtgtta gctgcccccggcatcgtcttcctcatgttagctggggcaccttcccagctccatcacagc ctgcgtttctcaggaggcaaagcactcccagcgctaatccggagagagctgctgcctgga ccaaggcccctggagtccctctgggagggtgaagaacacaacacacagagaggtcctggt tgcatctccacggtgctggcatcctccagaggactcagccgcattcagtccttaagatgc aagcccacgcaccccgtttgccacacatacctgttgacagtttatgtgacagtttcctct gcagaccaagtaacaaacacgtgtgaagttattcccgggagcttcctgctgcgcacgcgg cgcgcaccaggagaaaagctttgcgggaccgccagctccgctaggttcctggctgcatgc ggtgcccgggaggcccggactttcaagaagccaagattgttagagaaggatgcgggcggc tccaaacacgggcacattccgggaaaggagtgtgcaagtgcacacaaacgcgttcgtgta cctgggacagtgtggattggagacacagaaaagggtcaaccgagtcgccctactaagagc aaggatgcccacgtctgtggccggtgctgtgccgagttctttgaattatcagatcttctg ctccacaagaagaactgtactaaaaatcaattagttttaatcgtaaatgaaaatccagcc tccccacccgaaaccttctcccccagcccccctcctgataatcctgatgaacaaatgaat gacacagttaacaaaacagatcaagtggactgcagcgacctttcagaacacaacggactt gacagggaagagtccatggaggtggaggccccggttgctaacaaaagcggcagcggcact tccagcggcagccacagcagtaccgccccaagcagcagcagcagcagcagcagcagcagc ggcggcggcggcagctcctccacaggtacctcagcgatcacaacctctctacctcaactc ggggacctgacaacactgggcaacttctccgtaatcaacagcaacgtcatcatcgagaac ctccagagcaccaaggtggcggtggcccagttctcccaggaagcgaggtgcggcggggcc tctgggggcaagctggccgtcccagccctcatggaacaactcctagctctgcagcagcag cagatccaccagctgcaattgatcgaacagattcgtcaccaaatattgctgttggcttct cagaatgcagacttgccaacatcttctagtccttctcaaggtactttacgaacatctgcc aaccccttgtccacgctaagttcccatttatctcagcagctggcagcagcagctggattg gcacagagcctcgccagccaatctgccagcattagtggtgtgaaacagctacccccaatc cagctacctcagagcagttctggcaacaccatcattccatccaacagcggctcttctccc aatatgaacatattggcagcggcagttaccaccccgtcctctgaaaaagtggcttcaagt gctggggcctcccatgtcagcaacccagcggtctcatcatcgtcctcaccagcttttgca ataagcagtttattaagtcctgcgtctaatccacttctacctcagcaagcctccgctaac tcggttttccccagccctttgcccaacatcggaacaactgcagaggatttaaactccttg tctgccttggcccagcaaagaaaaagcaagccaccaaatgtcactgcctttgaagcgaaa agtacttccgatgaggcattcttcaaacacaagtgcaggttctgcgcgaaggtctttggg agtgacagtgccttgcagatccacttgcgttcccataccggagagaggccattcaagtgc aacatctgcgggaacaggttctccaccaaggggaatctgaaagtccactttcagcgccac aaagagaaataccctcatatccagatgaacccctatcctgtgcctgagcatttggacaat atccccacgagtactggcatcccatatggcatgtccatccctccagagaagccagtcacc agctggctagacaccaaaccagtcctgcctactctgaccacttcagtcggcctgccgttg cccccaaccctcccaagcctcatacccttcatcaagacggaagagccagcccccatcccc atcagccattctgccaccagccccccaggctcagtcaaaagtgactccgggggccctgag tcagccacaagaaacctaggtgggctcccagaggaagccgaagggtccactctgccaccc tctggtggcaaaagcgaagagagtggcatggtcaccaactcagtcccgacggcgagcagt agcgtcctgagctccccagcggcagactgcggccccgcgggcagtgccaccaccttcacc aaccctttgttgccgctcatgtccgagcagttcaaggccaagtttccttttgggggactc ctggactcagctcaggcatcagagacgtccaagcttcagcaactggtagaaaacattgac aagaaggccactgaccccaatgagtgcatcatctgccaccgggttctcagctgccagagc gccttgaaaatgcactacaggacacacactggggagaggccctttaagtgtaagatctgt ggccgggctttcaccacgaaagggaatcttaaaacccactacagtgtccatcgtgctatg cccccgctcagagtccagcattcctgccccatctgccagaagaagttcacgaacgctgtg gtcctgcagcagcacatccgaatgcatatgggaggccagatccccaacaccccagtcccc gacagctactctgagtccatggagtctgacacaggttcctttgatgagaaaaattttgat gacctagacaacttctctgatgaaaacatggaagactgtcctgagggcagcatccctgat acacctaagtctgcagacgcctcccaagacagcttatcctcttcgcctttgcccctcgag atgtcgagcatcgctgctttggaaaatcagatgaagatgatcaatgctggcctggcagag cagctacaggccagcctgaagtcagtggagaatgggtccatcgagggggatgtcctgacc aatgattcatcctcagtgggtggtgacatggaaagccaaagtgctggcagcccagccatc tcagagtctacctcttccatgcaggctctgtccccgtccaacagcacgcaggagttccac aagtcacccagcattgaggagaaaccacagagagcggtcccaagcgagtttgccaatggt ttgtctcccaccccagtgaatggtggggctttggatttgacatctagtcacgcagagaaa atcatcaaagaagattctttggggatcctcttcccttttagagaccggggtaaatttaaa aacactgcttgtgacatttgtggcaaaacatttgcttgtcagagtgccttggacattcac tatagaagtcataccaaagagagaccatttatttgcacagtttgcaatcgtggcttttcc acaaagggtaatttgaagcagcacatgttgacacatcagatgcgagatctgccatcccag ctctttgagcccagttccaaccttggccccaatcagaactcagcggtgattcccgccaac tcgttgtcatctctcatcaagacagaggtcaacggcttcgtgcatgtttctcctcaggac agtaaggacacccccaccagtcacgtcccgtctgggcctctgtcttcctctgccacatcc ccagttctgctccctgctctgcccaggagaactcccaagcagcactactgcaacacatgt ggcaaaaccttctcctcatcgagtgccctgcagattcacgagagaactcacactggagag aaaccctttgcttgcactatttgtggaagagctttcacgactaaaggcaatcttaaggta cacatgggcactcacatgtggaatagcacccctgcacgacggggtcggcggctctctgtg gatggccccatgacatttctaggaggcaatcccgtcaagttcccagaaatgttccagaag gatttggcggcaagatcaggaagtggggatccttccagcttctggaatcagtatgcagca gcgctctccaacgggctggcgatgaaggccaacgagatctccgtcattcagaacggtggc atccctccaattcctggaagcctcggcagtgggaacagctcacctgttagtgggctgacg ggaaacctggagaggctccagaactcagagcccaatgctcccctggccggcctggagaaa atggcaagcagtgagaacggaaccaacttccgcttcacccgcttcgtggaggacagcaag gagatcgtcacgagttaa >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_5|105_aa MPGRGLQLLPAKAAAAAQAFGSPGFIHLTMPLVKVLTRPSKHSGPQGPALHECYPQSYLK KGIHAPQQFPASCTLRRRDAETSAPTPYISFISKCDSEAGDIEFQ >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_5|318_bp atgcccggccggggcttacagctgttacctgctaaagctgcagcagcagcccaggccttt gggtctccagggttcatacacttaactatgcctctggttaaggttcttacaagaccaagc aagcactcaggtcctcaaggcccagctctccacgaatgctaccctcaatcttacctcaag aaaggaatccatgctccacagcagttcccagcaagctgcactcttcgaaggcgggatgct gaaacctctgcccccaccccctacattagctttatatccaaatgtgactcggaggctggt gacatcgagttccaataa >gi568815582r:51037115_51241930|GENSCAN_predicted_peptide_6|397_aa MSRNSLRSPPEEDAGTVLPVQPTEVSSPPGELCLRLKGSKRGSSLLSCQQITTSSLFITI TTITPITSTTISTSTNSTSTTTIFINISTTIFTSTTSITISTATTTTSTTTTISMTVFTS TTSTTTRITTTIFTTHLHYHLHHHHHHLISTTIFTSNTTTTAATTISIIISTTITTTSTT IITTSNTSTAITTTSTSTTIPPPPPSPPFPPPSPSPPPPSHHLHHHHHLHHNLHLTTFTT TIIFITISTTIFTSTTTTTTTIFITISTTSPPPSPPHPPLSSPSLPPSPPSSSPHPPLSS PSPPPPIPLPLLPPSLPTISITIFTTTTISTIITTSTTSTTITTITTTTITITITSTTIT ISITTNSIFTTTTSTTISTTISITSTTTCTGVIVPSL >gi568815582r:51037115_51241930|GENSCAN_predicted_CDS_6|1194_bp atgagcagaaacagcctgaggtccccaccagaagaagatgctggcactgtgcttcctgta cagcctacagaagtctcaagtccacctggggaactttgtctcagactgaaaggttcaaag agaggatcttctctgttaagctgccagcaaatcaccacctcctccttgttcatcaccatc accaccatcacccccatcacctccaccaccatctctacttctaccaactccacctccacc accacaattttcatcaacatctccaccaccatctttacctccaccacctccatcaccatt tccactgccaccaccaccacctcaaccaccaccaccatttccatgactgtcttcacctct accacttccaccactacccgcatcaccaccaccatctttaccacccatctccattaccac cttcaccaccatcaccaccacctcatctccacaaccattttcacttccaacaccaccacc accgctgccaccaccatctccattatcatctccaccaccatcaccaccacctccaccaca atcattaccacctccaacaccagcactgccatcaccaccacctccacctccaccaccatc ccaccacctccaccatcaccaccatttccaccaccatcaccatcacctccaccaccatcc caccacctccaccatcaccaccatctccaccacaatcttcacctcacaacattcaccacc accatcatcttcatcaccatctctaccaccatcttcacctccaccaccactaccaccacc accatcttcattaccatctccaccacatcaccaccaccatcaccaccacatccaccacta tcatcaccatctctaccaccatctccaccatcatcttcacctcacccaccactatcatca ccatcaccaccacctccaataccactaccactactgccaccatcactaccaactatctct atcaccatctttaccaccaccaccatctccaccatcatcaccacctctaccaccagcacc accatcaccaccatcaccaccaccaccatcaccatcaccatcacctccaccaccatcacc atctccatcaccaccaactccatcttcaccactaccacgagcaccaccatatctaccacc atctccatcaccagcacaaccacctgcactggggtgatcgtgccttctttgtga