GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:14:21 Sequence gi568815585r:96886749_97087759 : 201011 bp : 38.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1998 2077 80 2 2 49 54 88 0.032 2.08 1.02 Term + 20728 20875 148 1 1 80 41 175 0.409 8.39 1.03 PlyA + 21076 21081 6 1.05 2.00 Prom + 21209 21248 40 -4.95 2.01 Init + 21708 21772 65 2 2 73 66 52 0.880 2.17 2.02 Term + 22832 23006 175 2 1 44 48 184 0.111 6.25 2.03 PlyA + 24356 24361 6 1.05 3.00 Prom + 30456 30495 40 -4.05 3.01 Init + 32678 32821 144 1 0 68 49 56 0.559 -0.13 3.02 Term + 33075 33530 456 2 0 58 46 278 0.990 14.74 3.03 PlyA + 33891 33896 6 1.05 4.00 Prom + 34439 34478 40 -6.15 4.01 Sngl + 36347 37225 879 1 0 60 45 208 0.663 9.44 4.02 PlyA + 38124 38129 6 1.05 5.05 PlyA - 38749 38744 6 1.05 5.04 Term - 41221 41127 95 2 2 134 42 31 0.336 0.11 5.03 Intr - 42705 42467 239 2 2 17 57 163 0.018 2.44 5.02 Intr - 47250 47178 73 1 1 98 103 1 0.012 0.15 5.01 Init - 55918 55780 139 1 1 58 94 156 0.309 13.55 5.00 Prom - 58407 58368 40 -5.45 6.13 PlyA - 58826 58821 6 -0.45 6.12 Term - 59585 59472 114 2 0 72 34 147 0.132 5.29 6.11 Intr - 61106 60967 140 0 2 59 49 128 0.072 5.26 6.10 Intr - 61324 61159 166 1 1 20 58 144 0.075 3.21 6.09 Intr - 61748 61487 262 1 1 -39 -27 354 0.010 7.77 6.08 Intr - 61943 61750 194 2 2 89 -36 256 0.033 10.67 6.07 Intr - 71179 71015 165 1 0 45 72 78 0.005 1.14 6.06 Intr - 81781 81708 74 2 2 110 107 33 0.225 5.61 6.05 Intr - 83036 82919 118 2 1 40 92 123 0.045 7.02 6.04 Intr - 100956 100052 905 1 2 46 80 444 0.000 29.48 6.03 Intr - 105824 105760 65 1 2 97 78 96 0.159 6.94 6.02 Intr - 107815 107659 157 2 1 35 27 162 0.051 2.95 6.01 Init - 117864 117792 73 0 1 78 92 71 0.629 7.78 6.00 Prom - 138390 138351 40 -4.05 7.03 PlyA - 139335 139330 6 1.05 7.02 Term - 140381 140173 209 0 2 97 47 71 0.239 0.42 7.01 Init - 146743 146641 103 1 1 82 56 89 0.299 5.55 7.00 Prom - 162919 162880 40 -3.65 8.06 PlyA - 164108 164103 6 1.05 8.05 Term - 167436 167323 114 0 0 68 46 175 0.087 8.89 8.04 Intr - 182534 182372 163 1 1 18 63 184 0.161 7.96 8.03 Intr - 191812 191721 92 1 2 53 66 51 0.060 -2.73 8.02 Intr - 192645 192554 92 1 2 90 101 23 0.300 2.59 8.01 Intr - 196955 196854 102 0 0 122 75 72 0.698 8.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 22856 23006 151 2 1 72 48 184 0.856 9.20 S.002 Sngl - 101011 99998 1014 1 0 87 44 522 0.991 44.46 S.003 Term - 164469 164021 449 1 2 62 43 204 0.878 7.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_1|75_aa MSDIQVMLMQDKGSHDLGQLLPCGFAGLWVLKYKTNEQQLFCSWPFALELQQSGRHRSGG QDWHNVLVSGTQSHC >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_1|228_bp atgtctgacatccaagtcatgctgatgcaagacaagggttcccatgatcttggacagctt ctcccctgtggctttgcaggactctgggttcttaagtacaagacgaatgagcagcagctc ttctgttcatggccatttgccttagagttgcagcagagtggacggcaccggagtggaggt caagattggcataacgtcttggtgtcagggacacaatctcattgctga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_2|79_aa MRSLLRSEIEENTEHYTNQDKWLCQNLSPRLNVAECLKVVPGVYVVVGFPVDSDSQQELN IAVLADEQTQIIVKEDNVF >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_2|240_bp atgaggtccttgctcaggagcgaaatagaagagaacactgaacattacactaatcaagat aaatggctgtgccaaaatctgtcccctagactgaatgtagctgagtgtctgaaggtcgtg cctggcgtctatgttgtagtcgggttcccagttgattctgactcacagcaagaactgaac attgctgtgttagctgatgagcaaacacaaataattgtcaaagaagacaatgtgttctga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_3|199_aa MPPSRGRQTLHTGELPPAPGRCPSGMKLQEEGASNNLCCSAASAGDTHDHNSSPAREQNW TENEFDELTEVGFRRWVITNSSKLKEHVLVQCKEAKNLDKGLQELLTRITSLENNINDLM DLKNTARELCEAYTSINSRIDQVEERISEIEDGLNEIKHEDKIREKRMRRNKQSLQKIQD YVKRPNLHLIGLLESDREN >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_3|600_bp atgcctcccagcaggggtcgacaaacacttcatacaggagagctccctccagcacctggc aggtgcccctctgggatgaagcttcaagaggaaggggcaagcaacaatctttgctgttct gcagcctctgctggtgatacccacgatcacaactcttcaccagcaagggaacaaaactgg acagagaatgaatttgatgaattaacagaagtaggcttcagaaggtgggtaataacaaac tcctccaagctaaaggagcatgttctagtccaatgcaaggaagctaagaaccttgataaa gggttacaggaactgctaactagaataaccagtttagagaataacataaatgacctgatg gacctgaaaaacacagcaagagaactttgtgaagcatacacaagtatcaatagccgaatc gatcaagtggaagaaaggatatcagagattgaagatggacttaatgaaataaagcatgaa gacaagattagagaaaaaagaatgagaaggaacaaacaaagcctccaaaaaatacaggac tatgtgaaaagaccaaacctacatttgattggtttacttgaaagtgacagggagaactga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_4|292_aa MSELPFTIATKKIKYLEIQLTRDMKDFFKQNYKPLFKGIREDTNKWKNIACSWIGRANIM KIAILPKVIHRFNSIPIKLPLTFFKELEKSTLNFMRNQERAHIAKTIISKKNKAGGIMLT DFKLYYKATVIKTAWNWYHNRHIDQWNRTQASEMTPHTYNNLIFNKPDKNKQWGKDSLFN NWCWENWIAICRKQKLDPFLTRYTKINSRWIKDLNIRPKALKTLEENLGNMIQDIGMGKD FMTKTPKAMAIKAKIDKWDLIKLMSFCKAKETIIRVEQATYRMGENFCNLSI >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_4|879_bp atgagtgaactcccattcacaattgctacaaagaaaataaaatacctagaaatacaactt acaagggacatgaaggacttcttcaagcagaactacaaaccactgttcaaaggaataaga gaggacacaaacaaatggaaaaacattgcatgctcatggataggaagagccaatatcatg aaaatagccatattgcccaaagtaattcatagattcaattctattcccatcaagctacca ttgactttcttcaaagaattagaaaaatctactttaaatttcatgcggaaccaagaaaga gcccatatagccaagacaatcataagcaaaaagaacaaagctggaggcatcatgctaact gacttcaaattatactataaggctacagtaatcaaaacagcatggaattggtaccataac agacacatagaccaatggaacagaacacaggcctcagaaatgactccacacacctacaac aatctgattttcaacaaacctgacaaaaacaagcaatggggaaaggattccctatttaat aactggtgttgggaaaactggatagccatatgcagaaagcagaaactggaccccttcctt acacgttatacaaaaattaactcaagatggattaaagacttaaacataagacctaaagcc ttaaaaaccctagaagaaaacctaggcaatatgattcaggacataggcatgggcaaagac ttcatgactaaaacaccaaaagcgatggcaataaaagctaaaatagacaaatgggatcta attaaactaatgagcttctgcaaagcaaaagaaactatcatcagagttgaacaggcaacc tacagaatgggagaaaatttttgcaatctatccatctga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_5|181_aa MGVLQTKGVKLAKAEAKERPDTFKPLTEFSFKWSIKSKDGEGAGEGVNKLSLLLSTTYSP SSPSFPRTSLIFKWYGGLFGLLPWFPMSFWCGPADGHYLNQVAVPVRRRFSPERGKVGDS LNGSNHMYIGVMVNTECQFDWVEGYKVLILGSELLQGAFWENSRKQGYVLDKVLSESRHN L >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_5|546_bp atgggagtgcttcagacaaagggagtgaaattggcaaaagctgaagcaaaagaaaggcct gatactttcaagcctctgactgaattcagttttaagtggagcatcaagagtaaggatgga gaaggagctggagagggagtgaataaactctctctgcttctatcaaccacatattcacca tcctctccgtccttcccgagaacttcactcatttttaagtggtatggaggcctgtttgga ctcttgccttggtttcccatgagtttttggtgtgggccagctgatggtcattacctgaac caagtagcagtaccggtaagaagaaggttttctccagagagaggcaaagttggggacagt ttaaatggcagcaaccacatgtacattggtgtgatggttaatactgagtgtcaatttgat tgggttgaaggatacaaagtattaatcctgggatctgagcttctgcaaggtgctttttgg gagaattccaggaagcagggatatgttctggataaggtattgtcagaaagcaggcataat ctatga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_6|810_aa MPSEQIILVDPQGHRVQRYLKQVLVSRQTGMEDVQRSREWKPLSLCLLGSGRGGMLLARL LGARTREHALARWASEGILESQSVRFLGVLASGQRSTNAAFGNCTDENIPLKMHYLPVIY GIIFLVGFPGNAVVISTYIFKMRPWKSSTIIMLNLACTDLLYLTSLPFLIHYYASGENWI FGDFMCKFIRFSFHFNLYSSILFLTCFSIFRYCVIIHPMSCFSIHKTRCAVVACAVVWII SLVAVIPMTFLITSTNRTNRSACLDLTSSDELNTIKWYNLILTATTFCLPLVIVTLCYTT IIHTLTHGLQTDSCLKQKARRLTILLLLAFYVCFLPFHILRVIRIESRLLSISCSIENQI HEAYIVSRPLAALNTFGNLLLYVVVSDNFQQAVCSTVRCKGIDMIALFVRHIRDSVVSRK EPKLVMPPDDAAFSVELKHKKIQLLDNIKIKASPNGLGLSVPKTAVVIGGDVHLVFLCVF LSRMKEMFFSFRVLLHDKKEFLGRATTSLANSIPSVSAEQCRLHGIDGPAAVMKPDAQDL ETKVHVLSVSSPVLGENAEESVDGENTLDTASKPDLQKIFQKQYISDSLNFDEETDGEEE EGKKIRSQATYSEKERPNSTSSLHSTTDANASGSYAAAQPAGNELGEVEELEDFASVFDF AAQPLHNRLRELESGKRANHPTALSPLIQLIYLVKEKGILANLDLTSGEPSLQFMTAASA PSWPESGEVGSEGPRKMSVIIPGMTVNHEQIPFQPRNNDGSLFSRWQNKTMVRADKLAFT KLLWLHTESLNGSSINIPMAGYCFYESIYV >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_6|2433_bp atgcccagtgagcagataattctggtggacccacaaggccatagagttcagcgatactta aagcaagtgctggtttcccgccaaaccgggatggaggatgtccagaggtcccgggagtgg aagccgctgtccctgtgtctactgggaagtgggcgaggcgggatgctgctggccaggctg ctcggggcgaggacccgagaacacgcacttgctcgctgggcctcggagggaatcttggaa tcccaatccgtgaggttcctgggtgtgctggcatcaggacagcggtccacgaacgctgct tttggaaattgcactgatgaaaacatcccactcaagatgcactacctccctgttatttat ggcattatcttcctcgtgggatttccaggcaatgcagtagtgatatccacttacattttc aaaatgagaccttggaagagcagcaccatcattatgctgaacctggcctgcacagatctg ctgtatctgaccagcctccccttcctgattcactactatgccagtggcgaaaactggatc tttggagatttcatgtgtaagtttatccgcttcagcttccatttcaacctgtatagcagc atcctcttcctcacctgtttcagcatcttccgctactgtgtgatcattcacccaatgagc tgcttttccattcacaaaactcgatgtgcagttgtagcctgtgctgtggtgtggatcatt tcactggtagctgtcattccgatgaccttcttgatcacatcaaccaacaggaccaacaga tcagcctgtctcgacctcaccagttcggatgaactcaatactattaagtggtacaacctg attttgactgcaactactttctgcctccccttggtgatagtgacactttgctataccacg attatccacactctgacccatggactgcaaactgacagctgccttaagcagaaagcacga aggctaaccattctgctactccttgcattttacgtatgttttttacccttccatatcttg agggtcattcggatcgaatctcgcctgctttcaatcagttgttccattgagaatcagatc catgaagcttacatcgtttctagaccattagctgctctgaacacctttggtaacctgtta ctatatgtggtggtcagcgacaactttcagcaggctgtctgctcaacagtgagatgcaaa ggtatagatatgatagcactctttgtccggcatattagagatagtgtagtttctagaaaa gagcctaagttagtgatgcccccagatgatgctgccttctcagtagaacttaaacataaa aagatccaactgttggataatattaaaatcaaggcttcccccaacgggctaggtttgtct gtccccaagacggcggttgttattggtggtgatgttcatcttgtgtttctgtgtgtcttt ctctccagaatgaaagaaatgtttttcagctttagggtcctgcttcatgacaaaaaggaa tttctaggtcgtgctacaacttcccttgccaacagcattccttctgtgtctgctgagcaa tgtcgtcttcatggcattgatggtccggctgcagtcatgaagccagatgctcaagatttg gaaaccaaagttcatgttctttcagtcagctcccctgttctgggagaaaatgctgaagag agtgttgatggggaaaacaccttggatactgcttccaagccagatcttcagaagattttc caaaaacaatatatctcagatagtctgaactttgatgaggagactgatggcgaggaagaa gaaggaaagaaaataagatcccaggctacatattcagaaaaagaaagacccaattccaca tccagcctgcactcgaccacagatgcaaatgcttccggttcctatgctgctgcccaaccg gccggtaacgagctgggggaagtggaggaactggaggactttgcttccgttttcgatttt gcagcccaaccgctgcataaccggctgcgagaactagaaagcgggaaaagagcaaatcat ccaactgccttatctccactgatccaactgatttatctcgtgaaggagaaaggtatactg gcaaacttagatctaacctcaggggaaccaagtttacagtttatgaccgcggcgtctgcc ccatcatggccggagtctggtgaagtgggatctgaaggtcctaggaaaatgtctgtgatc attcccggaatgacggtgaatcacgagcagatcccatttcagccacgaaataacgacggc agcttgttctcaagatggcagaacaaaactatggtccgggctgacaagctcgccttcact aagctcctctggctgcatactgagtctctcaatggcagcagcatcaacattcccatggcc ggctattgcttctatgaatccatttatgtttga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_7|103_aa MVRFPLEIGEADGHVGMDSKEAMGPEEGLLYESFFYCKQEEESRSFPEHVAGTYPQLNTQ VYLLQAPASHITAGHNSRKLSAFPNKGILSFAPLSNNTILLSF >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_7|312_bp atggtcagattcccacttgagataggcgaggctgatggccatgttgggatggactcaaag gaggccatgggtccagaagaaggcctactctatgaaagtttcttttactgtaagcaggaa gaagaaagcaggtcatttcctgagcatgttgctgggacataccctcagctaaatacgcaa gtttatctcttacaagctcctgcttcccacataactgcaggacataattccaggaagctt tctgcattccctaacaagggaattctctcctttgccccactttccaataacacaatcctc ctttccttctga >gi568815585r:96886749_97087759|GENSCAN_predicted_peptide_8|187_aa XYKNRKASPRILPMTDGKITPGETQKRDAGKNYTGLELGPGGRLLDHGGGFSWFNIIPLG AVITMIQPSQNGLLPLENNLVAYYLSDFPTLLGKAILLANSTLETDYGSMGVRWRMDTAC SVSNSSGTMWRINILPVQKEGSYYGQQRGKECSSSPATEQSWMENDFGELREEGFRRSNF SELKEEV >gi568815585r:96886749_97087759|GENSCAN_predicted_CDS_8|564_bp ngttacaagaataggaaggcctctcccagaatactccctatgacagatggtaaaattacg ccaggagagacccagaagagagatgcaggcaagaactacacagggctggagttggggcct ggtgggaggttactggatcatgggggtggtttctcatggtttaacatcatcccccttggt gctgttatcacaatgatacaaccatcccaaaacggcctgctaccacttgaaaataacctg gtggcctactatctcagtgattttcccaccttacttggtaaagccatactactggcaaat tctaccctggaaactgactatgggtcaatgggagtccgctggcgaatggataccgcctgt tctgtttccaattcatcaggaaccatgtggagaattaatatcctgcccgtccagaaggaa gggagttattatggacagcaaagaggaaaggaatgcagctcctcaccagcaacagaacaa agctggatggagaatgactttggcgagttgagagaagaaggcttcagacgatcaaacttc tccgagctaaaggaggaagtttga