GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:18:09 Sequence gi568815578f:31505388_31705834 : 200447 bp : 49.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9072 9347 276 2 0 63 97 426 0.840 38.71 1.02 Intr + 22097 22195 99 1 0 115 106 137 0.920 18.51 1.03 Intr + 32792 32874 83 1 2 95 71 106 0.899 8.04 1.04 Intr + 37041 37196 156 0 0 98 70 43 0.809 2.63 1.05 Intr + 39560 39648 89 2 2 134 86 65 0.993 10.51 1.06 Intr + 42052 42305 254 2 2 68 103 111 0.502 7.75 1.07 Intr + 43578 43727 150 2 0 81 113 108 0.128 12.96 1.08 Intr + 43820 43945 126 1 0 63 92 207 0.999 19.48 1.09 Intr + 44677 44734 58 0 1 146 101 98 0.999 15.36 1.10 Intr + 48152 48169 18 0 0 102 111 1 0.566 0.38 1.11 Intr + 49359 49442 84 1 0 104 99 91 0.977 11.59 1.12 Intr + 54224 54260 37 0 1 113 109 92 0.998 11.12 1.13 Intr + 56247 56349 103 0 1 88 80 200 0.991 19.38 1.14 Intr + 60823 60908 86 0 2 76 110 73 0.857 6.92 1.15 Term + 62691 62841 151 0 1 101 38 64 0.565 0.08 1.16 PlyA + 66792 66797 6 1.05 2.02 PlyA - 68140 68135 6 1.05 2.01 Sngl - 73489 73064 426 1 0 81 37 296 0.959 18.10 2.00 Prom - 77647 77608 40 -6.76 3.03 PlyA - 80909 80904 6 1.05 3.02 Term - 89548 89358 191 2 2 61 41 98 0.185 0.01 3.01 Init - 99545 99287 259 2 1 30 94 252 0.954 17.30 3.00 Prom - 99621 99582 40 -10.94 4.00 Prom + 99863 99902 40 -5.76 4.01 Init + 100001 100426 426 1 0 63 78 822 0.971 73.00 4.02 Term + 101197 101277 81 0 0 93 39 75 0.788 0.79 4.03 PlyA + 102487 102492 6 1.05 5.00 Prom + 104453 104492 40 -4.26 5.01 Init + 105294 105315 22 2 1 76 105 21 0.124 2.69 5.02 Intr + 127822 128049 228 2 0 78 58 69 0.386 0.74 5.03 Intr + 134546 134710 165 0 0 109 95 267 0.926 29.43 5.04 Intr + 138017 138148 132 0 0 47 113 163 0.998 15.32 5.05 Term + 139381 139517 137 2 2 81 38 257 0.998 18.18 5.06 PlyA + 141214 141219 6 1.05 6.03 PlyA - 142312 142307 6 1.05 6.02 Term - 160699 160562 138 1 0 150 47 254 0.999 25.46 6.01 Init - 191206 191135 72 1 0 72 100 15 0.175 2.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 114400 114292 109 0 1 71 103 40 0.903 4.09 S.002 Intr + 174100 174227 128 0 2 26 103 145 0.930 9.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_1|589_aa GNVAFPAEPVSPPASLLQQPELESDPERTLAMDSALSDPHNGSAEAGGPTNSTTRPPSTP EGIALAYGSLLLMALLPIFFGALRSVRCARGKNASDMPETITSRDAARFPIIASCTLLGL YLFFKIFSQEYINLLLSMYFFVLGILALSHTIRGEGHQDVACQWEAMTFTFDMSHQGPGR ISACYKSLHSESFSKHFSRCLAGTSPFMNKFFPASFPNRQYQLLFTQGSGENKEGDSTGA LPIPFVSLLSPASPWIMFKKFDEKESVSNCIQLKTSVIKGIKSQLVEQFPGIEPWLNQIM PKKDPVKIVRCHEHTEILTGLTGGRGSPASGSGLTWPLCSEIINYEFDTKDLVCLGLSSI VGVWYLLRKHWIANNLFGLAFSLNGVELLHLNNVSTGCILLGGLFIYDVFWVFGTNVMVT VAKSFEAPIKCDKTKAVVFPQDLLEKGLEANNFAMLGLGDVVIPGIFIALLLRFDISLKK NTHTYFYTSFAAYIFGLGLTIFIMHIFKHAQPALLYLVPACIGFPVLVALAKGEVTEMFS YESSAEILPHTPRLTHFPTVSGSPASLADSMQQKLAGPRRRRPQNPSAM >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_1|1770_bp gggaacgtggctttccctgcagagccggtgtctccgcctgcgtccctgctgcagcaaccg gagctggagtcggatcccgaacgcaccctcgccatggactcggccctcagcgatccgcat aacggcagtgccgaggcaggcggccccaccaacagcactacgcggccgccttccacgccc gagggcatcgcgctggcctacggcagcctcctgctcatggcgctgctgcccatcttcttc ggcgccctgcgctccgtacgctgcgcccgcggcaagaatgcttcagacatgcctgaaaca atcaccagccgggatgccgcccgcttccccatcatcgccagctgcacactcttggggctc tacctctttttcaaaatattctcccaggagtacatcaacctcctgctgtccatgtatttc ttcgtgctgggaatcctggccctgtcccacaccatcaggggagaagggcaccaggatgtg gcttgtcagtgggaagcaatgacctttacgtttgacatgagccatcaagggccaggaaga atctctgcctgctacaaaagtctgcattcagagtcattcagcaagcattttagcagatgt ctagcaggcaccagccccttcatgaataagttttttccagccagctttccaaatcgacag taccagctgctcttcacacagggttctggggaaaacaaggaaggggactccaccggagcc ttgccaattccgtttgtttccctgttgtcgcccgcttcaccctggatcatgttcaagaag tttgatgaaaaggaaagtgtgtccaactgcatccagttgaaaacgtcagttattaagggc attaagagccaactggtagagcaatttccaggtattgaaccatggcttaatcaaatcatg cctaagaaagatcctgtcaaaatagtccgatgccacgaacatacagaaatccttaccggg ctgacaggtgggaggggtagccctgcctcagggagtggacttacctggcctctctgctca gagatcatcaattatgaatttgacaccaaggacctggtgtgcctgggcctgagcagcatc gttggcgtctggtacctgctgaggaagcactggattgccaacaacctttttggcctggcc ttctcccttaatggagtagagctcctgcacctcaacaatgtcagcactggctgcatcctg ctgggcggactcttcatctacgatgtcttctgggtatttggcaccaatgtgatggtgaca gtggccaagtccttcgaggcaccaataaaatgtgacaaaactaaggcagtggtgtttccc caggatctgctggagaaaggcctcgaagcaaacaactttgccatgctgggacttggagat gtcgtcattccagggatcttcattgccttgctgctgcgctttgacatcagcttgaagaag aatacccacacctacttctacaccagctttgcagcctacatcttcggcctgggccttacc atcttcatcatgcacatcttcaagcatgctcagcctgccctcctatacctggtccccgcc tgcatcggttttcctgtcctggtggcgctggccaagggagaagtgacagagatgttcagc tacgagtcctcggcggaaatcctgcctcataccccgaggctcacccacttccccacagtc tcgggctccccagccagcctggccgactccatgcagcagaagctagctggccctcgccgc cggcgcccgcagaatcccagcgccatgtaa >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_2|141_aa MGLAGPALGAAGRPAAPGSEGLSTRASSCRGCTGSPSSAGPPALRWISCRALAASPWDTA GDLQPAMPPRPMGSCAAQASPTSAAPSSMAPGPIDPPRAEECGRHTAWDWQAAPPAAPVR DPQGEASRAPESSGDLENLYV >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_2|426_bp atgggcttggcgggccccgcacttggagcggctggccggcccgccgccccaggcagtgag gggcttagcacccgggccagcagctgcagagggtgcaccgggtccccaagcagtgctggc ccaccggcgctgcgctggatttcttgccgggccttagctgcctccccgtgggacacggct ggggacctgcagcccgccatgcccccccgccccatgggctcctgtgcagcccaagcctcc cctacgagcgccgccccctcctccatggcgcccggtcccatcgaccccccaagggctgag gagtgcggtcggcacacggcctgggattggcaggcagctccacctgcagcccctgtgcgg gatccacagggtgaagccagccgggctcctgagtctagtggggacttggagaacctttat gtctag >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_3|149_aa MREIEARTEKRLALDSPESLGFRTLLEDASRTLIYKYQDRNLKRALLYICSPGLSQFLNN VAVRVPETRRAYPDPSRVGAINSGKSKQYTNAHTHTPNVAFICTTDTTDKPNSHTPIDTT CQMHNAQQYTIIYNQHTDLRAHYRNTKSA >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_3|450_bp atgagagaaattgaggcccgaacggagaagcggctggctctggatagcccagagagcctg ggtttccggaccctcttggaagatgcttctcggaccctgatctacaaatatcaggaccga aatttaaagcgggcgctcctttacatctgctcccctgggctttctcaattcctaaataat gttgctgttcgtgttcctgagacccggagggcctacccagatccctcccgggtgggagca attaattcgggaaagtctaagcaatatacaaatgcacacacccacacgcccaacgtggca tttatatgtacgactgacacaacagacaaacccaacagccacacaccaatagacacaacc tgccaaatgcacaacgcacaacagtacaccataatatacaaccaacacactgacctccgc gcacactaccgaaacaccaaatctgcataa >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_4|168_aa MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG RGLPVRAPLSTLNGEISALTAELVLGGELEGYIWIVALTGLNECFRCL >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_4|507_bp atgaaagtcgccagtggcagcaccgccaccgccgccgcgggccccagctgcgcgctgaag gccggcaagacagcgagcggtgcgggcgaggtggtgcgctgtctgtctgagcagagcgtg gccatctcgcgctgcgccgggggcgccggggcgcgcctgcctgccctgctggacgagcag caggtaaacgtgctgctctacgacatgaacggctgttactcacgcctcaaggagctggtg cccaccctgccccagaaccgcaaggtgagcaaggtggagattctccagcacgtcatcgac tacatcagggaccttcagttggagctgaactcggaatccgaagttggaacccccgggggc cgagggctgccggtccgggctccgctcagcaccctcaacggcgagatcagcgccctgacg gccgagctggttctgggaggagaattggagggctacatctggattgttgctcttaccggc ctgaatgagtgtttccggtgtctttaa >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_5|227_aa MAAGKKGGLSLSQSSQHVGPVTTSGLNAVSGVPSTLGPPAVPGEDPYSSALGPRVACLKG QSVSSQVQDLERRLRSCPAGYIHPRGGGKMSPYTNCYAQRYYPMPEEPFCTELNAEEQAL KEKEKGSWTQLTHAEKVALYRLQFNETFAEMNRRSNEWKTVMGCVFFFIGFAALVIWWQR VYVFPPKPITLTDERKAQQLQRMLDMKVNPVQGLASRWDYEKKQWKK >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_5|684_bp atggctgccggaaagaaaggaggtctcagcctatcccagagctctcagcatgtcgggcct gtcacgacaagtggcctgaatgccgtgtccggagtcccttccacccttggaccccccgcg gttccaggagaagacccttattcctcggctctgggaccccgagtggcctgccttaaggga cagtccgtctcttcccaggttcaggaccttgaaaggaggctccgcagttgtcctgcaggc tatattcacccccgtggtggggggaagatgtccccctacaccaactgctatgcccagcgc tactaccccatgccagaagagcccttctgcacagaactcaacgctgaggagcaggccctg aaggagaaggagaagggaagctggacccagctgacccacgccgaaaaggtggccttgtac cggctccagttcaatgagacctttgcggagatgaaccgtcgctccaatgagtggaagaca gtgatgggttgtgtcttcttcttcattggattcgcagctctggtgatttggtggcagcgg gtctacgtatttcctccaaagccgatcaccttgacggacgagcggaaagcccagcagctg cagcgcatgctggacatgaaggtgaatcctgtgcagggcctggcctcccgctgggactat gagaagaagcagtggaagaagtga >gi568815578f:31505388_31705834|GENSCAN_predicted_peptide_6|69_aa MYTVHGHMTGRPWYASSGKDSHLEDTFVELYGNNAAAESRKGQERFNRWFLTGMTVAGVV LLGSLFSRK >gi568815578f:31505388_31705834|GENSCAN_predicted_CDS_6|210_bp atgtatacagtgcatgggcatatgactgggagaccttggtacgccagcagtggcaaggac agtcacctagaagatacttttgtggaactctatgggaacaatgcagcagccgagagccga aagggccaggaacgcttcaaccgctggttcctgacgggcatgactgtggccggcgtggtt ctgctgggctcactcttcagtcggaaatga