GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:00:49 Sequence gi568815578r:40587882_40788850 : 200969 bp : 45.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 21373 21412 40 -2.46 1.01 Sngl + 32359 33372 1014 0 0 88 43 601 0.844 52.62 1.02 PlyA + 33600 33605 6 1.05 2.00 Prom + 33769 33808 40 -4.96 2.01 Sngl + 34231 35295 1065 0 0 66 51 380 0.621 29.25 2.02 PlyA + 35487 35492 6 1.05 3.00 Prom + 35635 35674 40 -8.96 3.01 Sngl + 35876 36448 573 1 0 86 47 243 0.988 16.17 3.02 PlyA + 36620 36625 6 -0.45 4.00 Prom + 36990 37029 40 -2.46 4.01 Init + 60138 60234 97 2 1 57 61 67 0.116 1.37 4.02 Intr + 65577 65708 132 1 0 119 86 14 0.231 4.92 4.03 Intr + 73050 73123 74 1 2 64 52 67 0.070 -0.17 4.04 Intr + 76450 76549 100 0 1 107 110 1 0.241 3.88 4.05 Intr + 82821 82856 36 1 0 117 68 13 0.005 0.43 4.06 Intr + 91371 91550 180 1 0 64 92 119 0.243 9.74 4.07 Intr + 94839 95005 167 1 2 -1 -11 179 0.206 -1.22 4.08 Term + 95338 95700 363 0 0 44 46 183 0.439 4.27 4.09 PlyA + 97974 97979 6 1.05 5.17 PlyA - 98359 98354 6 1.05 5.16 Term - 101065 99998 1068 1 0 111 47 1620 0.400 152.33 5.15 Intr - 102604 102396 209 2 2 76 91 128 0.549 10.70 5.14 Intr - 104956 104843 114 2 0 21 22 144 0.525 1.52 5.13 Intr - 109984 109795 190 1 1 79 52 116 0.883 6.26 5.12 Intr - 111589 111427 163 0 1 81 99 11 0.216 1.48 5.11 Intr - 115744 115672 73 2 1 65 103 42 0.173 1.86 5.10 Intr - 118430 118314 117 0 0 88 53 37 0.072 0.64 5.09 Intr - 125956 125836 121 1 1 91 36 57 0.009 0.87 5.08 Intr - 137430 137312 119 1 2 83 74 36 0.296 1.88 5.07 Intr - 139391 139276 116 1 2 79 80 80 0.450 6.39 5.06 Intr - 163474 163348 127 2 1 65 47 69 0.001 0.24 5.05 Intr - 176033 175929 105 0 0 61 74 54 0.002 1.49 5.04 Intr - 178409 178266 144 0 0 70 68 81 0.002 4.55 5.03 Intr - 185247 185124 124 0 1 52 49 51 0.001 -2.24 5.02 Intr - 187032 186937 96 0 0 120 105 1 0.008 5.21 5.01 Init - 193350 193345 6 0 0 102 95 2 0.022 3.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 84700 84753 54 0 0 91 98 100 0.940 12.58 S.002 Intr + 182312 182358 47 2 2 88 119 47 0.924 5.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:40587882_40788850|GENSCAN_predicted_peptide_1|337_aa MGKKQNRKTGNSKKQSASPPPKERSSSPAMEQSWTENDFDELREGFRRSNYSELREDIQT KGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSAM EDEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDI IQENFPNLARQANVKIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVT LKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRNSYPAKLSFISEGEIKYFTDKQ MLRDFVTTRPALKELLKEVLNMERNNWYQLLQNHAKM >gi568815578r:40587882_40788850|GENSCAN_predicted_CDS_1|1014_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaatggaacaaagctggacggagaacgactttgac gagctgagagaaggcttcagacgatcaaattactctgagctacgggaggacattcaaacc aaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaataacc aatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaactacgt gaagaatgcagaagcctcaggagccgctgcgatcaactggaagaaagggtatcagcgatg gaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaaaga aatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctgatt ggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggatatt atccaggagaacttccccaatctagcaaggcaggccaacgttaagattcaggaaatacag agaactccacaaagatactcctcgagaagagcaactccaagacacataattgtcagattc accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttacc ctcaaagggaagcccatcagactaacagcggatctctcggcagaaaccctacaagccaga agagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaaattca tatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaa atgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagtgcta aacatggaaaggaacaactggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815578r:40587882_40788850|GENSCAN_predicted_peptide_2|354_aa MKAEIKMFFETNENKDTTYQNLWDAFKAVCTGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKR EKNQIDTIKNDKGDITTDPTQIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGIL PNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVG FIPGMQDWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNK >gi568815578r:40587882_40788850|GENSCAN_predicted_CDS_2|1065_bp atgaaggcagaaataaagatgttcttcgaaaccaacgagaacaaagacacaacataccag aatctctgggacgcattcaaagcagtgtgtacagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaatagatagaccactagcaagactaataaagaaaaaaaga gagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccactgatcccaca caaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctggacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatcaatagctta ccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaag gaggaactggtaccatttcttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcattctgataccaaagcctggcagagacacaacc aaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaa atactggcaaaacgaatccagcagcacatcaaaaagcttatacaccatgatcaagtgggc ttcatccctgggatgcaagactggttcaatatacgcaaatcaataaatgtaatccagcat ataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt gacaaaattcaacaacccttcatgctaaaaactctcaataaatga >gi568815578r:40587882_40788850|GENSCAN_predicted_peptide_3|190_aa MAILPKVIYRFNAIPIELPMPFFTDLEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDYLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMAT >gi568815578r:40587882_40788850|GENSCAN_predicted_CDS_3|573_bp atggccatactgcccaaggtaatttacagattcaatgccatccccatcgagctaccaatg cctttcttcacagatttggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattacctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacataa >gi568815578r:40587882_40788850|GENSCAN_predicted_peptide_4|382_aa MERAGNEAGSRGKPSQTESSTAPKKEALIQDARIFCGPLSQLGSVPPLLGSQSPRALSYA MVIWVPLEVNSTMKIEDCWGKVKEDNFGGEFGCGSAEEKEVRALVIITSKPAQPVPEDLM AWPGLAWPVPAPEAESLSVAPYAQGLAAAAAALKCFLLLLANPRSGDRPWHICWDAWLGK ESAVCSSTYVAALMEQMLDCTALDLTGPRLSFPGDPFPENRLVPVPPQRGADPQRTSPLG EPGREKRDSWSGQDESPEQRALWRRRIQSAQPPAHPPEIRGRKRRDSRRTRLQLLQPRAR SEGKPSSGGLLCPVRASGCLSVPVSECVSGCRRQPVYVGRVLTWKTPGPGGFSPGDVKDC KEPASRRERWERKRRMTVAGCC >gi568815578r:40587882_40788850|GENSCAN_predicted_CDS_4|1149_bp atggagcgggcagggaatgaagcaggcagcagaggaaagccctcccagacagagtcctca acagcaccaaagaaggaagccctcatccaagatgcaaggatcttctgtggacctctgtcc cagctgggctcagtgcctcctcttctaggctcacagagcccccgagctttgtcctatgcc atggtcatttgggtccctctggaagtaaactctacaatgaagattgaagattgttgggga aaagttaaagaagacaactttgggggtgaatttggatgtggctctgcagaagagaaggaa gtgagagccctggtaataatcacctccaaaccggcccaaccggtcccagaggacctgatg gcctggcctggcctggcctggcctgtccctgccccagaggcagagagcctttctgtggct ccttatgcacaaggcctggcagcagcagcggcagcattgaaatgcttccttctgctgctc gcgaatccgaggagtggggaccgaccttggcacatttgctgggatgcctggctaggcaag gagagtgccgtgtgctccagcacgtatgtggccgccctcatggagcaaatgctggactgc acagccctggatctgacagggcccaggctcagcttccccggagatcccttcccggagaac cgactggtgccggttcccccgcagcggggcgccgatccccagcgaacctcgccactgggg gaacctggaagggaaaagcgggatagttggagcggtcaagacgagagcccggagcagcgg gccctgtggcggcggcggatccagtccgcacagcctcccgcacaccccccagagatccgg ggtcgcaagaggcgggattcgcgccggactagactgcagctgctccagccacgggcgcgg tccgaagggaagccctcgtccgggggcctgctgtgccccgtgcgtgcgtccgggtgtctg agcgtgcctgtctctgagtgcgtctccgggtgcaggcgccagcctgtctacgtgggacgg gtgctgacttggaagactcccggccctggaggattttcccccggggatgtgaaggattgc aaggagcccgcgagccgcagggagcggtgggaacggaaaaggaggatgacggtggcgggg tgctgctag >gi568815578r:40587882_40788850|GENSCAN_predicted_peptide_5|963_aa MPINSLPFYTLILVGSNGYAAIILAAPTLSGLFLVSIKSPSSSAHWSLSLQVNLLFFEEK CTPYLNHRVMDEEFGGQKQATHTTAGDGPKDIICEGLTNLLWELSVCCPYSAELSLNPPQ GLVGPCPLTLLAAHEAEITLTHFTDEKTEAQRYLIIFPGTFSESWTHLEAYIFPPVYKDI LKAFLLNGCMLHSGSKMWNPRGFLNQDRKEYSRMSPIGATPERGTKERQWAGQDNKLDSG LKTSTLHAIRHVYSCTNTFTHTQRCALTCTDTFTRTQTQKQKHWGWGGHGQQVWPPGCPM RTAAKEDTGLEVDSYGTFFGPSPLEACIAVEMESGQKQPLEMNCTQTGRHGGVIDTGAYF SGFVKLYLRNGRFNKRLTLLPSPEAEFGGGHRYNRCSGQSVYIRCYENMRQPAGRLPGGG NAQEKSTCVAAGKCLFQGSCKPNPYIKRELEKYLEQKLLTVHPCCMLALILNQEMALNSR SEHRNRGCQHSNQKQSLTGRETPPRSPTTCNPQGNNAEGERRTFGLNEDARNAALVSRRP SGPGSSRRAPPNAGARNFNGATRAELAARGSRLSTKTGPEDLYIARGEPVLGSGAHIVCV LGWLGLGQLFSALPPRAWLGALRPAAKFPGRQRRLRLASAMAAELSMGPELPTSPLAMEY VNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTPLSTPCSSVPSSPSFSPTEQKT HLEDLYWMASNYQQMNPEALNLTPEDAVEALIGSHPVPQPLQSFDSFRGAHHHHHHHHPH PHHAYPGAGVAHDELGPHAHPHHHHHHQASPPPSSAASPAQQLPTSHPGPGPHATASATA AGGNGSVEDRFSDDQLVSMSVRELNRHLRGFTKDEVIRLKQKRRTLKNRGYAQSCRYKRV QQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLANSGFREAGSTSDSPSSPE FFL >gi568815578r:40587882_40788850|GENSCAN_predicted_CDS_5|2892_bp atgcctataaattctcttcccttttacactctgattttagttgggagcaatggctatgcc gctatcattcttgcagcccctactctctcagggctgttcctggtgagcattaagagtccc agttctagtgcccattggtccctgtccctgcaagtcaatttgctcttctttgaagagaaa tgcaccccctatttaaatcacagagtcatggatgaggaatttgggggacagaaacaagcc actcacacaaccgctggggacggacccaaagacatcatctgtgagggcctcactaacctg ctgtgggaactgtctgtctgctgcccctactctgctgaattatctttaaatccaccccag ggcttggtaggtccatgtcctctgaccctcctggcagcccacgaggcagagattactctt acccattttacggatgagaaaactgaggctcagagatatctgatcatcttcccaggtaca ttcagcgagtcatggacccacctcgaagcttacattttcccacctgtctacaaggatatc ttgaaagcatttctgctgaatggatgcatgcttcattctggatccaaaatgtggaacccc agaggatttctcaaccaagacagaaaagagtacagcagaatgtctcctattggagctaca ccagagagagggaccaaggaaaggcagtgggctggccaggacaacaagctggattcagga ctcaagacatccacattacatgcaatccggcatgtatactcgtgcacaaacaccttcact catacacagagatgtgcactcacatgcacagataccttcactcgcacacaaacacagaaa cagaagcactgggggtggggaggacatgggcagcaggtgtggccccctggctgtcccatg aggactgctgctaaggaagacacaggcctggaagtggattcttatggcaccttctttggc cccagccccttggaggcgtgcattgcagtggaaatggaatcaggacagaaacagccactt gaaatgaactgcacccagacaggccgccatggtggcgtgattgatactggagcatacttt tcagggtttgtgaagctgtatctgcgtaatggcagatttaacaagcgtttgactctcctg ccttcaccagaggctgagtttggtggtggacacagatataacagatgctctgggcaaagt gtctatataaggtgctatgaaaacatgcgccagccagcaggaaggcttcctggaggaggt aatgcccaagagaaaagcacatgtgtggctgctggaaagtgcctatttcagggaagctgc aaacccaacccttacatcaaaagggaacttgaaaaatacttggagcaaaaactgctaact gtgcatccttgctgtatgctggcgctgatattaaatcaggaaatggcactgaacagccgc tccgaacacagaaacaggggttgtcaacacagcaaccaaaagcaatctctaacgggcaga gagacgcctcctcgctcaccaaccacctgcaaccctcagggaaataacgccgaaggtgaa cggcgcaccttcgggctgaatgaggatgccaggaatgccgctctggtttccaggcgacct tccggcccagggagcagccgacgggcacccccgaacgctggggcacggaatttcaacggg gcgaccagagcggagctcgcggctcgaggctcgcggctgagtacgaagacgggacccgag gacctgtacatcgctcgaggggaacctgtcctgggctctggtgcacacatagtatgcgtt ctgggctggcttgggttgggccagctcttctccgctcttcccccccgcgcttggctcggc gcgctccggccggccgcaaagtttcccgggcggcagcggcggctgcgcctcgcttcagcg atggccgcggagctgagcatggggccagagctgcccaccagcccgctggccatggagtat gtcaacgacttcgacctgctcaagttcgacgtgaagaaggagccactggggcgcgcggag cgtccgggcaggccctgcacacgcctgcagccagccggctcggtgtcctccacaccgctc agcactccgtgtagctccgtgccctcgtcgcccagcttcagcccgaccgaacagaagaca cacctcgaggatctgtactggatggcgagcaactaccagcagatgaaccccgaggcgctc aacctgacgcccgaggacgcggtggaagcgctcatcggctcgcacccagtgccacagccg ctgcaaagcttcgacagctttcgcggcgctcaccaccaccaccatcaccaccaccctcac ccgcaccacgcgtacccgggcgccggcgtggcccacgacgagctgggcccgcacgctcac ccgcaccatcaccatcatcaccaagcgtcgccgccgccgtccagcgccgctagcccggcg caacagctgcccactagccaccccgggcccgggccgcacgcgacggcctcggcgacggcg gcgggcggcaacggcagcgtggaggaccgcttctccgacgaccagctcgtgtccatgtcc gtgcgcgagctgaaccgccacctgcggggcttcaccaaggacgaggtgatccgcctgaag cagaagcggcggaccctgaagaaccggggctacgcccagtcttgcaggtataaacgcgtc cagcagaagcaccacctggagaatgagaagacgcagctcattcagcaggtggagcagctt aagcaggaggtgtcccggctggcccgcgagagagacgcctacaaggtcaagtgcgagaaa ctcgccaactccggcttcagggaggcgggctccaccagcgacagcccctcctctcccgag ttctttctgtga