GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:50:31

Sequence gi568815578r:383964_608551 : 224588 bp : 45.89% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4048   4338  291  0  0   95  102   213 0.610  20.73
 1.02 Intr +   7324   7616  293  0  2  104  105   391 0.978  38.33
 1.03 Term +  12235  12727  493  1  1  106   48   566 0.998  48.48
 1.04 PlyA +  13574  13579    6                               1.05

 2.00 Prom +  19827  19866   40                              -7.06
 2.01 Init +  23628  23708   81  2  0   45   82    91 0.464   5.17
 2.02 Intr +  24039  24253  215  2  2   85   35    17 0.131  -6.39
 2.03 Intr +  24728  24816   89  2  2   98   86    35 0.464   3.91
 2.04 Intr +  25867  26062  196  2  1   63   94   164 0.761  12.97
 2.05 Intr +  33563  33656   94  2  1  101  101   122 0.988  14.77
 2.06 Intr +  33769  33967  199  0  1  122  100   295 0.999  33.02
 2.07 Intr +  35384  35505  122  0  2  103  119    89 0.999  13.71
 2.08 Intr +  35595  35768  174  2  0   84   80   236 0.918  22.54
 2.09 Intr +  36908  37068  161  1  2   64   80   285 0.964  24.09
 2.10 Intr +  38164  38275  112  1  1  141   91   203 0.987  26.28
 2.11 Intr +  43350  43529  180  2  0   18   81   172 0.933   9.56
 2.12 Intr +  44528  44626   99  1  0   66  100   181 0.948  17.41
 2.13 Intr +  44988  45131  144  2  0   70   84   286 0.894  26.88
 2.14 Intr +  46387  46422   36  0  0  109   61    42 0.515   2.06
 2.15 Term +  51066  51200  135  2  0   15   48   111 0.327  -2.18
 2.16 PlyA +  51216  51221    6                              -0.45

 3.23 PlyA -  51535  51530    6                               1.05
 3.22 Term -  54878  54623  256  1  1  132   48   240 0.990  19.46
 3.21 Intr -  55332  55145  188  0  2   94   83   132 0.917  11.79
 3.20 Intr -  56426  56285  142  1  1  135   66    66 0.901   9.46
 3.19 Intr -  57726  57625  102  2  0   97   73   112 0.980   9.89
 3.18 Intr -  58080  57894  187  1  1   90   81   227 0.927  20.95
 3.17 Intr -  61167  61087   81  1  0   83   85    94 0.990   8.21
 3.16 Intr -  64111  63926  186  2  0  114  110   233 0.998  27.76
 3.15 Intr -  67745  67693   53  1  2   30   37    36 0.002  -8.75
 3.14 Intr -  78011  77891  121  0  1   71   50   159 0.051   9.95
 3.13 Intr -  78588  78373  216  1  0  -25   96   214 0.019   9.38
 3.12 Intr - 102499 102413   87  2  0   85   82    86 0.971   7.64
 3.11 Intr - 103612 103464  149  0  2   95   65   175 0.998  15.78
 3.10 Intr - 104815 104715  101  1  2   46  111    93 0.999   6.31
 3.09 Intr - 105918 105817  102  0  0   53   90    45 0.737   1.57
 3.08 Intr - 108401 108291  111  2  0   75  110    78 0.998   9.28
 3.07 Intr - 111839 111756   84  2  0  126   98    37 0.996   8.52
 3.06 Intr - 113817 113758   60  0  0   57  115    99 0.995   8.43
 3.05 Intr - 115342 115292   51  1  0   87   70    34 0.586   0.60
 3.04 Intr - 115971 115870  102  0  0  103   89    74 0.959   9.37
 3.03 Intr - 121266 121155  112  2  1  115  113   129 0.999  18.48
 3.02 Intr - 158992 158967   26  1  2   93   64    14 0.035  -3.68
 3.01 Init - 159927 159709  219  0  0  101   82   212 0.958  18.53
 3.00 Prom - 160262 160223   40                              -3.96

 4.05 PlyA - 160445 160440    6                              -0.45
 4.04 Term - 162294 162281   14  1  2  110   54     4 0.029  -2.44
 4.03 Intr - 175473 175372  102  1  0   54   30   143 0.459   5.25
 4.02 Intr - 192589 192509   81  2  0   82   92    27 0.165   2.11
 4.01 Init - 195207 195075  133  0  1   78   47    77 0.127   2.90
 4.00 Prom - 199488 199449   40                              -3.56

 5.04 PlyA - 200711 200706    6                               1.05
 5.03 Term - 205242 205211   32  1  2  135   38    24 0.639   0.12
 5.02 Intr - 213951 213819  133  0  1   46   96    60 0.197   2.82
 5.01 Intr - 220702 220667   36  1  0  128  100    43 0.843   7.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:383964_608551|GENSCAN_predicted_peptide_1|358_aa
MRATPLAAPAGSLSRKKRLELDDNLDTERPVQKRARSGPQPRLPPCLLPLSPPTAPDRAT
AVATASRLGPYVLLEPEEGGRAYQALHCPTGTEYTCKVYPVQEALAVLEPYARLPPHKHV
ARPTEVLAGTQLLYAFFTRTHGDMHSLVRSRHRIPEPEAAVLFRQMATALAHCHQHGLVL
RDLKLCRFVFADRERKKLVLENLEDSCVLTGPDDSLWDKHACPAYVGPEILSSRASYSGK
AADVWSLGVALFTMLAGHYPFQDSEPVLLFGKIRRGAYALPAGLSAPARCLVRCLLRREP
AERLTATGILLHPWLRQDPMPLAPTRSHLWEAAQVVPDGLGLDEAREEEGDREVVLYG

>gi568815578r:383964_608551|GENSCAN_predicted_CDS_1|1077_bp
atgcgagccacccctctggctgctcctgcgggttccctgtccaggaagaagcggttggag
ttggatgacaacttagataccgagcgtcccgtccagaaacgagctcgaagtgggccccag
cccagactgcccccctgcctgttgcccctgagcccacctactgctccagatcgtgcaact
gctgtggccactgcctcccgtcttgggccctatgtcctcctggagcccgaggagggcggg
cgggcctaccaggccctgcactgccctacaggcactgagtatacctgcaaggtgtacccc
gtccaggaagccctggccgtgctggagccctatgcgcggctgcccccgcacaagcatgtg
gctcggcccactgaggtcctggctggtacccagctcctctacgcctttttcactcggacc
catggggacatgcacagcctggtgcgaagccgccaccgtatccctgagcctgaggctgcc
gtgctcttccgccagatggccaccgccctggcgcactgtcaccagcacggtctggtcctg
cgtgatctcaagctgtgtcgctttgtcttcgctgaccgtgagaggaagaagctggtgctg
gagaacctggaggactcctgcgtgctgactgggccagatgattccctgtgggacaagcac
gcgtgcccagcctacgtgggacctgagatactcagctcacgggcctcatactcgggcaag
gcagccgatgtctggagcctgggcgtggcgctcttcaccatgctggccggccactacccc
ttccaggactcggagcctgtcctgctcttcggcaagatccgccgcggggcctacgccttg
cctgcaggcctctcggcccctgcccgctgtctggttcgctgcctccttcgtcgggagcca
gctgaacggctcacagccacaggcatcctcctgcacccctggctgcgacaggacccgatg
cccttagccccaacccgatcccatctctgggaggctgcccaggtggtccctgatggactg
gggctggacgaagccagggaagaggagggagacagagaagtggttctgtatggctag

>gi568815578r:383964_608551|GENSCAN_predicted_peptide_2|678_aa
MRQTNANFTNSSTKRQTVTAERQRRIWPPAPPAPPPAPPRPRPPRLRPSSFPRLCDAARR
VTRAGVRGPAISPGGRGGVAPNRSAGTAGTPHSASAAAGRPGGSISQEARSPPGGWAQPR
QMDEKTKKAPENKPCANWLLLYWLSAEEMALSLTRAVAGGDEQVAMKCAIWLAEQRVPLS
VQLKPEVSPTQDIRLWVSVEDAQMHTVTIWLTVRPDMTVASLKDMVFLDYGFPPVLQQWV
IGQRLARDQETLHSHGVRQNGDSAYLYLLSARNTSLNPQELQRERQLRMLEDLGFKDLTL
QPRGPLEPGPPKPGVPQEPGRGQPDAVPEPPPVGWQCPGCTFINKPTRPGCEMCCRARPE
AYQVPASYQPDEEERARLAGEEEALRQYQQRKQQQQEGNYLQHVQLDQRSLVLNTEPAEC
PVCYSVLAPGEAVVLRECLHTFCRECLQGTIRNSQEAEVSCPFIDNTYSCSGKLLEREIK
ALLTPEDYQRFLDLGISIAENRSAFSYHCKTPDCKGWCFFEDDVNEFTCPVCFHVNCLLC
KAIHEQMNCKEYQEDLALRAQNDVAARQTTEMLKVMLQQGEAMRCPQCQIVVQKKDGCDW
IRCTVCHTEICWVTKGPRWGPGGPGDTSGGCRCRKGHVEKRHIKLWLGSKTKCQLLGSPD
QGVKHTKSTPDPHQLCRL

>gi568815578r:383964_608551|GENSCAN_predicted_CDS_2|2037_bp
atgaggcagacgaatgcaaacttcaccaatagctcaacgaaaagacagacggtgaccgct
gagaggcagcgtcggatctggcccccggcccccccggccccgcccccggccccgccccgg
ccccgcccaccccgcctgcggcccagctccttcccgcggctctgcgatgcggcccgcagg
gtgacccgggcgggagtccggggacccgcgatcagccccggaggacggggtggggtcgcc
ccaaacaggagcgccgggaccgctgggaccccgcactcggcgtccgccgccgccgggcga
cccggaggtagcatttcccaggaggcacggtcccccccagggggatgggcacagccacgc
cagatggacgagaagaccaagaaagctcctgaaaataaaccctgtgctaactggctcctg
ctgtactggctttcagcagaggaaatggccctgagcctcacccgagcagtggcgggcggg
gatgaacaggtggcaatgaagtgtgccatctggctggcagagcaacgggtgcccctgagt
gtgcaactgaagcctgaggtctccccaacgcaggacatcaggctgtgggtgagcgtggag
gatgctcagatgcacaccgtcaccatctggctcacagtgcgccctgatatgacagtggcg
tctctcaaggacatggtttttctggactatggcttcccaccagtcttgcagcagtgggtg
attgggcagcggctggcacgagaccaggagaccctgcactcccatggggtgcggcagaat
ggggacagtgcctacctctatctgctgtcagcccgcaacacctccctcaaccctcaggag
ctgcagcgggagcggcagctgcggatgctggaagatctgggcttcaaggacctcacgctg
cagccgcggggccctctggagccaggccccccaaagcccggggtcccccaggaacccgga
cgggggcagccagatgcagtgcctgagcccccaccggtgggctggcagtgccccgggtgc
accttcatcaacaagcccacgcggcctggctgtgagatgtgctgccgggcgcgccccgag
gcctaccaggtccccgcctcataccagcccgacgaggaggagcgagcgcgcctggcgggc
gaggaggaggcgctgcgtcagtaccagcagcggaagcagcagcagcaggaggggaactac
ctgcagcacgtccagctggaccagaggagcctggtgctgaacacggagcccgccgagtgc
cccgtgtgctactcggtgctggcgcccggcgaggccgtggtgctgcgtgagtgtctgcac
accttctgcagggagtgcctgcagggcaccatccgcaacagccaggaggcggaggtctcc
tgccccttcattgacaacacctactcgtgctcgggcaagctgctggagagggagatcaag
gcgctcctgacccctgaggattaccagcgatttctagacctgggcatctccattgctgaa
aaccgcagtgccttcagctaccattgcaagaccccagattgcaagggatggtgcttcttt
gaggatgatgtcaatgagttcacctgccctgtgtgtttccacgtcaactgcctgctctgc
aaggccatccatgagcagatgaactgcaaggagtatcaggaggacctggccctgcgggct
cagaacgatgtggctgcccggcagacgacagagatgctgaaggtgatgctgcagcagggc
gaggccatgcgctgcccccagtgccagatcgtggtacagaagaaggacggctgcgactgg
atccgctgcaccgtctgccacaccgagatctgctgggtcaccaagggcccacgctggggc
cctgggggcccaggagacaccagcgggggctgccgctgcaggaaagggcatgtggagaag
cggcacatcaagctctggttaggatccaaaaccaaatgccagctcctaggatccccagac
cagggtgtgaaacacaccaagagcaccccagacccgcaccagttgtgtcgactgtga

>gi568815578r:383964_608551|GENSCAN_predicted_peptide_3|911_aa
MAAFREIPLGAESEAVSPRLLVGGGFRFRQQRLQPRSGPCGWRPSRVSPPPSPPYCLCEQ
RGERPPPLPLPPQCWNIARLSRNQDDYQLVRKLGRGKYSEVFEAINITNNEKVVVKILKP
VKKKKIKREIKILENLRGGPNIITLADIVKDPVSRTPALVFEHVNNTDFKQLYQTLTDYD
IRFYMYEILKALDYCHSMGIMHRDVKPHNVMIDHEHRKLRLIDWGLAEFYHPGQEYNVRV
ASRYFKGPELLVDYQMYDYSLDMWSLGCMLASMIFRKEPFFHGHDNYDQLVRIAKVLGTE
DLYDYIDKYNIELDPRFNDILGRHSRKRWERFVHSENQHLVSPEALDFLDKLLRYDHQSR
LTAREAMEHPYFYTVVKDQARMGSSSMPGGSTPVSSANMMSAGRLRACAPLADGGEPADA
ERVLRPAVGAASSPGSATSRGSGGGPGPRGMALRSAQGDGPTSGHWDGGAEKAGSRSRLP
EGYQLSRTAFTVAGLIDALPAYRRCNYHTVLPLRMGFAGKGRRYLEAIGDKNFNAKRKKK
VAEIHQALNSDPTDVAALRRMAISEGGLLTDEIRRKVWPKLLNVNANDPPPISGKNLRQM
SKDYQQVLLDVRRSLRRFPPGMPEEQREGLQEELIDIILLILERNPQLHYYQGYHDIVVT
FLLVVGERLATSLVEKLSTHHLRDFMDPTMDNTKHILNYLMPIIDQVNPELHDFMQSAEV
GTIFALSWLITWFGHVLSDFRHVVRLYDFFLACHPLMPIYFAAVIVLYREQEVLDCDCDM
ASVHHLLSQIPQDLPYETLISRAGDLFVQFPPSELAREAAAQQQAERTAASTFKDFELAS
AQQRPDMVLRQRFRGLLRPEDRTKDVLTKPRTNRFVKLAVMGLTVALGAAALAVVKSALE
WAPKFQLQLFP

>gi568815578r:383964_608551|GENSCAN_predicted_CDS_3|2736_bp
atggccgcattcagagagattcctctgggggcggagtcggaagctgtctcgccccgcctc
ctggtaggagggggtttccgcttccggcagcagcggctgcagcctcgctctggtccctgc
ggctggcggccgagccgtgtgtctcctcctccatcgccgccatattgtctgtgtgagcag
aggggagagcggccgccgccgctgccgcttccaccacagtgttggaacattgctagatta
agcagaaatcaagatgactaccagctggttcgaaaattaggccgaggtaaatacagtgaa
gtatttgaagccatcaacatcacaaataatgaaaaagttgttgttaaaattctcaagcca
gtaaaaaagaagaaaattaagcgtgaaataaagattttggagaatttgagaggaggtccc
aacatcatcacactggcagacattgtaaaagaccctgtgtcacgaacccccgccttggtt
tttgaacacgtaaacaacacagacttcaagcaattgtaccagacgttaacagactatgat
attcgattttacatgtatgagattctgaaggccctggattattgtcacagcatgggaatt
atgcacagagatgtcaagccccataatgtcatgattgatcatgagcacagaaagctacga
ctaatagactggggtttggctgagttttatcatcctggccaagaatataatgtccgagtt
gcttcccgatacttcaaaggtcctgagctacttgtagactatcagatgtacgattatagt
ttggatatgtggagtttgggttgtatgctggcaagtatgatctttcggaaggagccattt
ttccatggacatgacaattatgatcagttggtgaggatagccaaggttctggggacagaa
gatttatatgactatattgacaaatacaacattgaattagatccacgtttcaatgatatc
ttgggcagacactctcgaaagcgatgggaacgctttgtccacagtgaaaatcagcacctt
gtcagccctgaggccttggatttcctggacaaactgctgcgatatgaccaccagtcacgg
cttactgcaagagaggcaatggagcacccctatttctacactgttgtgaaggaccaggct
cgaatgggttcatctagcatgccagggggcagtacgcccgtcagcagcgccaatatgatg
tcagcggggcgcctgcgcgcctgcgcgccgctggccgacggaggggagcctgccgatgcc
gagcgggtgctacgtcccgcggtcggagccgcgtcttctcccggctccgccaccagccgg
ggctcgggtgggggcccggggccccggggcatggccctccggagtgcgcagggcgacggc
cccacctccggccactgggacggcggcgcggagaaggcaggctcccggagccgcctcccc
gaggggtatcagctctcccgaactgcattcaccgtcgccggcttgattgacgcgctgccc
gcgtaccgccgatgtaattaccacacagttcttcctttaagaatgggctttgcaggaaaa
ggaagaaggtatctggaagcaataggagacaaaaactttaacgccaaaaggaaaaagaaa
gtggcagagatacaccaggctctgaacagtgatcccactgatgtggctgcccttagacgc
atggctatcagtgaaggagggctcctgactgatgagatcagacgaaaagtgtggcccaag
ctcctcaatgtcaatgccaatgacccacctcctatatcagggaagaacctacggcagatg
agcaaggactaccaacaagtgttgctggacgtccggcggtcattgcggcggttccctcct
ggcatgccagaggaacagagagaagggctccaggaagaactgattgacatcatcctcctc
atcttggagcgcaaccctcagctgcactactaccagggctaccatgacattgtggtcaca
tttctgctggtggtaggcgagaggctggcaacatccctggtagaaaaattatctacccac
cacctcagggattttatggatccaacaatggacaacaccaagcatatattaaactatctg
atgcccatcattgaccaggtgaatccagagctccatgacttcatgcagagtgctgaggta
gggaccatctttgccctcagctggctcatcacctggtttgggcatgtcctgtctgacttc
aggcacgtcgtgcggttatatgacttcttcctggcctgccacccactgatgccgatttac
tttgcagccgtgattgtgttgtatcgcgagcaggaagtcctggactgtgactgtgacatg
gcctcggtccaccacctgttgtcccagatccctcaggacttgccctatgagacactgatc
agcagagcaggagacctttttgttcagtttcccccatccgaacttgctcgggaggccgct
gcccaacagcaagctgagaggacggcagcctctactttcaaagactttgagctggcatca
gcccagcagaggcctgatatggtgctgcggcagcggtttcggggacttctgcggcctgaa
gatcgaacaaaagatgtcctgaccaagccaaggaccaaccgctttgtgaaattggcagtg
atggggctgacagtggcacttggagcggctgcactggctgtggtgaaaagtgccctggaa
tgggcccctaagtttcagctgcagctgtttccctga

>gi568815578r:383964_608551|GENSCAN_predicted_peptide_4|109_aa
MEYYAAIKKDVFMSFVETWMKLEAIILSKLSQGQKTKHRMFSLIDFGSNDSSSKRLSLPT
LFGSFIMDITIYPKPGKSNIQELADSMSGEDRFLVHRWLLTVSSHERLA

>gi568815578r:383964_608551|GENSCAN_predicted_CDS_4|330_bp
atggaatactatgcagccataaaaaaggatgtgttcatgtcctttgtagagacatggatg
aagctggaagccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcatagactttggctccaatgacagctcttcaaagaggctttctctgcctacc
ctatttggttccttcataatggacatcacaatatatccgaagcctggcaagtccaacatc
caggagctggcagattccatgtctggtgaagaccgcttcctggttcatagatggcttctc
accgtgtcctcacatgaaagactagcctga

>gi568815578r:383964_608551|GENSCAN_predicted_peptide_5|66_aa
GGRRDLGGSCLKMRKRRHKEVKELGQGDTVAGGFESDVLAAEPHWALNHHPHMTSPGPPV
LFAFSP

>gi568815578r:383964_608551|GENSCAN_predicted_CDS_5|201_bp
ggtggccgtcgtgacctggggggcagctgcttgaagatgaggaaacggaggcacaaggag
gttaaagaacttggccaaggtgacacagtggctggtggatttgaatcagatgttctagct
gcagagccccactgggctctcaaccaccacccccacatgacctcaccaggtcccccggtc
ctttttgccttcagtccatga