GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:35:00 Sequence gi568815578r:338589_562405 : 223817 bp : 46.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11050 11244 195 0 0 94 116 201 0.633 23.11 1.02 Term + 14622 15047 426 2 0 131 49 473 0.999 43.00 1.03 PlyA + 15985 15990 6 1.05 2.06 PlyA - 16775 16770 6 1.05 2.05 Term - 18827 18722 106 1 1 47 54 119 0.122 2.28 2.04 Intr - 27152 27052 101 2 2 63 78 44 0.024 -0.19 2.03 Intr - 30136 30040 97 0 1 70 37 94 0.039 2.51 2.02 Intr - 30468 30372 97 1 1 42 80 86 0.569 2.27 2.01 Init - 31711 31660 52 1 1 108 89 -28 0.405 0.63 2.00 Prom - 37901 37862 40 -2.16 3.00 Prom + 41686 41725 40 -2.16 3.01 Init + 43068 43070 3 2 0 67 81 0 0.095 -2.80 3.02 Intr + 49423 49713 291 0 0 95 102 213 0.476 20.73 3.03 Intr + 52699 52991 293 0 2 104 105 391 0.978 38.33 3.04 Term + 57610 58102 493 1 1 106 48 566 0.998 48.48 3.05 PlyA + 58949 58954 6 1.05 4.00 Prom + 65202 65241 40 -7.06 4.01 Init + 69003 69083 81 2 0 45 82 91 0.464 5.17 4.02 Intr + 69414 69628 215 2 2 85 35 17 0.131 -6.39 4.03 Intr + 70103 70191 89 2 2 98 86 35 0.464 3.91 4.04 Intr + 71242 71437 196 2 1 63 94 164 0.761 12.97 4.05 Intr + 78938 79031 94 2 1 101 101 122 0.988 14.77 4.06 Intr + 79144 79342 199 0 1 122 100 295 0.999 33.02 4.07 Intr + 80759 80880 122 0 2 103 119 89 0.999 13.71 4.08 Intr + 80970 81143 174 2 0 84 80 236 0.918 22.54 4.09 Intr + 82283 82443 161 1 2 64 80 285 0.964 24.09 4.10 Intr + 83539 83650 112 1 1 141 91 203 0.987 26.28 4.11 Intr + 88725 88904 180 2 0 18 81 172 0.933 9.56 4.12 Intr + 89903 90001 99 1 0 66 100 181 0.948 17.41 4.13 Intr + 90363 90506 144 2 0 70 84 286 0.894 26.88 4.14 Intr + 91762 91797 36 0 0 109 61 42 0.515 2.06 4.15 Term + 96441 96575 135 2 0 15 48 111 0.327 -2.18 4.16 PlyA + 96591 96596 6 -0.45 5.23 PlyA - 96910 96905 6 1.05 5.22 Term - 100253 99998 256 1 1 132 48 240 0.990 19.46 5.21 Intr - 100707 100520 188 0 2 94 83 132 0.917 11.79 5.20 Intr - 101801 101660 142 1 1 135 66 66 0.901 9.46 5.19 Intr - 103101 103000 102 2 0 97 73 112 0.980 9.89 5.18 Intr - 103455 103269 187 1 1 90 81 227 0.927 20.95 5.17 Intr - 106542 106462 81 1 0 83 85 94 0.990 8.21 5.16 Intr - 109486 109301 186 2 0 114 110 233 0.998 27.76 5.15 Intr - 113120 113068 53 1 2 30 37 36 0.002 -8.75 5.14 Intr - 123386 123266 121 0 1 71 50 159 0.051 9.95 5.13 Intr - 123963 123748 216 1 0 -25 96 214 0.019 9.38 5.12 Intr - 147874 147788 87 2 0 85 82 86 0.971 7.64 5.11 Intr - 148987 148839 149 0 2 95 65 175 0.998 15.78 5.10 Intr - 150190 150090 101 1 2 46 111 93 0.999 6.31 5.09 Intr - 151293 151192 102 0 0 53 90 45 0.737 1.57 5.08 Intr - 153776 153666 111 2 0 75 110 78 0.998 9.28 5.07 Intr - 157214 157131 84 2 0 126 98 37 0.996 8.52 5.06 Intr - 159192 159133 60 0 0 57 115 99 0.995 8.43 5.05 Intr - 160717 160667 51 1 0 87 70 34 0.586 0.60 5.04 Intr - 161346 161245 102 0 0 103 89 74 0.959 9.37 5.03 Intr - 166641 166530 112 2 1 115 113 129 0.999 18.48 5.02 Intr - 204367 204342 26 1 2 93 64 14 0.035 -3.68 5.01 Init - 205302 205084 219 0 0 101 82 212 0.958 18.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 220848 220747 102 1 0 54 30 143 0.827 5.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:338589_562405|GENSCAN_predicted_peptide_1|206_aa GSMMPSCNRSCSCSRGPSVEDGKWYGVRSYLHLFYEDCAGTALSDDPEGPPVLCPRRPWP SLCWKISLSSGTLLLLLGVAALTTGYAVPPKLEGIGEGEFLVLDQRAADYNQALGTCRLA GTALCVAAGVLLAICLFWAMIGWLSQDTKAEPLDPEADSHVEVFGDEPEQQLSPIFRNAS GQSWFSPPASPFGQSSVQTIQPKRDS >gi568815578r:338589_562405|GENSCAN_predicted_CDS_1|621_bp gggtccatgatgccgagctgcaatcgttcctgcagctgcagccgcggccccagcgtggag gatggcaagtggtatggggtccgctcctacctgcacctcttctatgaggactgtgcaggc actgctctcagcgacgaccctgagggacctccggtcctgtgcccccgccggccctggccc tcactgtgttggaagatcagcctgtcctcggggaccctgcttctgctgctgggtgtggcg gctctgaccactggctatgcagtgccccccaagctggagggcatcggtgagggtgagttc ctggtgttggatcagcgggcagccgactacaaccaggccctgggcacctgtcgcctggca ggcacagcgctctgtgtggcagctggagttctgctcgccatctgcctcttctgggccatg ataggctggctgagccaggacaccaaggcagagcccttggaccccgaagccgacagccac gtggaggtcttcggggatgagccagagcagcagttgtcacccattttccgcaatgccagt ggccagtcatggttctcgccacccgccagcccctttgggcaatcttctgtgcagactatc cagcccaagagggactcctga >gi568815578r:338589_562405|GENSCAN_predicted_peptide_2|150_aa MVKPHLYKKVQQLARHGGGANRIADGLDVEFEGKGGVKVLPRFLGLNGDRNGRSREPEKE ENQQRVEAWPPSEKKYIKEQGVPQRQMWKGSSLTVGTTQAALVGNECPISEVNQGRLLRE GLAEKVMFEQRLEGTAHVVNWGGVPQQREQ >gi568815578r:338589_562405|GENSCAN_predicted_CDS_2|453_bp atggtgaaacctcatctctacaaaaaagtacaacaactagccaggcatggtggtggagcc aacaggattgctgatggtctagatgtggagtttgagggaaaaggaggtgtcaaggtgctt ccaaggtttcttggcctgaacggtgacaggaatgggagaagcagggagcctgagaaggag gaaaaccagcagagagtggaggcctggccaccaagtgaaaagaagtacatcaaagagcag ggagtgccacaaaggcagatgtggaaggggagctctctcacagttggcaccacccaggct gccctggtgggcaatgagtgtcccatctctgaggtgaaccaggggaggctgctcagggaa ggccttgctgagaaggtgatgtttgagcaacgacttgaaggaactgcccatgtcgtcaac tggggaggagtaccccagcagagggaacagtga >gi568815578r:338589_562405|GENSCAN_predicted_peptide_3|359_aa MMRATPLAAPAGSLSRKKRLELDDNLDTERPVQKRARSGPQPRLPPCLLPLSPPTAPDRA TAVATASRLGPYVLLEPEEGGRAYQALHCPTGTEYTCKVYPVQEALAVLEPYARLPPHKH VARPTEVLAGTQLLYAFFTRTHGDMHSLVRSRHRIPEPEAAVLFRQMATALAHCHQHGLV LRDLKLCRFVFADRERKKLVLENLEDSCVLTGPDDSLWDKHACPAYVGPEILSSRASYSG KAADVWSLGVALFTMLAGHYPFQDSEPVLLFGKIRRGAYALPAGLSAPARCLVRCLLRRE PAERLTATGILLHPWLRQDPMPLAPTRSHLWEAAQVVPDGLGLDEAREEEGDREVVLYG >gi568815578r:338589_562405|GENSCAN_predicted_CDS_3|1080_bp atgatgcgagccacccctctggctgctcctgcgggttccctgtccaggaagaagcggttg gagttggatgacaacttagataccgagcgtcccgtccagaaacgagctcgaagtgggccc cagcccagactgcccccctgcctgttgcccctgagcccacctactgctccagatcgtgca actgctgtggccactgcctcccgtcttgggccctatgtcctcctggagcccgaggagggc gggcgggcctaccaggccctgcactgccctacaggcactgagtatacctgcaaggtgtac cccgtccaggaagccctggccgtgctggagccctatgcgcggctgcccccgcacaagcat gtggctcggcccactgaggtcctggctggtacccagctcctctacgcctttttcactcgg acccatggggacatgcacagcctggtgcgaagccgccaccgtatccctgagcctgaggct gccgtgctcttccgccagatggccaccgccctggcgcactgtcaccagcacggtctggtc ctgcgtgatctcaagctgtgtcgctttgtcttcgctgaccgtgagaggaagaagctggtg ctggagaacctggaggactcctgcgtgctgactgggccagatgattccctgtgggacaag cacgcgtgcccagcctacgtgggacctgagatactcagctcacgggcctcatactcgggc aaggcagccgatgtctggagcctgggcgtggcgctcttcaccatgctggccggccactac cccttccaggactcggagcctgtcctgctcttcggcaagatccgccgcggggcctacgcc ttgcctgcaggcctctcggcccctgcccgctgtctggttcgctgcctccttcgtcgggag ccagctgaacggctcacagccacaggcatcctcctgcacccctggctgcgacaggacccg atgcccttagccccaacccgatcccatctctgggaggctgcccaggtggtccctgatgga ctggggctggacgaagccagggaagaggagggagacagagaagtggttctgtatggctag >gi568815578r:338589_562405|GENSCAN_predicted_peptide_4|678_aa MRQTNANFTNSSTKRQTVTAERQRRIWPPAPPAPPPAPPRPRPPRLRPSSFPRLCDAARR VTRAGVRGPAISPGGRGGVAPNRSAGTAGTPHSASAAAGRPGGSISQEARSPPGGWAQPR QMDEKTKKAPENKPCANWLLLYWLSAEEMALSLTRAVAGGDEQVAMKCAIWLAEQRVPLS VQLKPEVSPTQDIRLWVSVEDAQMHTVTIWLTVRPDMTVASLKDMVFLDYGFPPVLQQWV IGQRLARDQETLHSHGVRQNGDSAYLYLLSARNTSLNPQELQRERQLRMLEDLGFKDLTL QPRGPLEPGPPKPGVPQEPGRGQPDAVPEPPPVGWQCPGCTFINKPTRPGCEMCCRARPE AYQVPASYQPDEEERARLAGEEEALRQYQQRKQQQQEGNYLQHVQLDQRSLVLNTEPAEC PVCYSVLAPGEAVVLRECLHTFCRECLQGTIRNSQEAEVSCPFIDNTYSCSGKLLEREIK ALLTPEDYQRFLDLGISIAENRSAFSYHCKTPDCKGWCFFEDDVNEFTCPVCFHVNCLLC KAIHEQMNCKEYQEDLALRAQNDVAARQTTEMLKVMLQQGEAMRCPQCQIVVQKKDGCDW IRCTVCHTEICWVTKGPRWGPGGPGDTSGGCRCRKGHVEKRHIKLWLGSKTKCQLLGSPD QGVKHTKSTPDPHQLCRL >gi568815578r:338589_562405|GENSCAN_predicted_CDS_4|2037_bp atgaggcagacgaatgcaaacttcaccaatagctcaacgaaaagacagacggtgaccgct gagaggcagcgtcggatctggcccccggcccccccggccccgcccccggccccgccccgg ccccgcccaccccgcctgcggcccagctccttcccgcggctctgcgatgcggcccgcagg gtgacccgggcgggagtccggggacccgcgatcagccccggaggacggggtggggtcgcc ccaaacaggagcgccgggaccgctgggaccccgcactcggcgtccgccgccgccgggcga cccggaggtagcatttcccaggaggcacggtcccccccagggggatgggcacagccacgc cagatggacgagaagaccaagaaagctcctgaaaataaaccctgtgctaactggctcctg ctgtactggctttcagcagaggaaatggccctgagcctcacccgagcagtggcgggcggg gatgaacaggtggcaatgaagtgtgccatctggctggcagagcaacgggtgcccctgagt gtgcaactgaagcctgaggtctccccaacgcaggacatcaggctgtgggtgagcgtggag gatgctcagatgcacaccgtcaccatctggctcacagtgcgccctgatatgacagtggcg tctctcaaggacatggtttttctggactatggcttcccaccagtcttgcagcagtgggtg attgggcagcggctggcacgagaccaggagaccctgcactcccatggggtgcggcagaat ggggacagtgcctacctctatctgctgtcagcccgcaacacctccctcaaccctcaggag ctgcagcgggagcggcagctgcggatgctggaagatctgggcttcaaggacctcacgctg cagccgcggggccctctggagccaggccccccaaagcccggggtcccccaggaacccgga cgggggcagccagatgcagtgcctgagcccccaccggtgggctggcagtgccccgggtgc accttcatcaacaagcccacgcggcctggctgtgagatgtgctgccgggcgcgccccgag gcctaccaggtccccgcctcataccagcccgacgaggaggagcgagcgcgcctggcgggc gaggaggaggcgctgcgtcagtaccagcagcggaagcagcagcagcaggaggggaactac ctgcagcacgtccagctggaccagaggagcctggtgctgaacacggagcccgccgagtgc cccgtgtgctactcggtgctggcgcccggcgaggccgtggtgctgcgtgagtgtctgcac accttctgcagggagtgcctgcagggcaccatccgcaacagccaggaggcggaggtctcc tgccccttcattgacaacacctactcgtgctcgggcaagctgctggagagggagatcaag gcgctcctgacccctgaggattaccagcgatttctagacctgggcatctccattgctgaa aaccgcagtgccttcagctaccattgcaagaccccagattgcaagggatggtgcttcttt gaggatgatgtcaatgagttcacctgccctgtgtgtttccacgtcaactgcctgctctgc aaggccatccatgagcagatgaactgcaaggagtatcaggaggacctggccctgcgggct cagaacgatgtggctgcccggcagacgacagagatgctgaaggtgatgctgcagcagggc gaggccatgcgctgcccccagtgccagatcgtggtacagaagaaggacggctgcgactgg atccgctgcaccgtctgccacaccgagatctgctgggtcaccaagggcccacgctggggc cctgggggcccaggagacaccagcgggggctgccgctgcaggaaagggcatgtggagaag cggcacatcaagctctggttaggatccaaaaccaaatgccagctcctaggatccccagac cagggtgtgaaacacaccaagagcaccccagacccgcaccagttgtgtcgactgtga >gi568815578r:338589_562405|GENSCAN_predicted_peptide_5|911_aa MAAFREIPLGAESEAVSPRLLVGGGFRFRQQRLQPRSGPCGWRPSRVSPPPSPPYCLCEQ RGERPPPLPLPPQCWNIARLSRNQDDYQLVRKLGRGKYSEVFEAINITNNEKVVVKILKP VKKKKIKREIKILENLRGGPNIITLADIVKDPVSRTPALVFEHVNNTDFKQLYQTLTDYD IRFYMYEILKALDYCHSMGIMHRDVKPHNVMIDHEHRKLRLIDWGLAEFYHPGQEYNVRV ASRYFKGPELLVDYQMYDYSLDMWSLGCMLASMIFRKEPFFHGHDNYDQLVRIAKVLGTE DLYDYIDKYNIELDPRFNDILGRHSRKRWERFVHSENQHLVSPEALDFLDKLLRYDHQSR LTAREAMEHPYFYTVVKDQARMGSSSMPGGSTPVSSANMMSAGRLRACAPLADGGEPADA ERVLRPAVGAASSPGSATSRGSGGGPGPRGMALRSAQGDGPTSGHWDGGAEKAGSRSRLP EGYQLSRTAFTVAGLIDALPAYRRCNYHTVLPLRMGFAGKGRRYLEAIGDKNFNAKRKKK VAEIHQALNSDPTDVAALRRMAISEGGLLTDEIRRKVWPKLLNVNANDPPPISGKNLRQM SKDYQQVLLDVRRSLRRFPPGMPEEQREGLQEELIDIILLILERNPQLHYYQGYHDIVVT FLLVVGERLATSLVEKLSTHHLRDFMDPTMDNTKHILNYLMPIIDQVNPELHDFMQSAEV GTIFALSWLITWFGHVLSDFRHVVRLYDFFLACHPLMPIYFAAVIVLYREQEVLDCDCDM ASVHHLLSQIPQDLPYETLISRAGDLFVQFPPSELAREAAAQQQAERTAASTFKDFELAS AQQRPDMVLRQRFRGLLRPEDRTKDVLTKPRTNRFVKLAVMGLTVALGAAALAVVKSALE WAPKFQLQLFP >gi568815578r:338589_562405|GENSCAN_predicted_CDS_5|2736_bp atggccgcattcagagagattcctctgggggcggagtcggaagctgtctcgccccgcctc ctggtaggagggggtttccgcttccggcagcagcggctgcagcctcgctctggtccctgc ggctggcggccgagccgtgtgtctcctcctccatcgccgccatattgtctgtgtgagcag aggggagagcggccgccgccgctgccgcttccaccacagtgttggaacattgctagatta agcagaaatcaagatgactaccagctggttcgaaaattaggccgaggtaaatacagtgaa gtatttgaagccatcaacatcacaaataatgaaaaagttgttgttaaaattctcaagcca gtaaaaaagaagaaaattaagcgtgaaataaagattttggagaatttgagaggaggtccc aacatcatcacactggcagacattgtaaaagaccctgtgtcacgaacccccgccttggtt tttgaacacgtaaacaacacagacttcaagcaattgtaccagacgttaacagactatgat attcgattttacatgtatgagattctgaaggccctggattattgtcacagcatgggaatt atgcacagagatgtcaagccccataatgtcatgattgatcatgagcacagaaagctacga ctaatagactggggtttggctgagttttatcatcctggccaagaatataatgtccgagtt gcttcccgatacttcaaaggtcctgagctacttgtagactatcagatgtacgattatagt ttggatatgtggagtttgggttgtatgctggcaagtatgatctttcggaaggagccattt ttccatggacatgacaattatgatcagttggtgaggatagccaaggttctggggacagaa gatttatatgactatattgacaaatacaacattgaattagatccacgtttcaatgatatc ttgggcagacactctcgaaagcgatgggaacgctttgtccacagtgaaaatcagcacctt gtcagccctgaggccttggatttcctggacaaactgctgcgatatgaccaccagtcacgg cttactgcaagagaggcaatggagcacccctatttctacactgttgtgaaggaccaggct cgaatgggttcatctagcatgccagggggcagtacgcccgtcagcagcgccaatatgatg tcagcggggcgcctgcgcgcctgcgcgccgctggccgacggaggggagcctgccgatgcc gagcgggtgctacgtcccgcggtcggagccgcgtcttctcccggctccgccaccagccgg ggctcgggtgggggcccggggccccggggcatggccctccggagtgcgcagggcgacggc cccacctccggccactgggacggcggcgcggagaaggcaggctcccggagccgcctcccc gaggggtatcagctctcccgaactgcattcaccgtcgccggcttgattgacgcgctgccc gcgtaccgccgatgtaattaccacacagttcttcctttaagaatgggctttgcaggaaaa ggaagaaggtatctggaagcaataggagacaaaaactttaacgccaaaaggaaaaagaaa gtggcagagatacaccaggctctgaacagtgatcccactgatgtggctgcccttagacgc atggctatcagtgaaggagggctcctgactgatgagatcagacgaaaagtgtggcccaag ctcctcaatgtcaatgccaatgacccacctcctatatcagggaagaacctacggcagatg agcaaggactaccaacaagtgttgctggacgtccggcggtcattgcggcggttccctcct ggcatgccagaggaacagagagaagggctccaggaagaactgattgacatcatcctcctc atcttggagcgcaaccctcagctgcactactaccagggctaccatgacattgtggtcaca tttctgctggtggtaggcgagaggctggcaacatccctggtagaaaaattatctacccac cacctcagggattttatggatccaacaatggacaacaccaagcatatattaaactatctg atgcccatcattgaccaggtgaatccagagctccatgacttcatgcagagtgctgaggta gggaccatctttgccctcagctggctcatcacctggtttgggcatgtcctgtctgacttc aggcacgtcgtgcggttatatgacttcttcctggcctgccacccactgatgccgatttac tttgcagccgtgattgtgttgtatcgcgagcaggaagtcctggactgtgactgtgacatg gcctcggtccaccacctgttgtcccagatccctcaggacttgccctatgagacactgatc agcagagcaggagacctttttgttcagtttcccccatccgaacttgctcgggaggccgct gcccaacagcaagctgagaggacggcagcctctactttcaaagactttgagctggcatca gcccagcagaggcctgatatggtgctgcggcagcggtttcggggacttctgcggcctgaa gatcgaacaaaagatgtcctgaccaagccaaggaccaaccgctttgtgaaattggcagtg atggggctgacagtggcacttggagcggctgcactggctgtggtgaaaagtgccctggaa tgggcccctaagtttcagctgcagctgtttccctga