GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:25:42

Sequence gi568815593f:175778906_175980042 : 201137 bp : 47.26% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    461    456    6                               1.05
 1.05 Term -  18321  17494  828  0  0   62   49   277 0.265  14.24
 1.04 Intr -  25085  24969  117  2  0   28  109    78 0.297   4.56
 1.03 Intr -  27481  27456   26  2  2   66  113    26 0.250   0.64
 1.02 Intr -  28217  28188   30  0  0   84  105    19 0.203   1.30
 1.01 Init -  31622  31565   58  2  1   79   51    46 0.154   1.47
 1.00 Prom -  32287  32248   40                              -5.36

 2.00 Prom +  32939  32978   40                              -7.56
 2.01 Init +  33302  33420  119  1  2   90   99   101 0.955   9.07
 2.02 Intr +  37722  37849  128  0  2   28   48   119 0.669   2.22
 2.03 Term +  45569  45918  350  0  2   70   49   170 0.713   5.95
 2.04 PlyA +  48853  48858    6                               1.05

 3.05 PlyA -  49154  49149    6                               1.05
 3.04 Term -  54518  54379  140  0  2  110   37    47 0.098  -0.07
 3.03 Intr -  57722  57672   51  0  0   80   85    28 0.071   0.58
 3.02 Intr -  71935  71600  336  2  0   89   89   118 0.246   7.49
 3.01 Init -  81286  81127  160  1  1   12   36   145 0.011   1.69
 3.00 Prom -  82899  82860   40                              -2.86

 4.00 Prom +  89832  89871   40                              -3.46
 4.01 Init +  99835  99865   31  0  1   97   78    63 0.964   6.30
 4.02 Intr + 100003 100178  176  2  2  117   82   381 0.997  40.06
 4.03 Intr + 100943 101136  194  1  2   82   15   399 0.215  30.29
 4.04 Intr + 101713 101824  112  1  1   38   91    54 0.150   1.08
 4.05 Intr + 101864 101984  121  1  1   71   38    33 0.416  -3.33
 4.06 Intr + 104274 104489  216  1  0   49   89    95 0.547   4.18
 4.07 Intr + 105340 105477  138  2  0  103   64    13 0.558   0.84
 4.08 Intr + 121630 121743  114  2  0  116   68    23 0.021   3.52
 4.09 Intr + 126537 126671  135  1  0   -2   76   163 0.088   6.64
 4.10 Intr + 132748 132786   39  2  0   96  100     8 0.518   1.00
 4.11 Intr + 144195 144234   40  1  1  118   77    68 0.111   5.98
 4.12 Intr + 171263 171350   88  2  1   81   80    -5 0.003  -2.03
 4.13 Intr + 176018 176092   75  1  0  115   68    38 0.919   4.21
 4.14 Intr + 176729 176803   75  1  0   50   80    73 0.624   2.41
 4.15 Term + 177509 177649  141  1  0  114   54    78 0.979   4.93
 4.16 PlyA + 179056 179061    6                               1.05

 5.07 PlyA - 179730 179725    6                               1.05
 5.06 Term - 181227 181064  164  1  2   76   54   146 0.966   8.10
 5.05 Intr - 182246 182146  101  1  2   74   93    69 0.984   5.75
 5.04 Intr - 182529 182368  162  2  0   99  111   160 0.953  18.49
 5.03 Intr - 186250 186046  205  2  1   49   94   226 0.522  17.56
 5.02 Intr - 188362 188206  157  1  1   88   57    66 0.522   3.08
 5.01 Init - 189384 189037  348  0  0   45   85   533 0.561  43.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  93042  92777  266  1  2   41   52   214 0.801   8.77
S.002 Term - 120082 119864  219  1  0   51   51   124 0.907   2.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:175778906_175980042|GENSCAN_predicted_peptide_1|352_aa
MGLEYPNKLESALVLQEPKGEGPVEHALAEKLEGFSEKSWWPYVVSLPLHPPGIRMLPAR
HCSAHTAASANGFISQQARWGPQTRGFVRGSRCRRRELSAAAVLALGAARGGEGRSLAAA
RNQGGGCGGARPAAAVRRCSADPIGTRCGAGAVPEMVLEREARGALCLAPEEPPAAESSR
EGERPEEAASRKLSSAGPVAGVSPPTAPAEEGGSATGSERAFYEPVGGCSSSAADEGARA
EVLARFSLPLFLDSSAEALPPAWASTRARGGRQAPFLSASNACRAAGHRAQRPPPKGRPA
PRETQASRERRRGLAAEMREDASRGGSEIKETWESSHPGSSALKFEDLGVCH

>gi568815593f:175778906_175980042|GENSCAN_predicted_CDS_1|1059_bp
atggggctggagtacccaaacaagctggagtcagctcttgtcctgcaagagcccaaaggc
gaggggcccgttgagcacgcacttgctgagaagctggagggcttctctgaaaagtcctgg
tggccttatgtcgtctccctccccctgcatcctcccggcattaggatgctgcccgcaagg
cactgcagcgcccacacggctgccagcgccaatggatttatttcccagcaggcgcgctgg
gggccccagacccgcggctttgttcgcggttcgcggtgccgccggagagagctgtccgcc
gccgcggtgctagcgttgggagcggcccgcgggggcgagggacggtccttagcagccgcg
cggaaccagggcggagggtgcggaggagcccgcccggccgccgctgtccgccgctgcagt
gctgacccgatagggacccggtgcggcgcgggagctgtccccgagatggtgctggagcga
gaagcccgaggcgccctctgcctcgccccggaagagccgccggcggctgagagctccagg
gagggggagaggccggaggaagccgcctctaggaagctctcgagcgcggggcctgtggcc
ggggtctctccgccaaccgctcccgcggaggagggcggctcagccacagggagcgagcgc
gcgttctacgaacctgttggaggctgcagctcctccgcggccgacgagggcgcccgggcc
gaggttctggcgcgtttctccctccccctcttccttgactcctccgccgaggcgctccct
cctgcctgggctagcacacgtgcacgcggggggcgccaagccccgtttctgtccgcctcg
aatgcctgccgagctgcagggcacagagcccagaggcctccccctaagggcagacccgct
cccagggagacccaagcttcccgagaaaggaggcgggggctggcggcggagatgagagag
gacgccagcagaggtggcagtgaaatcaaggagacatgggagagttcccacccaggcagt
tctgccctgaagttcgaggacttgggggtctgccactag

>gi568815593f:175778906_175980042|GENSCAN_predicted_peptide_2|198_aa
MRRAASTAQAPWPAPSQALPTTECSQHLRETGMRTPVSFSAGEPGVQKEDLGYAWGRGSQ
EPEVLETQKLASITENQRCLMAGGNQGPERQMDLPKATGSVRPKGQGLASAVSNDLELGS
ETGQAGDLKVKRRAGEREAMRASTFIVLLLRTRPTRKCLPPCSHTTTRKAEIDIPVAQRK
KLSIKEDPWFYIIHSAGK

>gi568815593f:175778906_175980042|GENSCAN_predicted_CDS_2|597_bp
atgcggagggcagccagcacagctcaggccccttggccagccccatcccaggccctgccc
accacagaatgcagccagcacctgagggagacaggaatgagaacaccagtgagcttcagt
gctggggagccaggggtccaaaaggaggacttgggctatgcgtggggcagaggcagccag
gaaccagaggtgctcgagacacagaaacttgccagcatcactgagaaccagaggtgcctc
atggctggaggaaaccaaggcccagaaaggcaaatggacttgcccaaggccacaggatca
gtgagacctaaaggccaaggtctggcctctgcagtttcaaatgacctagaacttggcagt
gaaacaggccaagcaggggacctgaaagtcaagaggagggcaggagaaagagaagccatg
agagccagcaccttcattgtgctcctactacgcaccagacccactagaaagtgtctacca
ccttgttctcacacaacaactcggaaggctgagattgacattcccgttgcacagaggaag
aaattgagcatcaaagaggatccgtggttttacatcatccactcagctgggaagtga

>gi568815593f:175778906_175980042|GENSCAN_predicted_peptide_3|228_aa
MGLAGLAECGTEQLVTGLPPPGEGLPEGDANAEKHTAERRAGDKFLMTVCEGPALTWFRL
SEPSSEHIALPLLSNSPLSMVAFFKVDLADISGGSGDEGALGQMSGDCWAKWNQTDFPCR
TSQGLCCAPVDDDFPRGEAACGISRTYLIMHLLMWNIKGVFRGSAERGPLFPDDLQLVGQ
TPDLSTLEAFCAFVFLHHEFSGVRDCTLFISVSQAASPSHLQNQGGNS

>gi568815593f:175778906_175980042|GENSCAN_predicted_CDS_3|687_bp
atgggattagcagggctggcagaatgtgggactgagcagctggtgactggtttgccacca
cctggggagggcctgcccgagggtgatgccaatgcagagaagcacacagctgagcgaagg
gcaggagacaaattcctgatgacagtgtgtgagggcccagccctcacctggttccgactc
agtgagcccagctctgaacacatcgctctgcccctgctcagcaattcacccctttctatg
gtggcatttttcaaagtggatcttgcagatattagtgggggttctggggatgagggtgcc
ctcggtcaaatgagtggggattgctgggctaaatggaatcagacagacttcccctgcagg
acttctcagggcctttgctgtgctcctgtggatgatgattttccgagaggtgaagcagca
tgtggcatttctagaacttatttgatcatgcaccttttgatgtggaacatcaagggggtg
ttccgaggaagtgcagaaagagggcccctctttccagatgacttgcagctggttggacaa
acacctgacttgagcaccctagaggccttctgtgcgtttgtgtttcttcaccacgagttc
tctggggtcagggactgtactctcttcatctctgtatcccaagcagccagcccttctcat
ctgcaaaatcagggtggcaattcctaa

>gi568815593f:175778906_175980042|GENSCAN_predicted_peptide_4|564_aa
MDFVMKQALGGATKDMGKMLGGEEEKDPDAQKKEEERQEALRQQEEERKAKHARMEAERE
KVRQQIRDKYGLKKKEEKEAEEKAALEQPCEGSLTRPKKAIPAGCGDEEEEEEESILDTV
LKYLPGPLQDMFKNPGAPPSPAPTWRDMAPAKLVPSNGSFVDFSSFVEEPQVLGPHCQAG
PSLLSQGAARPPENTWSHRVAMVYAMGNTSIGKGPGSHAWEELPERSKKAAVDRPGLGTS
DRSGSPNMDVLPSKSRSSQKVSLTAKLCRVLLQMESGSRDTIPESILQPQGLSVPHGPPP
HPPHTHTISDTPSPEPLWVPLPENAVKSHGSWGPFDAGPTLIYPRNTVSGPCSVSPGSHT
DTCHMRSQLCHDHPKRSGSSDILQNITLIQYTDVIMLIRQIDKQYLACWKPKGKVRPMDT
VPAYEGHLEKFEDESDIKWPNSKATSSEAGPWPAPQSKSAFAHIRTSMVSKLKLLCPLAS
AHQQCDIFPTKNQIYGAPVVPDSGQVVPGDSVTDAAGQRHLLLPLTIFTSRTLTATPWLG
SKLQQRPPTCPLSCTHLCSTQGLH

>gi568815593f:175778906_175980042|GENSCAN_predicted_CDS_4|1695_bp
atggacttcgtcatgaagcaggcccttggaggggccacaaaggacatggggaagatgctg
gggggagaggaggagaaggaccccgacgcgcagaaaaaggaggaggagcggcaggaggcg
ctgcggcagcaggaggaggagcgtaaggccaagcacgcgcgcatggaggcggagcgggag
aaggtccggcagcagatccgagataagtatgggctgaagaagaaggaggagaaggaagca
gaggagaaagcagccctggagcagccctgcgaggggagcctgacccggcccaagaaggcc
atccctgcgggctgcggggacgaggaggaggaggaagaggagagcatcctggacacggtg
ctcaaatacctgcccgggccgctgcaggacatgttcaagaacccaggggcacccccttcc
ccagcccccacctggagagacatggcccctgccaagctggtcccttcaaatggatccttt
gtggactttagctcatttgtggaggaaccccaggtcctgggcccccactgccaggctggg
cccagcttgctcagtcaaggggctgccaggcccccagaaaacacttggagccatcgggta
gcgatggtctatgccatggggaacacctccattgggaagggcccaggatcccatgcctgg
gaggagctgccagagagaagcaaaaaggcggctgtggatcgccctgggctgggcaccagt
gacaggtcaggatctccaaacatggacgtcctcccctccaaatccagaagctcccagaag
gtgtccttaactgcaaagctgtgcagggtactcctccagatggaatcaggaagtcgagac
accatcccagaatcaattcttcagccccaggggctgagtgttccccatggaccacccccc
caccccccgcacacacacaccatctctgacaccccttcccccgagcccctctgggtgccg
cttcctgagaatgctgtcaagtcgcacggaagctggggcccctttgatgctggacccact
ctgatataccccagaaacacggtcagtgggccatgcagtgtgtctccaggcagccacaca
gacacgtgtcacatgcgatcacagctctgtcatgaccatccaaaaagatctggatcatcg
gacatcctgcagaacatcacactgatccagtacactgatgtcatcatgctgatcaggcag
atagacaagcagtacctagcatgctggaagccaaagggaaaagtgagacccatggatact
gtcccagcgtacgaaggccatttggagaaatttgaagatgaatcagatattaaatggcct
aattccaaagccacctcctctgaagctggcccatggcctgcaccccagtccaaaagtgct
ttcgcccacatcaggacctccatggtctccaagctaaagcttctctgtcccctggcttct
gcccatcagcagtgtgacatctttcccacaaagaaccagatctatggtgccccagtggtc
cctgacagcgggcaagttgtacccggtgactctgtcactgacgctgctggtcagcgacat
ctcctactgcccctgaccatcttcacgtccaggacactgacggccacaccttggcttggc
tccaagctgcagcaacgcccccccacctgtcccctgtcctgcactcacctctgctctacc
caaggcctgcactga

>gi568815593f:175778906_175980042|GENSCAN_predicted_peptide_5|378_aa
MRAAAAPGLTAPWRLLQCCELEAGELGMAVPAAAMGPSALGQSGPGSMAPWCSVSSGPSR
YVLGMQELFRGHSKTREFLAHSAKVHSVAWSCDGRRLASGSFDKTASVFLLEKDRLVKEN
NYRGHGDSVDQLCWHPSNPDLFVTASGDKTIRIWDVRTTKCIATVNTKGENINICWSPDG
QTIAVGNKDDVVTFIDAKTHRSKAEEQFKFEVNEISWNNDNNMFFLTNGNGCINILSYPE
LKPVQSINAHPSNCICIKFDPMGKYFATGSADALVSLWDVDELVCVRCFSRLDWPVRTLS
FSHDGKMLASASEDHFIDIAEVETGDKLWEVQCESPTFTVAWHPKRPLLAFACDDKDGKY
DSSREAGTVKLFGLPNDS

>gi568815593f:175778906_175980042|GENSCAN_predicted_CDS_5|1137_bp
atgcgtgcggcagcggcgccaggactgactgcgccgtggaggctgctgcagtgttgtgag
ttggaagctggggagctcggcatggcggtccccgctgcagccatggggccctcggcgttg
ggccagagcggccccggctcgatggccccgtggtgctcagtgagcagcggcccgtcgcgc
tacgtgcttgggatgcaggagctgttccggggccacagcaagacgcgcgagttcctggcg
cacagcgccaaggtgcactcggtggcctggagttgcgacgggcgtcgcctagcctcgggg
tccttcgacaagacggccagcgtcttcttgctggagaaggaccggttggtcaaagaaaac
aattatcggggacatggggatagtgtggaccagctttgttggcatccaagtaatcctgac
ctatttgttacggcgtccggagataaaaccattcgcatctgggatgtgaggactacaaaa
tgcattgccactgtgaacactaaaggggagaacattaatatctgctggagtcctgatggg
cagaccattgctgtaggcaacaaggatgatgtggtgacctttattgatgccaagacacac
cgttccaaagcagaagagcagttcaagttcgaggtcaacgaaatctcctggaacaatgac
aataatatgttcttcctgacaaatggcaatggttgtatcaacatcctcagctacccagaa
ctgaagcctgtgcagtccatcaacgcccatccttccaactgcatctgtatcaagtttgac
cccatggggaagtactttgccacaggaagtgcagatgctttggtcagcctctgggatgtg
gatgagttagtgtgtgttcggtgcttttccaggctggattggcctgtaagaaccctcagt
ttcagccatgatgggaaaatgctggcgtcagcatcggaagatcattttattgacattgct
gaagtggagacaggggacaaactatgggaggtacagtgtgagtctccgaccttcacagtg
gcgtggcaccccaaaaggcctctgctggcatttgcctgtgatgacaaagacggcaaatat
gacagcagccgggaagccggaactgtgaagctgtttgggcttcctaatgattcttga