
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144724.7 - phase: 0 /pseudo
(1419 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BE122516 78 9e-15
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ... 68 1e-11
AV410603 65 1e-10
BI418821 61 1e-09
BG662087 60 2e-09
AU089582 56 4e-09
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 59 5e-09
AU251673 53 3e-07
TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, part... 45 6e-05
TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, par... 43 4e-04
TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 39 0.006
TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 38 0.010
TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 37 0.017
AV420911 37 0.017
TC9039 31 1.2
TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, ... 31 1.2
AV773507 31 1.2
TC17929 31 1.6
BP034485 29 5.9
BP079571 26 6.5
>BE122516
Length = 364
Score = 78.2 bits (191), Expect = 9e-15
Identities = 48/114 (42%), Positives = 70/114 (61%), Gaps = 4/114 (3%)
Frame = +2
Query: 402 GMDWLLFFGVSINCLTKSVTFSKPVEKLDRKFLTAEQVKKSLDGEACVFMMFASLKENSE 461
GM+WL ++NC K+VTF E ++ ++V K+ + E+ V ++ A + S+
Sbjct: 17 GMNWLTANDATLNCRKKTVTFGTS-EGDAKRVKRTDKVGKASECESDV-LLGALETDKSD 190
Query: 462 KGVGDLPIVQEF----PEDITELPLEREVEFAIDLVPGMSPIWITPYPMSASEL 511
GV +P+V+EF PE+++ELP EREVEF+ID VPG PI I PY MS EL
Sbjct: 191 TGVEGIPVVREFSDVFPEEVSELPPEREVEFSID*VPGTGPISIAPYRMSLVEL 352
>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
[imported] - Agrobacterium tumefaciens
(strain C58, Dupont) {Agrobacterium tumefaciens;},
partial (6%)
Length = 561
Score = 67.8 bits (164), Expect = 1e-11
Identities = 37/115 (32%), Positives = 53/115 (45%), Gaps = 23/115 (20%)
Frame = -2
Query: 251 DISCHKCGKAGHKAAECKDVARDVTCYNCGEKGHISTKCTKPKK---------------- 294
+I CH+C K GH A C D+ C+NC + GH CT PK
Sbjct: 338 EIVCHRCSKKGHFANRCPDLV----CWNCQKTGHSGKDCTNPKVEAATNAIAARRPAPAA 171
Query: 295 -------AAGKVFALNAEEVEQPDNLIRGMCFINSTHLIGIIDIGATHSFISVSC 342
A+ +V+ ++ E + D LIR + +N L + D GATHSFI ++C
Sbjct: 170 NKGKRPVASARVYTVSGAESHRADGLIRSVGSVNCKPLTILFDSGATHSFIDLAC 6
>AV410603
Length = 162
Score = 64.7 bits (156), Expect = 1e-10
Identities = 28/53 (52%), Positives = 40/53 (74%)
Frame = +1
Query: 561 TIKNRYPLLRIDDLMDQLVGAEVFSKIDFRSGYHHNRVKAEDISKTVFRTRYG 613
T+K+ +P+ +D+L+D+L G++ FSK+D RSGYH VK ED KTVFRT +G
Sbjct: 4 TVKDSFPMPTVDELLDELRGSQFFSKLDLRSGYHQILVKPEDRHKTVFRTHHG 162
>BI418821
Length = 614
Score = 61.2 bits (147), Expect = 1e-09
Identities = 40/154 (25%), Positives = 63/154 (39%), Gaps = 17/154 (11%)
Frame = +2
Query: 155 ESGLRPDIYQYMCVQEIRDFDTLV---HKCRKFDDAGRVKANYYKAQSEKRGKGHGVGKP 211
+S +R D Y+ + ++ +F K + D G A + + +G G
Sbjct: 167 QSSIRSDGYRTLLEGDLVEFSIATGDNDKTKAVDVTGPNGAALQPTRKDSAPRGFG---G 337
Query: 212 YNKDKRKKREVGGGSKPSLADIKCYRCGTLGHYANDCKKD---------ISCHKCGKAGH 262
+ +R+ GGG CY CG GH A DC + +C+ CG AGH
Sbjct: 338 WRGGERRNGGGGGGGGGG-----CYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGH 502
Query: 263 KAAECKDVARD-----VTCYNCGEKGHISTKCTK 291
A +C + CYNCG+ GH++ C +
Sbjct: 503 LARDCNRSNNNSGGGGAGCYNCGDTGHLARDCNR 604
>BG662087
Length = 373
Score = 60.5 bits (145), Expect = 2e-09
Identities = 34/85 (40%), Positives = 44/85 (51%)
Frame = +1
Query: 546 GSVRLCVDYRQLNKVTIKNRYPLLRIDDLMDQLVGAEVFSKIDFRSGYHHNRVKAEDISK 605
G R+ VDY LNK K+ YPL ID L+D E+ S +D SGYH ++ D K
Sbjct: 16 GKWRMWVDYTDLNKACPKDSYPLPSIDKLVDGASDNELLSLMDAYSGYHQIKMHPSDEDK 195
Query: 606 TVFRTRYGHYEYFVMQFGVYNVPGT 630
T F T +Y Y + FG+ N T
Sbjct: 196 TAFMTARVNYCYQTIPFGLKNAGAT 270
>AU089582
Length = 383
Score = 55.8 bits (133), Expect(2) = 4e-09
Identities = 29/75 (38%), Positives = 41/75 (54%)
Frame = +1
Query: 1106 CGEAS*DLH*ADCEVAWYSF*YCVRSESEVYFYVLGELANCFGN*VEIEFCLSSPDRWAD 1165
C *DL DC +AW + Y RS S ++ L +NCFGN ++ E+ SS + W+
Sbjct: 10 CVSIC*DLLG*DCFLAWCTCVYNFRSRSSIHITFLEVFSNCFGNSIKNEYRFSSSN*WSV 189
Query: 1166 IEDYLEFRGFVESLC 1180
EDY + RG+ LC
Sbjct: 190 REDYSDLRGYASCLC 234
Score = 23.1 bits (48), Expect(2) = 4e-09
Identities = 16/43 (37%), Positives = 23/43 (53%)
Frame = +2
Query: 1184 RYQLGRMFTVD*VYLQQQFSF*YWNGTV*GFVWSEV*DTVVLV 1226
R LG +F D + + S *+ +GT+* VW E+ T LV
Sbjct: 245 RGXLGSVFVFDGIRI***LSV*HLDGTI*SLVW*EMQVTYRLV 373
>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
Length = 555
Score = 58.9 bits (141), Expect = 5e-09
Identities = 29/79 (36%), Positives = 41/79 (51%)
Frame = +2
Query: 211 PYNKDKRKKREVGGGSKPSLADIKCYRCGTLGHYANDCKKDISCHKCGKAGHKAAECKDV 270
PY +D R+ G S+ +L C C GH+A +C CH CG GH A+EC
Sbjct: 299 PYRRDSRR-----GFSRDNL----CKNCKRPGHFARECPNVAICHNCGLPGHIASECTTK 451
Query: 271 ARDVTCYNCGEKGHISTKC 289
+ C+NC E GH+++ C
Sbjct: 452 S---LCWNCKEPGHMASSC 499
Score = 44.3 bits (103), Expect = 1e-04
Identities = 19/41 (46%), Positives = 26/41 (63%)
Frame = +2
Query: 250 KDISCHKCGKAGHKAAECKDVARDVTCYNCGEKGHISTKCT 290
+D C C + GH A EC +VA C+NCG GHI+++CT
Sbjct: 332 RDNLCKNCKRPGHFARECPNVA---ICHNCGLPGHIASECT 445
>AU251673
Length = 413
Score = 53.1 bits (126), Expect = 3e-07
Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 2/102 (1%)
Frame = +2
Query: 54 SDDQKVRLTTHMLAEEAEYWWTNAKGRLEIVGEVV-TWAKFKAEFLRKYFPEDLRTRKEV 112
SD + V L + L A W+ N R + VG TWA F AEF+ ++ P+ +R
Sbjct: 8 SDTRAVELASFQLEGVARDWY-NVLTRAKPVGSPPWTWADFSAEFMNRFLPQSVRDGFVR 184
Query: 113 AFLNLK*GS-ISVAEYAAKFEKLSRFCPYIIAEDAMVSKCVK 153
F L+ ++V+EY+A F LSR+ PY + E+ V + V+
Sbjct: 185 DFERLEQAEGMTVSEYSAHFTHLSRYVPYPLLEEERVKRFVR 310
>TC9521 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (56%)
Length = 598
Score = 45.4 bits (106), Expect = 6e-05
Identities = 39/134 (29%), Positives = 54/134 (40%), Gaps = 2/134 (1%)
Frame = +1
Query: 163 YQYMCVQEIRDFDTLVHKCRKFD-DAGRVKANYYKAQSEKRGKGHGVGKPYNKDKRKKRE 221
Y ++ + RD D + D D R+ + A+ RG G G +D R +
Sbjct: 196 YAFVEFSDPRDADDARYNLDGRDVDGSRLIVEF--AKGVPRGSREGGG---GRD-RDREY 357
Query: 222 VGGGSKPSLADIKCYRCGTLGHYANDCKKDISCHKCGKAGHKAAECKDVARDVTCYNCGE 281
+G G P +C+ CG GH+A DCK +KC Y CGE
Sbjct: 358 MGRGPPPGSG--RCFNCGIDGHWARDCKAGDWKNKC-------------------YRCGE 474
Query: 282 KGHISTKC-TKPKK 294
+GHI C PKK
Sbjct: 475 RGHIEKNCKNSPKK 516
>TC10011 similar to UP|Q9FYA7 (Q9FYA7) Splicing factor RSZ33, partial (62%)
Length = 684
Score = 42.7 bits (99), Expect = 4e-04
Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Frame = +2
Query: 225 GSKPSLADIKCYRCGTLGHYANDCKKDISCHKCGKAGHKAAECKDVARDVTCYNCGEKGH 284
G P +C+ CG GH+A DC KA + K+ CY CG++GH
Sbjct: 341 GRGPPPGSGRCFNCGLDGHWARDC--------------KAGDWKN-----KCYRCGDRGH 463
Query: 285 ISTKC-TKPKK 294
+ C PKK
Sbjct: 464 VERNCKNSPKK 496
>TC13053 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (9%)
Length = 450
Score = 38.9 bits (89), Expect = 0.006
Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Frame = +3
Query: 233 IKCYRCGTLGHYANDCKKDIS-CHKCGKAGHKAAEC 267
+ C C LGH + DC + CH CG GH A EC
Sbjct: 3 VVCRNCQQLGHMSRDCMGPLMICHNCGGRGHLAYEC 110
Score = 35.4 bits (80), Expect = 0.063
Identities = 12/38 (31%), Positives = 22/38 (57%)
Frame = +3
Query: 252 ISCHKCGKAGHKAAECKDVARDVTCYNCGEKGHISTKC 289
+ C C + GH + +C + C+NCG +GH++ +C
Sbjct: 3 VVCRNCQQLGHMSRDCMGPL--MICHNCGGRGHLAYEC 110
>TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (89%)
Length = 912
Score = 38.1 bits (87), Expect = 0.010
Identities = 25/91 (27%), Positives = 40/91 (43%)
Frame = +1
Query: 159 RPDIYQYMCVQEIRDFDTLVHKCRKFDDAGRVKANYYKAQSEKRGKGHGVGKPYNKDKRK 218
RP Y ++ + RD + R+ D N ++ + +G G G + +
Sbjct: 190 RPPGYAFIDFDDRRDAQDAI---RELDGK-----NGWRVELSHNSRGGGGGGGGGRGGGR 345
Query: 219 KREVGGGSKPSLADIKCYRCGTLGHYANDCK 249
GGGS D+KCY CG GH+A +C+
Sbjct: 346 SGGGGGGS-----DMKCYECGEPGHFARECR 423
Score = 29.6 bits (65), Expect = 3.5
Identities = 9/17 (52%), Positives = 12/17 (69%)
Frame = +1
Query: 273 DVTCYNCGEKGHISTKC 289
D+ CY CGE GH + +C
Sbjct: 370 DMKCYECGEPGHFAREC 420
Score = 29.3 bits (64), Expect = 4.5
Identities = 9/18 (50%), Positives = 14/18 (77%)
Frame = +1
Query: 251 DISCHKCGKAGHKAAECK 268
D+ C++CG+ GH A EC+
Sbjct: 370 DMKCYECGEPGHFARECR 423
>TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (42%)
Length = 587
Score = 37.4 bits (85), Expect = 0.017
Identities = 18/57 (31%), Positives = 26/57 (45%)
Frame = +1
Query: 193 NYYKAQSEKRGKGHGVGKPYNKDKRKKREVGGGSKPSLADIKCYRCGTLGHYANDCK 249
N ++ + KG G G+ GGG D+KCY CG GH+A +C+
Sbjct: 76 NGWRVELSHNSKGGGGGR------------GGGRGRGGEDLKCYECGEPGHFARECR 210
Score = 31.2 bits (69), Expect = 1.2
Identities = 13/44 (29%), Positives = 21/44 (47%)
Frame = +1
Query: 246 NDCKKDISCHKCGKAGHKAAECKDVARDVTCYNCGEKGHISTKC 289
N + ++S + G G + D+ CY CGE GH + +C
Sbjct: 76 NGWRVELSHNSKGGGGGRGGGRGRGGEDLKCYECGEPGHFAREC 207
Score = 30.0 bits (66), Expect = 2.7
Identities = 9/19 (47%), Positives = 15/19 (78%)
Frame = +1
Query: 250 KDISCHKCGKAGHKAAECK 268
+D+ C++CG+ GH A EC+
Sbjct: 154 EDLKCYECGEPGHFARECR 210
>AV420911
Length = 418
Score = 37.4 bits (85), Expect = 0.017
Identities = 18/57 (31%), Positives = 26/57 (45%)
Frame = +2
Query: 193 NYYKAQSEKRGKGHGVGKPYNKDKRKKREVGGGSKPSLADIKCYRCGTLGHYANDCK 249
N ++ + KG G G+ GGG D+KCY CG GH+A +C+
Sbjct: 284 NGWRVELSHNSKGGGGGR------------GGGRGRGGEDLKCYECGEPGHFARECR 418
Score = 31.2 bits (69), Expect = 1.2
Identities = 13/44 (29%), Positives = 21/44 (47%)
Frame = +2
Query: 246 NDCKKDISCHKCGKAGHKAAECKDVARDVTCYNCGEKGHISTKC 289
N + ++S + G G + D+ CY CGE GH + +C
Sbjct: 284 NGWRVELSHNSKGGGGGRGGGRGRGGEDLKCYECGEPGHFAREC 415
Score = 30.0 bits (66), Expect = 2.7
Identities = 9/19 (47%), Positives = 15/19 (78%)
Frame = +2
Query: 250 KDISCHKCGKAGHKAAECK 268
+D+ C++CG+ GH A EC+
Sbjct: 362 EDLKCYECGEPGHFARECR 418
>TC9039
Length = 1218
Score = 31.2 bits (69), Expect = 1.2
Identities = 17/52 (32%), Positives = 24/52 (45%), Gaps = 3/52 (5%)
Frame = +2
Query: 243 HYANDCKKDISCHKCGKAGHKAAEC---KDVARDVTCYNCGEKGHISTKCTK 291
H+ + K K + ++AA+ K D TCY C GH+ KCTK
Sbjct: 737 HFVSTSKDKGKRKKTVEPKNEAADAPAPKKQKEDDTCYFCNVSGHMKKKCTK 892
Score = 29.6 bits (65), Expect = 3.5
Identities = 23/93 (24%), Positives = 34/93 (35%), Gaps = 7/93 (7%)
Frame = +2
Query: 165 YMCVQEIRDFDTLVHKCRKFDDAGRVKANYYKAQSEKRGKGHGVGKPYNKDKRKK----- 219
Y C +E + L+ C + ++ + + E++ H V +K KRKK
Sbjct: 641 YNCPKEKWSLNELISFCVQEEE---------RLKQERKESAHFVSTSKDKGKRKKTVEPK 793
Query: 220 REVGGGSKPSLA--DIKCYRCGTLGHYANDCKK 250
E P D CY C GH C K
Sbjct: 794 NEAADAPAPKKQKEDDTCYFCNVSGHMKKKCTK 892
>TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
(58%)
Length = 881
Score = 31.2 bits (69), Expect = 1.2
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 8/64 (12%)
Frame = +2
Query: 196 KAQSEKRGKGHGVGKPYNKDKRKKREVG--------GGSKPSLADIKCYRCGTLGHYAND 247
KAQ + + G Y + R + + G GGS+ S +C++CG GH+A +
Sbjct: 281 KAQPQGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGG-ECFKCGKPGHFARE 457
Query: 248 CKKD 251
C +
Sbjct: 458 CPSE 469
>AV773507
Length = 496
Score = 31.2 bits (69), Expect = 1.2
Identities = 27/129 (20%), Positives = 55/129 (41%), Gaps = 11/129 (8%)
Frame = +2
Query: 150 KCVKFESGLRPDIYQYMCVQEIRDFDTLVHKCRKF--DDAGRVKANYYKAQSEKRGKGHG 207
K +++ GL D+ + ++ D D + +C+ D GR + + +++RG
Sbjct: 8 KRIEYRGGLSWDVTERQ-LEHAFDRDGKILECQIMMERDTGRPRGFGFITFADRRGMEDA 184
Query: 208 VGKPYNKD--------KRKKREVGGGSKPSLADIKCYRCGTLGHY-ANDCKKDISCHKCG 258
+ + + ++ + + ++GG Y G G Y A D C +CG
Sbjct: 185 IKEMHGREIGDRIISVNKAQPKMGGDDADQGYRGGGYSSGGRGSYGAGDRVGQDDCFECG 364
Query: 259 KAGHKAAEC 267
+ GH+A +C
Sbjct: 365 RPGHRARDC 391
>TC17929
Length = 791
Score = 30.8 bits (68), Expect = 1.6
Identities = 11/25 (44%), Positives = 13/25 (52%)
Frame = +2
Query: 275 TCYNCGEKGHISTKCTKPKKAAGKV 299
TCY CGE GH C K + K+
Sbjct: 59 TCYRCGESGHKMRNCPKEHSSGEKI 133
Score = 29.6 bits (65), Expect = 3.5
Identities = 10/22 (45%), Positives = 15/22 (67%)
Frame = +2
Query: 246 NDCKKDISCHKCGKAGHKAAEC 267
ND K +C++CG++GHK C
Sbjct: 38 NDSKFRQTCYRCGESGHKMRNC 103
>BP034485
Length = 485
Score = 28.9 bits (63), Expect = 5.9
Identities = 12/29 (41%), Positives = 18/29 (61%)
Frame = +3
Query: 198 QSEKRGKGHGVGKPYNKDKRKKREVGGGS 226
+ K+G G G G+ NK+K K + GGG+
Sbjct: 132 KKSKKGGGGGGGETANKNKNKNKNNGGGN 218
>BP079571
Length = 414
Score = 26.2 bits (56), Expect(2) = 6.5
Identities = 12/27 (44%), Positives = 14/27 (51%)
Frame = -1
Query: 222 VGGGSKPSLADIKCYRCGTLGHYANDC 248
VGGG C+R G +GH A DC
Sbjct: 372 VGGGG--------CFRFGEVGHLARDC 316
Score = 20.8 bits (42), Expect(2) = 6.5
Identities = 5/9 (55%), Positives = 8/9 (88%)
Frame = -3
Query: 276 CYNCGEKGH 284
C++CG+ GH
Sbjct: 262 CFHCGKPGH 236
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.350 0.157 0.564
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 26,899,891
Number of Sequences: 28460
Number of extensions: 410425
Number of successful extensions: 3591
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 3423
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3564
length of query: 1419
length of database: 4,897,600
effective HSP length: 102
effective length of query: 1317
effective length of database: 1,994,680
effective search space: 2626993560
effective search space used: 2626993560
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 61 (28.1 bits)
Medicago: description of AC144724.7