
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144730.2 - phase: 0 /pseudo
(866 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC8301 66 2e-11
TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 48 8e-06
TC9039 46 3e-05
BP054557 40 0.002
BI418821 39 0.005
AV421607 38 0.006
TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 38 0.008
TC8302 37 0.010
AV773507 36 0.023
TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, ... 36 0.030
BP065763 34 0.087
BP042550 32 0.33
AV420911 32 0.56
TC12416 similar to GB|AAH56066.1|33417132|BC056066 sop-prov prot... 30 1.3
TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, parti... 30 1.6
TC18927 similar to PIR|AI2934|AI2934 chromate transport protein ... 30 2.1
TC11719 29 3.7
BP030995 29 3.7
BI420796 28 4.8
BP078157 28 6.2
>TC8301
Length = 1243
Score = 66.2 bits (160), Expect = 2e-11
Identities = 39/102 (38%), Positives = 62/102 (60%)
Frame = +2
Query: 1 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
K++I KF+GSN F LWK+K++AIL + C+ A+ G +T + EM+D AV+ +
Sbjct: 509 KFEIPKFSGSN-FSLWKLKIKAILRKDNCLPAIDGR---PADITDDKWKEMDDNAVANLH 676
Query: 61 LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQ 102
L + D VL ++ + TA + + L LY KSL H + LK++
Sbjct: 677 LAVADSVLSSIAEKKTAKEIWDTLIQLYEVKSL-HNRILKEE 799
>TC11095 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (89%)
Length = 912
Score = 47.8 bits (112), Expect = 8e-06
Identities = 22/61 (36%), Positives = 28/61 (45%)
Frame = +1
Query: 191 DSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKGN 250
D G V + G G G R RS G G + KC+ C PGHF ++C R G+
Sbjct: 259 DGKNGWRVELSHNSRGGGGGGGGGRGGGRSGGGGGGSDMKCYECGEPGHFARECRMRGGS 438
Query: 251 G 251
G
Sbjct: 439 G 441
>TC9039
Length = 1218
Score = 45.8 bits (107), Expect = 3e-05
Identities = 28/85 (32%), Positives = 41/85 (47%), Gaps = 2/85 (2%)
Frame = +3
Query: 311 VCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERG 370
+ +G+ K +++ IG RL + LKD +P RRNLIS+S D GY
Sbjct: 960 IYVGDGKTVEVEAIGHFRLLLCTGFYLDLKDTFVVPSFRRNLISVSNLDKSGYSCSFGNS 1139
Query: 371 VMRISHGALIIAKGSKI--HGLYIL 393
+S + I+ GS I LY+L
Sbjct: 1140 KAELSFNSNIVGTGSLIGYDNLYLL 1214
Score = 34.3 bits (77), Expect = 0.087
Identities = 34/163 (20%), Positives = 66/163 (39%), Gaps = 1/163 (0%)
Frame = +2
Query: 83 KLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKAL 142
+++ + A L Q L + I E + + I L + + L D D +
Sbjct: 413 EIEKRFAKSDKAETSTLLQNLISMKYQGKGNIREYIMGMSNIASKLKALKLELSD-DLLI 589
Query: 143 HLPC-ALPRSFENFKDTMLYGK*GTITLEEVQAALRTKELTKFKELKVEDSGEGLNVSRE 201
HL +LP F FK + K E + ++ +E K + ++S ++ S++
Sbjct: 590 HLVLLSLPAQFSQFKISYNCPKEKWSLNELISFCVQEEERLKQER---KESAHFVSTSKD 760
Query: 202 RSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDC 244
+ + + + KN + + + K C+ C+ GH KK C
Sbjct: 761 KGKRKKTVEPKNEAADAPAPKK-QKEDDTCYFCNVSGHMKKKC 886
>BP054557
Length = 538
Score = 40.0 bits (92), Expect = 0.002
Identities = 38/160 (23%), Positives = 66/160 (40%), Gaps = 17/160 (10%)
Frame = -2
Query: 284 VLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVR 343
+LD+G ++H+C + F+ L ++ + N + + I + ++ F L V
Sbjct: 537 LLDTGATHHVCHNLDVFQALRQVHPVLLSMPNGQ----KEIAHMADSVYFSEKFFLAXVL 370
Query: 344 YIPKLRRNLISISMFDGLGYCTRI---ERGVMRISHGALIIAKGSKIHGLYIL------- 393
Y+P NLIS+S C + E +++ I + GLY +
Sbjct: 369 YVPSFNFNLISVSALVKSLNCRLVIFDECCLLQERPTLKTIGAAEERGGLYAVIDPAVKA 190
Query: 394 -----EGS--TVIADASVASVDTLDVTKLWHLRLGHVSER 426
EG+ T ++ + D LWHLRLGH S +
Sbjct: 189 KHPPAEGNIKTQVSHFNFCQFD----CNLWHLRLGHTSSK 82
>BI418821
Length = 614
Score = 38.5 bits (88), Expect = 0.005
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Frame = +2
Query: 172 VQAALRTKELTKFKELKVED-SGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYK 230
V+ ++ T + K K + V +G L +R+ S RG G + + R+ G G
Sbjct: 218 VEFSIATGDNDKTKAVDVTGPNGAALQPTRKDSAPRGFGGWRGG--ERRNGGGGGGGGGG 391
Query: 231 CFICHNPGHFKKDCPERKGNGGG 253
C+ C + GH +DC NGGG
Sbjct: 392 CYNCGDTGHLARDCHRSNNNGGG 460
Score = 29.6 bits (65), Expect = 2.1
Identities = 13/39 (33%), Positives = 17/39 (43%)
Frame = +2
Query: 215 RSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKGNGGG 253
RS + G G C+ C + GH +DC N GG
Sbjct: 437 RSNNNGGGGGGAA---CYNCGDAGHLARDCNRSNNNSGG 544
>AV421607
Length = 245
Score = 38.1 bits (87), Expect = 0.006
Identities = 14/35 (40%), Positives = 19/35 (54%)
Frame = +3
Query: 222 GDGNKTQYKCFICHNPGHFKKDCPERKGNGGGNPS 256
G G+ T C+ C PGH+ +DCP N NP+
Sbjct: 27 GSGSGTATGCYKCGRPGHWSRDCPSSAPNSNPNPN 131
>TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana
{Arabidopsis thaliana;}, partial (42%)
Length = 587
Score = 37.7 bits (86), Expect = 0.008
Identities = 23/76 (30%), Positives = 35/76 (45%)
Frame = +1
Query: 177 RTKELTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHN 236
R +E + +E + +G N R + KG G R R +G + KC+ C
Sbjct: 16 RERERERERERDAIHALDGKNGWRVELSHNSKGGG-GGRGGGRGRGGED---LKCYECGE 183
Query: 237 PGHFKKDCPERKGNGG 252
PGHF ++C R G+ G
Sbjct: 184 PGHFARECRSRGGSRG 231
>TC8302
Length = 494
Score = 37.4 bits (85), Expect = 0.010
Identities = 34/148 (22%), Positives = 59/148 (38%), Gaps = 4/148 (2%)
Frame = +3
Query: 101 QQLYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFE----NFK 156
++LY RM ES + + + N + L+ + + + ++A L +LP S++ N K
Sbjct: 3 KRLYTLRMSESTSMSDHIDHMNTLFAQLSASNFTIGENERAELLLESLPDSYDQLVINIK 182
Query: 157 DTMLYGK*GTITLEEVQAALRTKELTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRS 216
+ + + L EV ++ K E L ++R RS++R
Sbjct: 183 NNNIVNHLPLMMLSEVLLEEDSQR*NKEDR*DSSKPMEALTMTRGRSKSR---------- 332
Query: 217 KSRSKGDGNKTQYKCFICHNPGHFKKDC 244
K KC+ C H KKDC
Sbjct: 333 --------KKINLKCYHCGQR*HLKKDC 392
>AV773507
Length = 496
Score = 36.2 bits (82), Expect = 0.023
Identities = 23/66 (34%), Positives = 28/66 (41%), Gaps = 1/66 (1%)
Frame = +2
Query: 206 RGKGKGKNSRSKSRSKGDGNKT-QYKCFICHNPGHFKKDCPERKGNGGGNPSVQIASNEE 264
RG G R S G G++ Q CF C PGH +DCP G GG S+
Sbjct: 281 RGGGYSSGGRG---SYGAGDRVGQDDCFECGRPGHRARDCPLAGGGGGRGRGGGSLSSRP 451
Query: 265 GYESAG 270
Y +G
Sbjct: 452 RYGGSG 469
>TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
(58%)
Length = 881
Score = 35.8 bits (81), Expect = 0.030
Identities = 20/62 (32%), Positives = 31/62 (49%), Gaps = 7/62 (11%)
Frame = +2
Query: 191 DSGEGLNVSRERSQN---RGKGKGKNSRS-KSRSKGDGNKTQY---KCFICHNPGHFKKD 243
D + SR+R R +G+ ++ R + RS+G G +CF C PGHF ++
Sbjct: 278 DKAQPQGSSRDRDDGDRYRERGRDRDDRGDRDRSRGYGGSRGSNGGECFKCGKPGHFARE 457
Query: 244 CP 245
CP
Sbjct: 458 CP 463
>BP065763
Length = 537
Score = 34.3 bits (77), Expect = 0.087
Identities = 15/42 (35%), Positives = 25/42 (58%)
Frame = -1
Query: 207 GKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERK 248
G+ + K ++ +G +K ++ C+ C GHFKK CP+RK
Sbjct: 450 GRKEQKKDQALKVKEGRIHK-EHVCYFCKKAGHFKKYCPKRK 328
>BP042550
Length = 489
Score = 32.3 bits (72), Expect = 0.33
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Frame = +3
Query: 257 VQIASNEEGYESAGALTVTSWEPEKGWVLDSGCSYHICPRKE--YFEMLELEEGGVVCLG 314
VQ+ NE+ ++S G + + GCS+H+CP E + L + + C+
Sbjct: 306 VQMMKNEKLFQSQGGPIILA---------QVGCSFHLCPITEVLFKNCLHMNPSTIECMF 458
Query: 315 NNKAC 319
N++C
Sbjct: 459 GNESC 473
>AV420911
Length = 418
Score = 31.6 bits (70), Expect = 0.56
Identities = 17/51 (33%), Positives = 24/51 (46%)
Frame = +2
Query: 194 EGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDC 244
+G N R + KG G R R +G + KC+ C PGHF ++C
Sbjct: 275 DGKNGWRVELSHNSKGGG-GGRGGGRGRGGED---LKCYECGEPGHFAREC 415
>TC12416 similar to GB|AAH56066.1|33417132|BC056066 sop-prov protein
{Xenopus laevis;} , partial (32%)
Length = 416
Score = 30.4 bits (67), Expect = 1.3
Identities = 13/37 (35%), Positives = 22/37 (59%)
Frame = +3
Query: 193 GEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQY 229
G G R R + RG+G+G+ R + +GDG+K ++
Sbjct: 141 GRGRGEGRGRGEGRGRGRGR-GRGRRGRRGDGDKKEW 248
>TC11885 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (26%)
Length = 555
Score = 30.0 bits (66), Expect = 1.6
Identities = 20/67 (29%), Positives = 29/67 (42%), Gaps = 6/67 (8%)
Frame = +2
Query: 185 KELKVEDSGEGLNVSRERSQNRGKGKGKNSRSK------SRSKGDGNKTQYKCFICHNPG 238
KE K + S + + SR RS++ K ++ R R G C C PG
Sbjct: 188 KEEKRKMSSDSRSRSRSRSRSPMDRKIRSDRFSYRDAPYRRDSRRGFSRDNLCKNCKRPG 367
Query: 239 HFKKDCP 245
HF ++CP
Sbjct: 368 HFARECP 388
>TC18927 similar to PIR|AI2934|AI2934 chromate transport protein chrA
[imported] - Agrobacterium tumefaciens
(strain C58, Dupont) {Agrobacterium tumefaciens;},
partial (6%)
Length = 561
Score = 29.6 bits (65), Expect = 2.1
Identities = 23/79 (29%), Positives = 33/79 (41%), Gaps = 17/79 (21%)
Frame = -2
Query: 201 ERSQNRGKGKGKNSRSKSRSKGD-------GNKTQYKCFICHNPGHFKKDCP-------E 246
+R QNRG G S S G+ + ++ C C GHF CP +
Sbjct: 425 QRPQNRGTSSGY-----SHSFGNFVPRPTQSDTSEIVCHRCSKKGHFANRCPDLVCWNCQ 261
Query: 247 RKGNGG---GNPSVQIASN 262
+ G+ G NP V+ A+N
Sbjct: 260 KTGHSGKDCTNPKVEAATN 204
>TC11719
Length = 376
Score = 28.9 bits (63), Expect = 3.7
Identities = 15/52 (28%), Positives = 26/52 (49%), Gaps = 12/52 (23%)
Frame = +3
Query: 387 IHGLYILEGSTVIADASVASVDTLDV------------TKLWHLRLGHVSER 426
+H L S +++ +S AS ++ V ++LWH RLGH S++
Sbjct: 153 LHSASTLSNSALLSSSSPASALSVSVFSSNKCAQADIDSRLWHSRLGHASQK 308
>BP030995
Length = 458
Score = 28.9 bits (63), Expect = 3.7
Identities = 15/52 (28%), Positives = 26/52 (49%), Gaps = 12/52 (23%)
Frame = -3
Query: 387 IHGLYILEGSTVIADASVASVDTLDV------------TKLWHLRLGHVSER 426
+H L S +++ +S AS ++ V ++LWH RLGH S++
Sbjct: 222 LHSASTLSNSALLSSSSPASALSVSVFSSNKCAQADIDSRLWHSRLGHASQK 67
>BI420796
Length = 552
Score = 28.5 bits (62), Expect = 4.8
Identities = 29/113 (25%), Positives = 44/113 (38%), Gaps = 20/113 (17%)
Frame = -2
Query: 198 VSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFIC-------------HNPGH---FK 241
+SR N +G +S ++RS+G+G++ + I GH +
Sbjct: 512 LSRANFTNMKRGTRNSSTGEARSEGEGDRVEQPTPIVLGDTGRDGG*GESSEVGHDVVVE 333
Query: 242 KDCPERKGNGG----GNPSVQIASNEEGYESAGALTVTSWEPEKGWVLDSGCS 290
ER G G G+PS + EE E G++ S E V GCS
Sbjct: 332 GGARERVGGGAVGAEGDPSEEEEDGEEEREGVGSVEEGSEERGADPVEQGGCS 174
>BP078157
Length = 395
Score = 28.1 bits (61), Expect = 6.2
Identities = 18/67 (26%), Positives = 34/67 (49%), Gaps = 20/67 (29%)
Frame = -2
Query: 380 IIAKGSKIHGLYILEGSTVIADA------SVASVDT--------------LDVTKLWHLR 419
+I +G+ + LY+ E + + +A SV+SV + + + +LWH R
Sbjct: 313 MIGRGNLHNKLYLFEAQSQVFEANGSPQVSVSSVFSSILCVFCFH*FKHKIKLPRLWHSR 134
Query: 420 LGHVSER 426
LGH+S++
Sbjct: 133 LGHLSDK 113
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.344 0.151 0.505
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,031,056
Number of Sequences: 28460
Number of extensions: 204568
Number of successful extensions: 1905
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 1852
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1891
length of query: 866
length of database: 4,897,600
effective HSP length: 98
effective length of query: 768
effective length of database: 2,108,520
effective search space: 1619343360
effective search space used: 1619343360
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.6 bits)
S2: 59 (27.3 bits)
Medicago: description of AC144730.2