
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144431.4 - phase: 0 /pseudo
(1510 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
TC229976 similar to UP|DPOA_ORYSA (O48653) DNA polymerase alpha ...     291  1e-87
TC208900 UP|DPOD_SOYBN (O48901) DNA polymerase delta catalytic s...     209  6e-54
CD418256                                                                 83  8e-16
BI315914 similar to PIR|T09854|T098 proline-rich cell wall prote...      32  1.6
TC216431 homologue to UP|Q93XA7 (Q93XA7) NAC domain protein NAC1...      32  2.7
TC216430 homologue to UP|Q93XA7 (Q93XA7) NAC domain protein NAC1...      32  2.7
TC230062                                                                 31  4.7
BM527146 similar to GP|18491265|gb| At2g46160/T3F17.19 {Arabidop...      30  6.1
TC212017 similar to UP|Q950T7 (Q950T7) NADH dehydrogenase subuni...      30  6.1
>TC229976 similar to UP|DPOA_ORYSA (O48653) DNA polymerase alpha catalytic
subunit , partial (13%)
Length = 1061
Score = 291 bits (746), Expect(2) = 1e-87
Identities = 149/231 (64%), Positives = 169/231 (72%), Gaps = 35/231 (15%)
Frame = +3
Query: 1315 KFQHKSSEA-SDDPTSSLLFAGDDE----------------------------------E 1339
+F HKSSEA +DD S L F DDE E
Sbjct: 171 QFHHKSSEALNDDSASPLSFVADDEERYRGCEPLVLSCPSCSGTFDCPPVFKSICLLGSE 350
Query: 1340 SEKPTSSGTDESDYNFWRKLCCPKCFENGAGRISAAMIANQVKRQAEKFVLMYYRGLLMC 1399
++PTS +E++YNFWRKLCCPKC + +IS MIANQVKRQAE+F+LMYYRGLL+C
Sbjct: 351 RQRPTSVAPEEAEYNFWRKLCCPKCPD---VKISPVMIANQVKRQAERFILMYYRGLLVC 521
Query: 1400 DDETCKHTTRSVSFRLVGDSERGTVCPNYPRCNGHLNRKYTEADLYKQLSYFCHVFDTVC 1459
DDETCKHTTRS+S RLVGDSERGTVCPNYPRCNG L RKYTEADLYKQLSYFCHVFDTV
Sbjct: 522 DDETCKHTTRSISLRLVGDSERGTVCPNYPRCNGRLVRKYTEADLYKQLSYFCHVFDTVS 701
Query: 1460 YIEKMEAKSRIPIEKELIKIRPIVDLAASTIQKIRDRCAFGWVKLQDLVVA 1510
IEKMEAKSRIPIEKELIKIR ++ AAST Q+IRDRCAFGWVKL++LV++
Sbjct: 702 CIEKMEAKSRIPIEKELIKIRAVIKSAASTAQEIRDRCAFGWVKLENLVIS 854
Score = 52.0 bits (123), Expect(2) = 1e-87
Identities = 23/27 (85%), Positives = 25/27 (92%)
Frame = +2
Query: 1292 RLCASIQGTSPERLADCLGLDTSKFQH 1318
RLCA IQGTSPERLADCLGLD+SK +H
Sbjct: 2 RLCAPIQGTSPERLADCLGLDSSKVKH 82
>TC208900 UP|DPOD_SOYBN (O48901) DNA polymerase delta catalytic subunit ,
partial (90%)
Length = 3070
Score = 209 bits (533), Expect = 6e-54
Identities = 179/670 (26%), Positives = 315/670 (46%), Gaps = 27/670 (4%)
Frame = +2
Query: 663 SERALLNRLMLQLHKMDSDVLVGHNISGFDLDVLLHRSQACRVPSSMWSKLGRLNRSTMP 722
+ER +L + ++D D+++G+NI FDL L+ R+ ++ + LGR+ S +
Sbjct: 1121 TEREVLLAWRDFIREVDPDIIIGYNICKFDLPYLIERALNLKIAE--FPILGRIRNSRVR 1294
Query: 723 KLDRRGKTFGFGADPAIMSCVAGRLLCDTYLCSRDLLKEVSYSLTHLAKTQLNQSRKEVA 782
D + +G + V GR+ D + K SYSL ++ L++ +++V
Sbjct: 1295 VKDTTFSSRQYGTRESKEVAVEGRVTFDLLQVMQRDYKLSSYSLNSVSSHFLSEQKEDVH 1474
Query: 783 PHEVPKMFQT--AKSLMELIEYGETDAWLSMELMFYLSVLPLTRQLTNLSGNLWGKTLQG 840
H + Q A++ L Y DA+L L+ L + ++ ++G L
Sbjct: 1475 -HSIISDLQNGNAETRRRLAVYCLKDAYLPQRLLDKLMFIYNYVEMARVTGVPISFLLSR 1651
Query: 841 ARAQRVEYLLLHEFHKKKYIVPDKFSNYAKETKLTKRRVTHGVDDGNFDDADINDANYHN 900
++ +V LL +K ++P+ AK+ G + G F+
Sbjct: 1652 GQSIKVLSQLLRRARQKNLVIPN-----AKQA---------GSEQGTFE----------- 1756
Query: 901 DASESDHKKNKKAASYAGGLVLEPKKGLYDKYILLLDFNSLYPSIIQEYNICFTTVERSS 960
G VLE + G Y+K I LDF SLYPSI+ YN+C+ T+
Sbjct: 1757 -----------------GATVLEARAGFYEKPIATLDFASLYPSIMMAYNLCYCTLVIPE 1885
Query: 961 D--------DSFPRLPSSKT-------TGVLPELLKKLVKLRREKKTWMKTASG-LKRQQ 1004
D +S R PS +T G+LPE+L++L+ R+ K +K A L++
Sbjct: 1886 DARKLNIPPESVNRTPSGETFVKSNLQKGILPEILEELLTARKRAKADLKEAKDPLEKAV 2065
Query: 1005 LDIEQQALKLTANSMYGCLGFSNSRFYAKPLAELITLQGREILQSTVDLVQNNL------ 1058
LD Q ALK++ANS+YG G + + ++ +T GR++++ T +V++
Sbjct: 2066 LDGRQLALKISANSVYGFTGATIGQLPCLEISSSVTSYGRQMIEHTKKIVEDKFTTLNGY 2245
Query: 1059 --NLEVIYGDTDSIMIYSGLDDIAKATSISKKVIQEVNKKY-RCLEIDLDGLYKRMLLLK 1115
N EVIYGDTDS+M+ G+ + +A ++ ++ + ++ + + ++++ + +Y LL+
Sbjct: 2246 EHNAEVIYGDTDSVMVQFGVSAVEEAMNLGREAAEHISGTFTKPIKLEFEKVYYPYLLIS 2425
Query: 1116 KKKYAAVKVQFKDGTPYEVIERKGLDIVRRDWSLLAKDLGDFCLTQILSGGSCEDVVESI 1175
KK+YA + D ++ ++ KG++ VRRD LL K+L + CL +IL V+ +
Sbjct: 2426 KKRYAGLFWTKPDN--FDKMDTKGIETVRRDNCLLVKNLVNDCLHKILIDRDIPGAVQYV 2599
Query: 1176 HNSLMKVQEEMRNGQVALEKYVITKTLTKPPEAYPDAKNQPHVLVAQRLKQQGYTSGCSV 1235
N++ ++ ++ L VITK LTK + Y HV +A+R++++ + +V
Sbjct: 2600 KNAI----SDLLMNRMDLSLLVITKGLTKTGDDY--EVKAAHVELAERMRKRDAATAPNV 2761
Query: 1236 GDTIPYVICCEQGGSSGSATGIALRARHPDELKQEQGTWLIDIDYYLSQQIHPVISRLCA 1295
GD +PYVI +A G R D + + ID YYL QI I R+
Sbjct: 2762 GDRVPYVII-------KAAKGAKAYERSEDPIYVLENNIPIDPHYYLENQISKPILRIFE 2920
Query: 1296 SIQGTSPERL 1305
I + + L
Sbjct: 2921 PILKNASKEL 2950
>CD418256
Length = 529
Score = 83.2 bits (204), Expect = 8e-16
Identities = 41/46 (89%), Positives = 41/46 (89%)
Frame = -3
Query: 514 ALELFLIKRKIKGPSWLQVSNFSTCSASQRVSWCKFEVIVDSPKDI 559
ALELFLIKRKIKGPSWLQVSNFS SAS RVSWCKFEV VDSPK I
Sbjct: 479 ALELFLIKRKIKGPSWLQVSNFSPSSAS*RVSWCKFEVTVDSPKQI 342
Score = 72.4 bits (176), Expect = 1e-12
Identities = 31/45 (68%), Positives = 38/45 (83%)
Frame = -1
Query: 618 WKRPGMLTHFTVIRKLDGNIFPMGFNTEVTDRNIKAGSNVLCVES 662
W+R LT FT++RKLDG IFPMGF+ EVTDRN++AGSN+LC ES
Sbjct: 310 WRRTERLTRFTIVRKLDGIIFPMGFSKEVTDRNLQAGSNILCAES 176
>BI315914 similar to PIR|T09854|T098 proline-rich cell wall protein - upland
cotton, partial (37%)
Length = 246
Score = 32.3 bits (72), Expect = 1.6
Identities = 25/74 (33%), Positives = 31/74 (41%)
Frame = -1
Query: 263 GKLRGRNGGGRQ*MW*RGFS*RESGVCERRRNGSETCCEKGGFYFECES**RSGGSEVVC 322
G+ RGR+G GR+ RG R RRR G +GG + R G
Sbjct: 189 GRGRGRSGRGRRRRRRRGGGWRR----RRRRGGGWRGRRRGGGW-------RRGRRRRAE 43
Query: 323 YGWMAGG*EWWRWR 336
+GW G WWR R
Sbjct: 42 WGWRGSGSWWWRLR 1
>TC216431 homologue to UP|Q93XA7 (Q93XA7) NAC domain protein NAC1, partial
(25%)
Length = 471
Score = 31.6 bits (70), Expect = 2.7
Identities = 19/58 (32%), Positives = 26/58 (44%)
Frame = +2
Query: 722 PKLDRRGKTFGFGADPAIMSCVAGRLLCDTYLCSRDLLKEVSYSLTHLAKTQLNQSRK 779
PKL T +G P + C+ D C R +LK V +T + + LNQS K
Sbjct: 11 PKLPANHATNAYGGAPNLGYCL------DPLSCDRKMLKAVLNQITKMERNPLNQSLK 166
>TC216430 homologue to UP|Q93XA7 (Q93XA7) NAC domain protein NAC1, complete
Length = 1289
Score = 31.6 bits (70), Expect = 2.7
Identities = 19/58 (32%), Positives = 26/58 (44%)
Frame = +3
Query: 722 PKLDRRGKTFGFGADPAIMSCVAGRLLCDTYLCSRDLLKEVSYSLTHLAKTQLNQSRK 779
PKL T +G P + C+ D C R +LK V +T + + LNQS K
Sbjct: 786 PKLPANHATNAYGGAPNLGYCL------DPLSCDRKMLKAVLNQITKMERNPLNQSLK 941
>TC230062
Length = 1000
Score = 30.8 bits (68), Expect = 4.7
Identities = 10/18 (55%), Positives = 13/18 (71%)
Frame = +2
Query: 323 YGWMAGG*EWWRWRWGGK 340
YG + G EWW W+WGG+
Sbjct: 605 YGTLHG--EWWEWKWGGE 652
>BM527146 similar to GP|18491265|gb| At2g46160/T3F17.19 {Arabidopsis
thaliana}, partial (36%)
Length = 422
Score = 30.4 bits (67), Expect = 6.1
Identities = 28/80 (35%), Positives = 31/80 (38%), Gaps = 2/80 (2%)
Frame = -1
Query: 263 GKLRGRNGGGRQ*MW*RGFS*RESGVCERR--RNGSETCCEKGGFYFECES**RSGGSEV 320
G+ R R+GG R R R G C RR G CC GG C S R G
Sbjct: 317 GR*RRRSGGWRHRRRPRRRRSRAGGRCHRRCCYGGCGGCCCGGG---GCGSRGRRRG*-- 153
Query: 321 VCYGWMAGG*EWWRWRWGGK 340
GW G RWRW +
Sbjct: 152 ---GWRGGRGRRGRWRWSSR 102
>TC212017 similar to UP|Q950T7 (Q950T7) NADH dehydrogenase subunit 4 ,
partial (4%)
Length = 1011
Score = 30.4 bits (67), Expect = 6.1
Identities = 28/111 (25%), Positives = 42/111 (37%), Gaps = 11/111 (9%)
Frame = +1
Query: 884 DDGNFDDADINDANYHNDASESDHKKNKKAASYAGGLVLEPKKGLYDKYIL--LLDF--- 938
+DG D A + NY + NK A S G ++ + ++ L+D
Sbjct: 373 EDGKIDQARDSQVNYDAPNGATPISPNKIAVSPGSGSTAAIPSYIFAEKLVPVLVDLFLQ 552
Query: 939 ------NSLYPSIIQEYNICFTTVERSSDDSFPRLPSSKTTGVLPELLKKL 983
+YP IIQ C TT + D++ RL VL + KL
Sbjct: 553 APAVEKYIIYPEIIQSLGRCMTTRRDNPDNALWRLAVEAFNRVLVHYVTKL 705
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda     K        H
0.337      0.147    0.483
Gapped
Lambda     K        H
0.267      0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 68,667,101
Number of Sequences: 63676
Number of extensions: 1010799
Number of successful extensions: 9197
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 8764
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 9164
length of query: 1510
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1400
effective length of database: 5,635,272
effective search space: 7889380800
effective search space used: 7889380800
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 65 (29.6 bits)
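The statistics in this summary can be cross-checked with the standard Karlin-Altschul relations. The sketch below is illustrative only and is not part of the BLAST output: the function names are hypothetical, and the effective-length formulas are an assumption chosen because they reproduce the figures printed above. It converts a raw score to bits as (Lambda * S - ln K) / ln 2, derives the effective search space from the reported lengths, and estimates an E-value as search space times 2^(-bits). Hits reported with Expect(2) combine several HSPs and are not covered by this single-HSP estimate.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Assumed length correction: subtract the effective HSP length from the
    query and from every database sequence, then multiply the two lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query * eff_db


def expect(bits, search_space):
    """Single-HSP E-value estimate: search space * 2^(-bit score)."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report: query length 1510, "length of database"
# 12,639,632, 63,676 sequences, effective HSP length 110.
space = effective_search_space(1510, 12_639_632, 63_676, 110)
print(space)  # 7889380800, the "effective search space" line

# S2 = 65 with the gapped Lambda and K; S1 = 39 with the ungapped pair.
print(round(bit_score(65, 0.267, 0.0410), 1))
print(round(bit_score(39, 0.337, 0.147), 1))

# Rough E-value for a 30.4-bit hit; the printed bit score is rounded,
# so only approximate agreement with the report should be expected.
print(expect(30.4, space))
```

Because Lambda and K are printed to three significant figures, bit scores recomputed this way for high-scoring hits can differ from the reported values by about one bit.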