
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144345.7 - phase: 0 /pseudo
(366 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC211561 weakly similar to UP|O22675 (O22675) Reverse transcript... 106 1e-23
TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {A... 102 2e-22
CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polym... 102 3e-22
CO982349 96 3e-20
BM891241 77 4e-18
BI972659 85 4e-17
CF922212 78 7e-15
BM891637 67 1e-11
AI496436 64 2e-11
BU090318 weakly similar to GP|9743345|gb| F21J9.30 {Arabidopsis ... 62 5e-10
BE554941 50 2e-06
BI427218 47 1e-05
BI698930 44 1e-04
CD417230 weakly similar to GP|2244915|emb| reverse transcriptase... 39 0.004
TC216740 similar to UP|O60585 (O60585) Ser/Arg-related nuclear m... 37 0.014
BQ297454 35 0.041
BF071275 similar to PIR|S14981|S149 extensin class I (clone w1-8... 35 0.070
CF920672 35 0.070
TC234877 33 0.15
TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR ret... 33 0.26
>TC211561 weakly similar to UP|O22675 (O22675) Reverse transcriptase
(Fragment) , partial (59%)
Length = 1077
Score = 106 bits (265), Expect = 1e-23
Identities = 44/101 (43%), Positives = 67/101 (65%)
Frame = +1
Query: 1 LPIGGDPRKLAFWYPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPS 60
+PIG + R+ W ++ + +R+LS WK R++S GGR+ L+K VL+SIP+YF SFF+ P
Sbjct: 607 IPIGANSRRCHMWDSIIRKCERKLSKWK*RHISFGGRVTLIKSVLTSIPIYFFSFFRVPQ 786
Query: 61 GIISTLESLFNAFFWGGCEDIRKITWIKWETVCSRKGEGGL 101
++ L + F WGG D +KI+WI+WE +C K GG+
Sbjct: 787 SVVDKLVKI*RTFLWGGGPDQKKISWIRWEKLCLPKERGGI 909
>TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {Arabidopsis
thaliana;} , partial (18%)
Length = 759
Score = 102 bits (255), Expect = 2e-22
Identities = 44/99 (44%), Positives = 62/99 (62%)
Frame = +2
Query: 34 MGGRLILLKPVLSSIPVYFLSFFKAPSGIISTLESLFNAFFWGGCEDIRKITWIKWETVC 93
MGGR+ L+K VLS++P+ LSFFK P I+ L +L F WGG + +I W+KW +C
Sbjct: 2 MGGRVTLIKSVLSALPIXXLSFFKIPQRIVDKLVTLQRQFLWGGTQHHNRIPWVKWADIC 181
Query: 94 SRKGEGGLGVRRLREFNLALLGKWWWRILQERGTLWYRV 132
+ K +GGLG++ L FN AL G+W W + LW R+
Sbjct: 182 NPKIDGGLGIKDLSNFNAALRGRWIWGLASNHNQLWARL 298
>CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polymerase
homolog T24M8.8 - Arabidopsis thaliana, partial (4%)
Length = 456
Score = 102 bits (254), Expect = 3e-22
Identities = 58/147 (39%), Positives = 75/147 (50%), Gaps = 5/147 (3%)
Frame = +2
Query: 54 SFFKAPSGIISTLESLFNAFFWGGCEDIRKITWIKWETVCSRKGEGGLGVRRLREFNLAL 113
SFF P II+ LESL F WGG D RKI W+ W+TVC K +GGLG++ LR FN L
Sbjct: 5 SFFSLP*KIIAKLESLQRRFLWGGEADSRKIAWVNWKTVCLPKAKGGLGIKDLRTFNTTL 184
Query: 114 LGKWWWRILQERGTLWYRVLCARYG-----EEGGRLCCSCRDGGSVWWQTILDVRDGVGQ 168
LGKW W + + W +VL ++YG EEG S S WW+ ++ +
Sbjct: 185 LGKWRWDLFYIQQEPWAKVLQSKYGGWRALEEG-----SSGSKDSAWWKDLIKTQQLQRN 349
Query: 169 IDSGWMLDNISRRVGNGLSTLFWVDPW 195
I + +VG G FW D W
Sbjct: 350 IP---LKRETIWKVGGGDRIKFWEDLW 421
>CO982349
Length = 795
Score = 95.5 bits (236), Expect = 3e-20
Identities = 38/93 (40%), Positives = 63/93 (66%)
Frame = +3
Query: 1 LPIGGDPRKLAFWYPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPS 60
+PI +PR P++ + + +L+ WK R++S+GGR+ L+ +L+++P+YF SFF+APS
Sbjct: 510 IPIEANPRCREI*DPIIRKCETKLARWKQRHISLGGRVTLINAILTALPIYFFSFFRAPS 689
Query: 61 GIISTLESLFNAFFWGGCEDIRKITWIKWETVC 93
++ L ++ F WGG R+I W+KWETVC
Sbjct: 690 KVVQKLVNIQRKFLWGGGSXQRRIAWVKWETVC 788
>BM891241
Length = 407
Score = 77.4 bits (189), Expect(2) = 4e-18
Identities = 30/63 (47%), Positives = 45/63 (70%)
Frame = -2
Query: 76 GGCEDIRKITWIKWETVCSRKGEGGLGVRRLREFNLALLGKWWWRILQERGTLWYRVLCA 135
GG D +KI W+KWE VC K +GGLG++ + +FN+ALLGKW W + ++ LW R++ +
Sbjct: 406 GGDHDHKKIPWVKWEVVCLPKTKGGLGIKDISKFNIALLGKWIWALASDQQQLWARIINS 227
Query: 136 RYG 138
+YG
Sbjct: 226 KYG 218
Score = 31.6 bits (70), Expect(2) = 4e-18
Identities = 18/50 (36%), Positives = 23/50 (46%)
Frame = -3
Query: 149 RDGGSVWWQTILDVRDGVGQIDSGWMLDNISRRVGNGLSTLFWVDPWLEE 198
+ G S WW+ D+R Q D +S +VG G FW D WL E
Sbjct: 186 KGGYSYWWR---DLRKFYHQSDHSIFHQYMSWKVGCGDKINFWTDKWLGE 46
>BI972659
Length = 453
Score = 85.1 bits (209), Expect = 4e-17
Identities = 37/87 (42%), Positives = 54/87 (61%)
Frame = +1
Query: 1 LPIGGDPRKLAFWYPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPS 60
+PI W PL+++ K +LS W + LSMGGR+ L+K VL+++P+Y LSFFK P
Sbjct: 193 MPIAVKASSRMVWEPLINKFKAKLSKWNQKYLSMGGRVTLIKSVLNALPIYLLSFFKIPQ 372
Query: 61 GIISTLESLFNAFFWGGCEDIRKITWI 87
I+ L SL F WGG + +I+W+
Sbjct: 373 RIVDKLVSLQRTFMWGGNQHHNRISWV 453
>CF922212
Length = 445
Score = 77.8 bits (190), Expect = 7e-15
Identities = 49/143 (34%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Frame = -2
Query: 59 PSGIISTLESLFNAFFWGGCEDIRKITWIKWETVCSRKGEGGLGVRRLREFNLALLGKWW 118
P+ ++ L L F WGG +KI W+ W++VC K + GLG R LR+FN ALLGK
Sbjct: 438 PNKVLDKLVRLQQWFLWGGGLGQKKIAWVNWDSVCLPKEKEGLGNRDLRKFNYALLGKRR 259
Query: 119 WRILQERGTLWYRVLCARYG-----EEGGRLCCSCRDGGSVWWQTILDVRDGVGQIDSGW 173
W + +G L RVL ++Y +E R+ S WWQ I + D W
Sbjct: 258 WNLFHHQGELGARVLDSKYKRWRNLDEERRV-----KSESFWWQEISFITHSTE--DGSW 100
Query: 174 MLDNI-SRRVGNGLSTLFWVDPW 195
+ R+G FW D W
Sbjct: 99 FEKRTKTGRLGCRAKVRFWEDGW 31
>BM891637
Length = 427
Score = 67.0 bits (162), Expect = 1e-11
Identities = 35/117 (29%), Positives = 55/117 (46%)
Frame = -1
Query: 82 RKITWIKWETVCSRKGEGGLGVRRLREFNLALLGKWWWRILQERGTLWYRVLCARYGEEG 141
+KI WI W C+ GGLG++ ++ N ALL KW W + + LW R+L ++Y
Sbjct: 418 KKIAWISWRQCCASGDVGGLGIKDIKILNNALLIKWKWLMFHQPHQLWNRILISKYKGWR 239
Query: 142 GRLCCSCRDGGSVWWQTILDVRDGVGQIDSGWMLDNISRRVGNGLSTLFWVDPWLEE 198
G + S WW + + I + + +VG G LFW D W+++
Sbjct: 238 GLDQGPQKYYFSPWWADLRAINQHQSMIAAS---NQFCWKVGRGDQILFWEDSWVDD 77
>AI496436
Length = 414
Score = 64.3 bits (155), Expect(2) = 2e-11
Identities = 25/63 (39%), Positives = 43/63 (67%)
Frame = +1
Query: 1 LPIGGDPRKLAFWYPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPS 60
+PIG + R W P+V + +R+L+ WK + +S GGR+ L+ VL+++P+Y LSFF+ P
Sbjct: 154 IPIGANSRHSDVWEPVVRKCERKLASWKQKYVSFGGRVTLINSVLTALPIYLLSFFRIPD 333
Query: 61 GII 63
++
Sbjct: 334 KVV 342
Score = 22.3 bits (46), Expect(2) = 2e-11
Identities = 8/14 (57%), Positives = 9/14 (64%)
Frame = +2
Query: 73 FFWGGCEDIRKITW 86
F WGG + RKI W
Sbjct: 371 FLWGGDLE*RKIAW 412
>BU090318 weakly similar to GP|9743345|gb| F21J9.30 {Arabidopsis thaliana},
partial (1%)
Length = 422
Score = 61.6 bits (148), Expect = 5e-10
Identities = 42/129 (32%), Positives = 67/129 (51%), Gaps = 3/129 (2%)
Frame = +3
Query: 83 KITWIKWETVCSRKGEGGLGVRRLREFNLALLGKWWWRILQERGTLWYRVLCARYGEEGG 142
K+ W W+ +CS K GGLG+R L N +LL K + + L RV+ ++Y + G
Sbjct: 3 KVHWTSWKALCSPKKSGGLGLRLLSHMNKSLLMKST*NMCTKPNDL*VRVVRSKY-K*GK 179
Query: 143 RLC--CSCRDGGSVWWQTILDVRDGVGQIDSGWMLDNISRRVGNGLSTLFWVDPW-LEEK 199
+ + G+ +WQ I+ + + DNI+R++GNG S FW D W L+E
Sbjct: 180 DILP*IDPKKTGTNFWQGIVHCWEKITH-------DNIARKIGNGRSVHFWKDVWVLQEG 338
Query: 200 PLS*RRINY 208
L +R+N+
Sbjct: 339 KLIDQRLNF 365
>BE554941
Length = 383
Score = 49.7 bits (117), Expect = 2e-06
Identities = 34/100 (34%), Positives = 45/100 (45%)
Frame = +1
Query: 102 GVRRLREFNLALLGKWWWRILQERGTLWYRVLCARYGEEGGRLCCSCRDGGSVWWQTILD 161
G++ L FN++LL K L E +W +L YG S S WW+ IL
Sbjct: 7 GIKNLSIFNISLLVKRR*HFLVETKAIWAPILKFHYGSLDLSNTGSA-SRTSQWWRDIL- 180
Query: 162 VRDGVGQIDSGWMLDNISRRVGNGLSTLFWVDPWLEEKPL 201
R G Q GW+ R++G+ TLFW L E PL
Sbjct: 181 -RIGNCQYQPGWLTSTFCRKIGDRKDTLFWRHRCLNETPL 297
>BI427218
Length = 421
Score = 47.4 bits (111), Expect = 1e-05
Identities = 27/85 (31%), Positives = 40/85 (46%)
Frame = +1
Query: 115 GKWWWRILQERGTLWYRVLCARYGEEGGRLCCSCRDGGSVWWQTILDVRDGVGQIDSGWM 174
GKW W +G LW R+L ++YG+ + + S WW+ I ++ D G D
Sbjct: 1 GKWKWSFFYNKGELWARILDSKYGDWRHLEEPTRGNRASAWWKDI-NIIDPSG-ADGSLF 174
Query: 175 LDNISRRVGNGLSTLFWVDPWLEEK 199
I +V G T FW D W E++
Sbjct: 175 DKGIKWKVACGEKTKFWEDGWKEDR 249
>BI698930
Length = 384
Score = 43.9 bits (102), Expect = 1e-04
Identities = 24/69 (34%), Positives = 35/69 (49%)
Frame = +3
Query: 24 LSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPSGIISTLESLFNAFFWGGCEDIRK 83
L+ WK LS GR L+ +LSS+P++ SFFK P + S+ F G +
Sbjct: 81 LALWKHEVLSFAGRTCLINSMLSSLPLFSFSFFKVPLNVGMRFISI*RNFV-GEWRES*M 257
Query: 84 ITWIKWETV 92
I W+ W+ V
Sbjct: 258 IPWVNWDKV 284
>CD417230 weakly similar to GP|2244915|emb| reverse transcriptase like
protein {Arabidopsis thaliana}, partial (7%)
Length = 688
Score = 38.9 bits (89), Expect = 0.004
Identities = 17/40 (42%), Positives = 27/40 (67%)
Frame = -2
Query: 16 LVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSF 55
++D++ +R S WKS LSM GRL L K V+ ++P + + F
Sbjct: 189 IIDKVNKRFSRWKSNLLSMVGRLTLTK*VV*ALPSHIMQF 70
>TC216740 similar to UP|O60585 (O60585) Ser/Arg-related nuclear matrix
protein, partial (3%)
Length = 800
Score = 37.0 bits (84), Expect = 0.014
Identities = 15/34 (44%), Positives = 19/34 (55%)
Frame = -2
Query: 93 CSRKGEGGLGVRRLREFNLALLGKWWWRILQERG 126
C R+G GG+G RR + WWWR L+ RG
Sbjct: 154 CERRGRGGVGFRRREGWRACRRSSWWWR-LRRRG 56
>BQ297454
Length = 135
Score = 35.4 bits (80), Expect = 0.041
Identities = 14/36 (38%), Positives = 20/36 (54%)
Frame = -1
Query: 45 LSSIPVYFLSFFKAPSGIISTLESLFNAFFWGGCED 80
L+SI +YF SFF+ P ++ L + F W G D
Sbjct: 108 LTSISIYFFSFFRVPKKVVDKLARIQRKFLWEGALD 1
>BF071275 similar to PIR|S14981|S149 extensin class I (clone w1-8 L) - tomato
(fragment), partial (10%)
Length = 389
Score = 34.7 bits (78), Expect = 0.070
Identities = 19/88 (21%), Positives = 36/88 (40%)
Frame = -3
Query: 12 FWYPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPSGIISTLESLFN 71
+W+ + RI+RR +K R S G +++ + IPV PS I+ + +
Sbjct: 270 WWWKQLSRIRRRWRPYKPRTKSRGHKVVSTIKIPVIIPVLISRVLITPSPIVIMMVVTIS 91
Query: 72 AFFWGGCEDIRKITWIKWETVCSRKGEG 99
W + K ++W + + G
Sbjct: 90 TLMWTTTIPVPKRISMRWVIIRQKSSSG 7
>CF920672
Length = 792
Score = 34.7 bits (78), Expect = 0.070
Identities = 17/54 (31%), Positives = 28/54 (51%)
Frame = -2
Query: 59 PSGIISTLESLFNAFFWGGCEDIRKITWIKWETVCSRKGEGGLGVRRLREFNLA 112
P+ + L + F WGG D +KI W+KW+ C + L V+ R +N++
Sbjct: 791 PNRVAEKLTQIQRRFXWGGGLDQKKIAWVKWD--C*LMEQEPLRVKYPRLYNIS 636
>TC234877
Length = 534
Score = 33.5 bits (75), Expect = 0.15
Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Frame = +1
Query: 15 PLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFFKAPSGIISTLESLFNAFF 74
P+ DRIK L WK LS G + L+ ++ + + + P ++ +++ F
Sbjct: 100 PIADRIKAELQSWKGNLLSFMGCVQLIISIIDGMLL*SFHLYSWPKALLIEMQTWTRNFI 279
Query: 75 W--GGCEDIRKITWIK 88
W G ++ W+K
Sbjct: 280 WTRGYFKEKTGYCWLK 327
>TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR retroelement
reverse transcriptase, partial (7%)
Length = 956
Score = 32.7 bits (73), Expect = 0.26
Identities = 15/43 (34%), Positives = 26/43 (59%)
Frame = -2
Query: 14 YPLVDRIKRRLSDWKSRNLSMGGRLILLKPVLSSIPVYFLSFF 56
Y + D+IK +L+ WK LS+ GR+ L+ V+ + +Y S +
Sbjct: 187 YCVADKIKCKLASWKGSILSIMGRIQLVNSVIHGMLLYSFSVY 59
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.350 0.159 0.603
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,057,748
Number of Sequences: 63676
Number of extensions: 371392
Number of successful extensions: 4578
Number of sequences better than 10.0: 62
Number of HSP's better than 10.0 without gapping: 4373
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4552
length of query: 366
length of database: 12,639,632
effective HSP length: 99
effective length of query: 267
effective length of database: 6,335,708
effective search space: 1691634036
effective search space used: 1691634036
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.8 bits)
S2: 59 (27.3 bits)
Medicago: description of AC144345.7