
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148758.4 - phase: 0 /pseudo
(618 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC86452 similar to PIR|T09892|T09892 hypothetical protein T22A6.... 1047 0.0
TC79643 similar to PIR|C86410|C86410 protein F3M18.18 [imported]... 313 e-138
TC92523 similar to PIR|T09892|T09892 hypothetical protein T22A6.... 271 e-107
TC79271 weakly similar to GP|20466444|gb|AAM20539.1 unknown prot... 305 3e-83
TC90348 similar to GP|15810435|gb|AAL07105.1 unknown protein {Ar... 224 1e-58
CB893903 weakly similar to GP|20466444|gb unknown protein {Arabi... 209 3e-54
TC84610 similar to GP|20466444|gb|AAM20539.1 unknown protein {Ar... 108 7e-45
BQ139188 weakly similar to PIR|C86410|C864 protein F3M18.18 [imp... 170 2e-42
BG648302 similar to GP|22202725|dbj contains ESTs AU032617(S1163... 159 3e-39
TC91102 homologue to PIR|C86420|C86420 unknown protein 124288-1... 90 2e-35
CA991122 similar to PIR|C86420|C86 unknown protein 124288-12173... 118 2e-32
AW686086 similar to GP|20466444|gb| unknown protein {Arabidopsis... 84 2e-32
TC90560 weakly similar to GP|20466444|gb|AAM20539.1 unknown prot... 130 2e-30
BF633115 weakly similar to GP|15809820|gb At1g29690/F15D2_24 {Ar... 127 1e-29
BE321769 weakly similar to PIR|C86410|C864 protein F3M18.18 [imp... 115 4e-26
TC91124 weakly similar to GP|15809820|gb|AAL06838.1 At1g29690/F1... 101 1e-21
TC92134 similar to GP|14209545|dbj|BAB56041. contains ESTs C7286... 101 1e-21
BQ139397 weakly similar to PIR|T09892|T098 hypothetical protein ... 68 9e-12
BQ143931 similar to PIR|T02229|T022 protein BYJ15 - common tobac... 30 2.7
TC86275 similar to PIR|E96603|E96603 unknown protein F14G9.26 [i... 29 6.1
>TC86452 similar to PIR|T09892|T09892 hypothetical protein T22A6.120 -
Arabidopsis thaliana, partial (80%)
Length = 2233
Score = 1047 bits (2708), Expect = 0.0
Identities = 544/624 (87%), Positives = 556/624 (88%), Gaps = 6/624 (0%)
Frame = +3
Query: 1 MSSKKKVLSVEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNV 60
MSSKKKVLSVEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNV
Sbjct: 171 MSSKKKVLSVEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNV 350
Query: 61 PKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANT 120
PKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANT
Sbjct: 351 PKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANT 530
Query: 121 KSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGG 180
KSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGG
Sbjct: 531 KSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGG 710
Query: 181 TDIIYAKQQYSSPLQPSDVQKKLKDMADELFRGQAGQNNANDGTFNSKEKFMRDNGLGFL 240
TDIIYAKQQYSSPLQPSDVQKKLKDMADELFRGQAGQNNANDGTFNSKEKFMRDNGLGFL
Sbjct: 711 TDIIYAKQQYSSPLQPSDVQKKLKDMADELFRGQAGQNNANDGTFNSKEKFMRDNGLGFL 890
Query: 241 DIQAQSYRETEKNTTNVLAIYP------LLVKHSRCRI*SLCARGKVEMGNKISATMSGA 294
DIQAQSYRETE + L + C+ ++ ++ V + I T +
Sbjct: 891 DIQAQSYRETEVQDIKFMCKRKGGNGKQNLSHNEWCQ--TVLSQPDVISMSFIPIT---S 1055
Query: 295 KLFCLNLM*YQCHSYQLHLYLVE*MGVDI*LMP*ISIYDTADKPAIEELHQFLEFQLPRQ 354
L +N Y H+ L+L KPAIEELHQFLEFQLPRQ
Sbjct: 1056 LLGGINGSGYLTHAINLYLRY---------------------KPAIEELHQFLEFQLPRQ 1172
Query: 355 WAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVGMKPVTGLRLYLEGKKSNC 414
WAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVGMKPVTGLRLYLEGKKSNC
Sbjct: 1173 WAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVGMKPVTGLRLYLEGKKSNC 1352
Query: 415 LAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKSFSHICTAPVESYDDNAVV 474
LAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKSFSHICTAPVESYDDNAVV
Sbjct: 1353 LAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKSFSHICTAPVESYDDNAVV 1532
Query: 475 TGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGP 534
TGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGP
Sbjct: 1533 TGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGP 1712
Query: 535 QKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGARLYV 594
QKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGARLYV
Sbjct: 1713 QKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGARLYV 1892
Query: 595 EKGKISLKVKYSLLTVILQDEETE 618
EKGKISLKVKYSLLTVILQDEETE
Sbjct: 1893 EKGKISLKVKYSLLTVILQDEETE 1964
>TC79643 similar to PIR|C86410|C86410 protein F3M18.18 [imported] -
Arabidopsis thaliana, partial (74%)
Length = 2150
Score = 313 bits (803), Expect(2) = e-138
Identities = 192/455 (42%), Positives = 261/455 (57%), Gaps = 22/455 (4%)
Frame = +1
Query: 182 DIIYAKQQYSSPLQPSDVQKKLKDMADELFRGQAGQN-NANDGTFNSKEKFMRDNGLGFL 240
D+++ KQ SS + P+++QK LK +ADE F + Q+ N N + K L
Sbjct: 673 DVVHIKQSKSSDIPPTELQKLLKQLADERFSAVSNQSSNVNPAAISGK-----------L 819
Query: 241 DIQAQSYRETEKNTTNVLAIYPLLVKHSRCRI*SLCARGKVEMGNKISATMSGAKLFCLN 300
R +N L P++ HS K + IS G +F
Sbjct: 820 KDDHTKLRGLHRNKPPSLVGRPIVKSHS-----------KNDDIVSISVRRGGIDVF--- 957
Query: 301 LM*YQCHSYQLHLYLVE*MGVDI*LMP*ISIYDTAD---------------KPAIEELHQ 345
Q ++ L + + L+P S+ ++ KPAIEELHQ
Sbjct: 958 ----QPYNQWLSTISQSPNVISMSLVPITSLLNSVPGNGFLSHAVNLYLRYKPAIEELHQ 1125
Query: 346 FLEFQLPRQWAPVFGELALGPDRK-KSQSSSSLQFSFMGPKLYVNTSPVVVGMKPVTGLR 404
FLEFQLPRQWAP++G+L L D K K +S SLQF+ MGPKLYVNT V G +PVTG+R
Sbjct: 1126 FLEFQLPRQWAPMYGDLPLVFDHKYKRNASPSLQFTLMGPKLYVNTVKVDSGNRPVTGIR 1305
Query: 405 LYLEGKKSNCLAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKSFSHICTAP 464
LYLEGKK+N L+IHLQHLS +P ++ ++ + + D +R YYE V+W FSH+ TAP
Sbjct: 1306 LYLEGKKNNHLSIHLQHLSEVPGVLEISEDHSYDPIDEPDDRGYYEPVKWSMFSHVYTAP 1485
Query: 465 VE-----SYDDNAVVTGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQK 519
V+ + ++VT A FEV G+KKVLFLRL + VA A ++R EWDG ++K
Sbjct: 1486 VQYNSSRMDESTSIVTKAWFEVKLMGMKKVLFLRLGYSTVASA-KIRRSEWDGPSTSSRK 1662
Query: 520 SGMISTFISTRFSGPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQ 579
SG S +S + S P Q S V++NSA+Y GGPPVP +APK+L FVDT EM RGP+
Sbjct: 1663 SGFFSALMSAKLSQGLHSPEKQ-SKVDINSAIYHGGPPVPTRAPKMLNFVDTKEMVRGPE 1839
Query: 580 DLPGYWVVSGARLYVEKGKISLKVKYSLLTVILQD 614
D PGYWVV+GA+L VE G+IS+K KYSLLT++ ++
Sbjct: 1840 DPPGYWVVTGAKLCVEGGRISIKAKYSLLTILSEE 1944
Score = 198 bits (504), Expect(2) = e-138
Identities = 102/175 (58%), Positives = 126/175 (71%), Gaps = 2/175 (1%)
Frame = +3
Query: 8 LSVEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHD--NLRTVELPGRVSIPNVPKSIN 65
L+ E + +G GY+L ND+R CK S+LI ID+ N R + P V +PNVP SI
Sbjct: 150 LAAEKAVSVIGQGYNLCNDIRFSACK--SRLIHIDNSSSNTRDLVFPSGVVVPNVPLSIK 323
Query: 66 CDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANTKSLAF 125
DKGD R SDVL+F QMSE FN+++SLSGKIP+G FNS F W RDAA+TKSLAF
Sbjct: 324 SDKGDCTRFRSDVLTFIQMSEHFNRQLSLSGKIPSGQFNSMFDMKKCWSRDAASTKSLAF 503
Query: 126 DGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGG 180
DG ITLY++ LD+T++ LS+ VK+ VP SW+PAALA FIEKYGTH VVGVK+GG
Sbjct: 504 DGWFITLYSVELDRTNITLSETVKKDVPCSWNPAALAEFIEKYGTHVVVGVKMGG 668
>TC92523 similar to PIR|T09892|T09892 hypothetical protein T22A6.120 -
Arabidopsis thaliana, partial (45%)
Length = 1327
Score = 271 bits (694), Expect(2) = e-107
Identities = 134/197 (68%), Positives = 158/197 (80%)
Frame = +1
Query: 337 KPAIEELHQFLEFQLPRQWAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVG 396
KP IEELHQFLEFQLPRQWAPVF +L LGP K+ +SS+SLQFSFMGP+LYVNT PV VG
Sbjct: 1 KPPIEELHQFLEFQLPRQWAPVFSDLPLGPQWKQ-RSSASLQFSFMGPRLYVNTIPVDVG 177
Query: 397 MKPVTGLRLYLEGKKSNCLAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKS 456
+PVTGLRLYLEGKKSN LAIH+QHLSSLPK FQL+D++N N S ++++YEKVQWK+
Sbjct: 178 KRPVTGLRLYLEGKKSNRLAIHMQHLSSLPKIFQLEDDSNENFRRKSYDKRFYEKVQWKN 357
Query: 457 FSHICTAPVESYDDNAVVTGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGL 516
FSH+CTAPVES ++ +VVTGA +V G K +LFLRL F V A V+ PEWDGSPGL
Sbjct: 358 FSHVCTAPVESEEELSVVTGAQLQVENYGFKNILFLRLKFSTVLGAKEVKHPEWDGSPGL 537
Query: 517 TQKSGMISTFISTRFSG 533
KSG+IST IS F+G
Sbjct: 538 GPKSGLISTLISQHFTG 588
Score = 135 bits (341), Expect(2) = e-107
Identities = 62/77 (80%), Positives = 72/77 (92%)
Frame = +3
Query: 542 PSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGARLYVEKGKISL 601
P+DVN+NSA+YPGGPPVP QAPKLLKFVDTTEMTRGPQ+ PGYWVV+GARL VEKGKISL
Sbjct: 615 PADVNINSAVYPGGPPVPVQAPKLLKFVDTTEMTRGPQETPGYWVVTGARLLVEKGKISL 794
Query: 602 KVKYSLLTVILQDEETE 618
+VKYSLLT+IL D++ +
Sbjct: 795 RVKYSLLTMILPDDDDD 845
>TC79271 weakly similar to GP|20466444|gb|AAM20539.1 unknown protein
{Arabidopsis thaliana}, partial (26%)
Length = 1719
Score = 305 bits (782), Expect = 3e-83
Identities = 195/509 (38%), Positives = 274/509 (53%), Gaps = 45/509 (8%)
Frame = +2
Query: 14 IKSVGLGYDLTNDLRLKFCKY----------DSKLIAIDHDNLRTVELPG---RVSIPNV 60
++S+G G+DL +D RL+F K +L+ +D N R + +PG I V
Sbjct: 293 LESLGKGFDLASDFRLRFAKGIRGGGNSNSGSKRLVVLDEQNKRDILIPGVGGATVIKGV 472
Query: 61 PKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANT 120
++I CDKGDR+R SDVL F QMSE NQ+ ++ GKIP+G+FN+ F SG W RDAA+
Sbjct: 473 SENIRCDKGDRIRFKSDVLEFNQMSELLNQKSAVQGKIPSGYFNALFDMSGDWLRDAADI 652
Query: 121 KSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGG 180
K LAFDG I+LY + L + +VL + VK++VP+ WDPA+L+RFI+ YGTH VVG+ +GG
Sbjct: 653 KYLAFDGYFISLYCLHLTASPLVLQEEVKKSVPAQWDPASLSRFIQTYGTHIVVGMAVGG 832
Query: 181 TDIIYAKQQYSSPLQPSDVQKKLKDMADELFRGQAGQNNANDGTFNSKEKFMRDNGLGFL 240
D+I KQ++SS + P DV++ L+D+ D LF ++ T +SK K
Sbjct: 833 QDVICVKQKHSSKIPPGDVRRHLEDLGDFLFSDVRSPSSLQRKTADSKHK---------- 982
Query: 241 DIQAQSYRETEKNTTNVLAIYPLLVKHSRCRI*SLCARG----KVEMGNKISATMSGAKL 296
+ R + NTT +I K I S RG K N + S +
Sbjct: 983 -VPEVFNRVMQSNTTQFTSISETSSKDGLTIICS--KRGGDVFKHSHSNWLQTVPSNPEA 1153
Query: 297 FCLNLM------------*YQCHSYQLHLYLVE*MGVDI*LMP*ISIYDTADKPAIEELH 344
+ Y H+ L+L KP+ E+L
Sbjct: 1154 IIFKFVPISSLLTGIPGSGYLSHAINLYLRY---------------------KPSPEDLQ 1270
Query: 345 QFLEFQLPRQWAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVGMKPVTGLR 404
FLEFQ+PRQWAP+F EL L R+K+ SS SLQF +GPKL+++++ VV KPV GLR
Sbjct: 1271 YFLEFQIPRQWAPMFCELPLRHQRRKT-SSLSLQFCCLGPKLHISSTEVVSEQKPVVGLR 1447
Query: 405 LYLEGKKSNCLAIHLQHLSSLPKTFQLKDETN--------RNVSDASSERKYYEKVQWKS 456
LYLEGKKS+ LA+H+ HLSSLP L + + R + S ++ E V+WK
Sbjct: 1448 LYLEGKKSDRLALHINHLSSLPNKMILSSDASTPSIQSMWRGSDENESSNQFLEPVRWKR 1627
Query: 457 FSHICTAPVESYDDN--------AVVTGA 477
FS++CTA V+ +D N +VTGA
Sbjct: 1628 FSNVCTAVVK-HDPNWLNDCGGVYIVTGA 1711
>TC90348 similar to GP|15810435|gb|AAL07105.1 unknown protein {Arabidopsis
thaliana}, partial (48%)
Length = 663
Score = 224 bits (570), Expect = 1e-58
Identities = 112/165 (67%), Positives = 136/165 (81%), Gaps = 4/165 (2%)
Frame = +2
Query: 11 EDVIKSVGLGYDLTNDLRLKFCKYDS---KLIAIDHDN-LRTVELPGRVSIPNVPKSINC 66
E I S+G GYD+++D+RLKFCK DS +LI ID DN LR V LPG VS+PNV K I C
Sbjct: 167 EIAIGSIGRGYDISSDIRLKFCKGDSIHSRLIEIDEDNDLREVVLPGGVSLPNVSKLIKC 346
Query: 67 DKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANTKSLAFD 126
DKG+R R SDVLSFQQM+EQFNQE+SL+GKIP+G FNS F+FSG WQ+DAA+TK+LAFD
Sbjct: 347 DKGERTRFRSDVLSFQQMTEQFNQELSLTGKIPSGLFNSMFEFSGSWQKDAAHTKTLAFD 526
Query: 127 GVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTH 171
GV ITLY +AL+K+ ++L DHVK+AVPSSWDP ALARFI+ +GTH
Sbjct: 527 GVLITLYTVALEKSQMLLCDHVKKAVPSSWDPPALARFIDTFGTH 661
>CB893903 weakly similar to GP|20466444|gb unknown protein {Arabidopsis
thaliana}, partial (35%)
Length = 815
Score = 209 bits (532), Expect = 3e-54
Identities = 118/262 (45%), Positives = 165/262 (62%), Gaps = 8/262 (3%)
Frame = +3
Query: 337 KPAIEELHQFLEFQLPRQWAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVVVG 396
KP + +L FL++Q + WAPV +L LGP S S L + MGPKLYVNT V VG
Sbjct: 39 KPPMSDLSYFLDYQGYKIWAPVHNDLPLGPTTNISTISPFLTLNLMGPKLYVNTDKVTVG 218
Query: 397 MKPVTGLRLYLEGKKSNCLAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKS 456
+P+TG+RL+LEG K N LAIH++HL + P K E S+ ++ +++E + K
Sbjct: 219 KRPITGMRLFLEGMKCNRLAIHVEHLLNTPTMLSNKIEDTTIWSEEINDERFFEAISGKK 398
Query: 457 FSHICTAPVE-----SYDDNA--VVTGAHFEVGETGLKKVLF-LRLHFCKVADATRVRAP 508
FSH+CTAPV+ S + N +VTGA V + K+VL LRL F KV+++ V++
Sbjct: 399 FSHVCTAPVKYNPKWSTEKNVAFIVTGAQLHVKKHDTKRVLLHLRLLFSKVSNSFVVKSN 578
Query: 509 EWDGSPGLTQKSGMISTFISTRFSGPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKF 568
GS GL+QKSG+ S IST SG K + S V ++S+++P GPPVP Q K+LKF
Sbjct: 579 WTKGSSGLSQKSGIFSA-ISTSISGSSK--DQKKSTVLLDSSVFPTGPPVPVQTQKMLKF 749
Query: 569 VDTTEMTRGPQDLPGYWVVSGA 590
VDT+E+ +GPQ PG+W+V+GA
Sbjct: 750 VDTSELCKGPQHTPGHWLVTGA 815
>TC84610 similar to GP|20466444|gb|AAM20539.1 unknown protein {Arabidopsis
thaliana}, partial (9%)
Length = 681
Score = 108 bits (270), Expect(3) = 7e-45
Identities = 47/95 (49%), Positives = 72/95 (75%)
Frame = +3
Query: 117 AANTKSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGV 176
A + KSLAFDG I+LYN+ L +H++L + +K++VP+ WDPA+L+RFI YGTH +VG+
Sbjct: 381 AQDIKSLAFDGYFISLYNLHLTASHLILQEELKKSVPAHWDPASLSRFIATYGTHIIVGM 560
Query: 177 KIGGTDIIYAKQQYSSPLQPSDVQKKLKDMADELF 211
+GG D+I KQ++SS + P D+++ L+D+ D LF
Sbjct: 561 AVGGQDVICVKQKHSSKVPPGDLRRHLEDLXDFLF 665
Score = 75.1 bits (183), Expect(3) = 7e-45
Identities = 34/67 (50%), Positives = 45/67 (66%)
Frame = +2
Query: 59 NVPKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAA 118
+V + I CDKGDR+R SDVL F QMSE NQ+ ++ GKIP+G+FN+ F SG W RD +
Sbjct: 206 DVSEDIRCDKGDRVRFKSDVLQFNQMSEMLNQKSAIQGKIPSGYFNAVFDLSGDWFRDCS 385
Query: 119 NTKSLAF 125
+ F
Sbjct: 386 RHQISCF 406
Score = 36.6 bits (83), Expect(3) = 7e-45
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 3/44 (6%)
Frame = +1
Query: 17 VGLGYDLTNDLRLKFCK---YDSKLIAIDHDNLRTVELPGRVSI 57
+G G+DLT+D R+KF K +L ID N R + +PG V+I
Sbjct: 70 LGKGFDLTSDFRMKFSKGLINGGRLXVIDEMNKRDIMVPGGVTI 201
>BQ139188 weakly similar to PIR|C86410|C864 protein F3M18.18 [imported] -
Arabidopsis thaliana, partial (10%)
Length = 667
Score = 170 bits (430), Expect = 2e-42
Identities = 90/195 (46%), Positives = 125/195 (63%), Gaps = 3/195 (1%)
Frame = +1
Query: 9 SVEDVIKSVGLGYDLTNDLRLKFCK-YDSKLIAIDHD-NLRTVELPGRVSIPNVPKSINC 66
+ + I S+GLG+D+T D+ CK S LI I++ + R +ELPG V+IPNV S+ C
Sbjct: 73 AAQKAINSIGLGFDITLDINFDNCKSIGSPLIFINNQQHCRHLELPGGVTIPNVSNSVKC 252
Query: 67 DKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANTKSLAFD 126
+G+ +R+ SDVL+ QM + FN E+ L G +GHF ++F SG +D A+ KSLA+D
Sbjct: 253 VRGESIRIHSDVLTLHQMLQHFNHEMRLVGDTASGHFCASFGLSGRCIKDLASIKSLAYD 432
Query: 127 GVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGGTDIIYA 186
G I Y + L+ H L DHVK AVPSSWDP ALARFIE++GTH +VGV +G D+ Y
Sbjct: 433 GWFIKRYAVELENYHGELHDHVKEAVPSSWDPEALARFIERFGTHVIVGVSMGXKDVFYV 612
Query: 187 KQQ-YSSPLQPSDVQ 200
Q+ S P+ +Q
Sbjct: 613 XQEDTSXXXDPTSIQ 657
>BG648302 similar to GP|22202725|dbj contains ESTs AU032617(S11633)
D46758(S11633) C73071(E2861)~similar to Oryza sativa
chromosome 1, partial (5%)
Length = 816
Score = 159 bits (402), Expect = 3e-39
Identities = 92/201 (45%), Positives = 117/201 (57%), Gaps = 1/201 (0%)
Frame = +2
Query: 410 KKSNCLAIHLQHLSSLPKTFQLKDETNRNVSDASSERKYYEKVQWKSFSHICTAPVESYD 469
K+SN LAI+LQHL SLPK+ L D S S KY++K++W FS++CTAP+ES D
Sbjct: 5 KRSNRLAINLQHLVSLPKSLPLADNAPAYSSCDSYSCKYHKKLKWNCFSYVCTAPIESDD 184
Query: 470 DNAVVTGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQ-KSGMISTFIS 528
++VTGA +V KK L LRLHF KV AT + PEWD L + K G +
Sbjct: 185 SLSIVTGAQLQVE----KKCLLLRLHFSKVIGATL*KPPEWDQPSNLGKSKEGYMDY--- 343
Query: 529 TRFSGPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVS 588
P P + V+ LY G P + PKL ++VD E RGP++ PGYW VS
Sbjct: 344 -----------PIPGEETVHPLLYSGALSRPVRTPKLQRYVDRMERIRGPKNTPGYWAVS 490
Query: 589 GARLYVEKGKISLKVKYSLLT 609
GA+LYV GKI L VKYSLL+
Sbjct: 491 GAKLYVHNGKIYLLVKYSLLS 553
>TC91102 homologue to PIR|C86420|C86420 unknown protein 124288-121737
[imported] - Arabidopsis thaliana, partial (32%)
Length = 1061
Score = 89.7 bits (221), Expect(2) = 2e-35
Identities = 58/136 (42%), Positives = 78/136 (56%)
Frame = +1
Query: 473 VVTGAHFEVGETGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFS 532
+VTGA V + G K VL L+L F KV T +R WD +P T +G S S+ S
Sbjct: 580 IVTGAQLGVWDFGAKNVLHLKLLFSKVPGCT-IRRSVWDHNPS-TPVAGHKSDGASS--S 747
Query: 533 GPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGARL 592
+K + D +V+ KL K VD TEM++GPQD+PG+W+V+GA+L
Sbjct: 748 SAKKTSDEKKEDSSVHIG-------------KLAKIVDMTEMSKGPQDIPGHWLVTGAKL 888
Query: 593 YVEKGKISLKVKYSLL 608
VEKGKI L++KYSLL
Sbjct: 889 GVEKGKIVLRIKYSLL 936
Score = 78.2 bits (191), Expect(2) = 2e-35
Identities = 39/80 (48%), Positives = 51/80 (63%), Gaps = 6/80 (7%)
Frame = +3
Query: 393 VVVGMKPVTGLRLYLEGKKSNCLAIHLQHLSSLPKTFQLKDETNRNV------SDASSER 446
V VG KPVTGLRL LEG K N LAIHLQHL SLPK Q + + + +
Sbjct: 300 VTVGRKPVTGLRLSLEGNKQNRLAIHLQHLVSLPKNLQPHWDAHMAIGAPKWQGPEEQDS 479
Query: 447 KYYEKVQWKSFSHICTAPVE 466
+++E ++WK+FSH+ TAP+E
Sbjct: 480 RWFEPIKWKNFSHVSTAPIE 539
>CA991122 similar to PIR|C86420|C86 unknown protein 124288-121737 [imported]
- Arabidopsis thaliana, partial (32%)
Length = 744
Score = 118 bits (295), Expect(2) = 2e-32
Identities = 74/219 (33%), Positives = 113/219 (50%), Gaps = 13/219 (5%)
Frame = +3
Query: 378 QFSFMGPKLYVNTSPVVVGMKPVTGLRLYLEGKKSNCLAIHLQHLSSLPKTFQLKDETNR 437
QFS MG KLYV+ + VG +PVTG+RL LEG K N L++HLQHL SLPK Q +++
Sbjct: 132 QFSIMGQKLYVSQEQITVGRRPVTGIRLCLEGNKQNRLSVHLQHLVSLPKILQPYWDSHV 311
Query: 438 NVS------DASSERKYYEKVQWKSFSHICTAPVES-------YDDNAVVTGAHFEVGET 484
+ + +++E V+WK+FSH+ TAP+E+ + +VTGA V +
Sbjct: 312 AIGAPKWQGPEEQDSRWFEPVKWKNFSHVSTAPIENPETFIGDFSGVYIVTGAQLGVWDF 491
Query: 485 GLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGPQKLPPPQPSD 544
G + VL+++L + ++ T +R WD P + KS +T D
Sbjct: 492 GSRNVLYMKLLYSRLPGCT-IRRSLWDHIPNTSPKSSTAGNTSNT--------------D 626
Query: 545 VNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPG 583
+ N G L+K+VD +E+ +GP+D PG
Sbjct: 627 NSTNL-----GSRENT*QTSLVKYVDLSELRQGPEDPPG 728
Score = 39.7 bits (91), Expect(2) = 2e-32
Identities = 19/34 (55%), Positives = 23/34 (66%)
Frame = +1
Query: 337 KPAIEELHQFLEFQLPRQWAPVFGELALGPDRKK 370
KP IEEL FLEFQ+PR WAP+ + G RK+
Sbjct: 13 KPPIEELRYFLEFQIPRVWAPLHDRVP-GQQRKE 111
>AW686086 similar to GP|20466444|gb| unknown protein {Arabidopsis thaliana},
partial (13%)
Length = 666
Score = 83.6 bits (205), Expect(2) = 2e-32
Identities = 41/78 (52%), Positives = 55/78 (69%), Gaps = 1/78 (1%)
Frame = +1
Query: 88 FNQEVSLSGKIPTGHFNSAFQFS-GVWQRDAANTKSLAFDGVSITLYNIALDKTHVVLSD 146
FN++ S+ GKIP+G+FN+ F F G W +AANTK L DG I L+N+ +D ++LS
Sbjct: 244 FNRKSSIPGKIPSGYFNTVFGFDEGSWAAEAANTKCLGVDGYLIKLFNLHIDPYPLLLSK 423
Query: 147 HVKRAVPSSWDPAALARF 164
V +AVPSSWDP ALAR+
Sbjct: 424 QVIQAVPSSWDPPALARY 477
Score = 74.3 bits (181), Expect(2) = 2e-32
Identities = 36/74 (48%), Positives = 49/74 (65%)
Frame = +2
Query: 10 VEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNVPKSINCDKG 69
VE + S+G G+DLT+D RLKFCK + +LI ++ R + +PG SI +V I CDKG
Sbjct: 17 VEKALNSLGKGFDLTSDFRLKFCKGEERLILLNEIEKRELSVPGFGSIKDVSVDIKCDKG 196
Query: 70 DRMRLCSDVLSFQQ 83
DR R SD+L+F Q
Sbjct: 197 DRTRYQSDILTFTQ 238
>TC90560 weakly similar to GP|20466444|gb|AAM20539.1 unknown protein
{Arabidopsis thaliana}, partial (18%)
Length = 439
Score = 130 bits (326), Expect = 2e-30
Identities = 65/130 (50%), Positives = 88/130 (67%), Gaps = 1/130 (0%)
Frame = +3
Query: 10 VEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNVPKSINCDKG 69
VE + S+G G+DLT+D RLKFCK + +LI ++ R + +PG SI +V I CDKG
Sbjct: 45 VEKALNSLGKGFDLTSDFRLKFCKGEERLILLNEIEKRELSVPGFGSIKDVSVDIKCDKG 224
Query: 70 DRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQF-SGVWQRDAANTKSLAFDGV 128
DR R SD+L+F QMSE FN++ S+ GKIP+G+FN+ F F G W +AANTK L DG
Sbjct: 225 DRTRYQSDILTFTQMSELFNRKSSIPGKIPSGYFNTVFGFDEGSWAAEAANTKCLGVDGY 404
Query: 129 SITLYNIALD 138
I L+N+ +D
Sbjct: 405 LIKLFNLHID 434
>BF633115 weakly similar to GP|15809820|gb At1g29690/F15D2_24 {Arabidopsis
thaliana}, partial (48%)
Length = 630
Score = 127 bits (320), Expect = 1e-29
Identities = 66/157 (42%), Positives = 101/157 (64%), Gaps = 2/157 (1%)
Frame = +2
Query: 14 IKSVGLGYDLTNDLRLKFCKY--DSKLIAIDHDNLRTVELPGRVSIPNVPKSINCDKGDR 71
I+++G G+D+T+D+RL +CK S+L+ +D ++ R + L + +PNV I+ +G
Sbjct: 77 IQALGRGFDVTSDIRLLYCKGAPGSRLVHLDEEHNRDLALSQELVVPNVSLDIDFSRGKS 256
Query: 72 MRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANTKSLAFDGVSIT 131
+ V SF++M+E FN+ + GKIP G FNS F F+G DAA TKSLA G I
Sbjct: 257 GIEKTPVCSFEKMAEYFNERSGIEGKIPLGSFNSMFNFTGSSMVDAAATKSLAMVGYFIP 436
Query: 132 LYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKY 168
L+ + L K ++ L+D V+RAVP SWDPA+LAR ++ +
Sbjct: 437 LFEVKLTKQNLALNDEVRRAVPYSWDPASLARMLQSF 547
>BE321769 weakly similar to PIR|C86410|C864 protein F3M18.18 [imported] -
Arabidopsis thaliana, partial (9%)
Length = 378
Score = 115 bits (289), Expect = 4e-26
Identities = 60/126 (47%), Positives = 81/126 (63%), Gaps = 2/126 (1%)
Frame = +1
Query: 69 GDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAF--QFSGVWQRDAANTKSLAFD 126
G+ +R+ SDVLS QQM + FN E+ L GK +GHF ++F F G + D+ LA+D
Sbjct: 1 GESIRINSDVLSLQQMLQHFNHEMRLDGKTASGHFCASFGLHFHGT*ELDSII--HLAYD 174
Query: 127 GVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIEKYGTHAVVGVKIGGTDIIYA 186
G I Y + L+K H L DHVK VPS WD AL RFIE++GTH +VGV +GG D++Y
Sbjct: 175 GWFIKRYAVELEKYHGQLHDHVKEVVPSLWDAGALTRFIERFGTHVIVGVSMGGKDVLYV 354
Query: 187 KQQYSS 192
+Q +S
Sbjct: 355 RQDDTS 372
>TC91124 weakly similar to GP|15809820|gb|AAL06838.1 At1g29690/F15D2_24
{Arabidopsis thaliana}, partial (37%)
Length = 653
Score = 101 bits (251), Expect = 1e-21
Identities = 62/169 (36%), Positives = 97/169 (56%), Gaps = 3/169 (1%)
Frame = +3
Query: 2 SSKKKVLSVEDVIKSVGLGYDLTNDLRLKFCKY--DSKLIAIDHDNLRTVELPGRVSIPN 59
SS ++ + I+++G G+D+T+D+RL +CK S+L+ +D ++ R + L + +PN
Sbjct: 126 SSDSLSATICNSIQALGRGFDVTSDIRLLYCKGAPGSRLVHLDEEHNRDLALSQELVVPN 305
Query: 60 VPKSINCDKGDRMRLCSDVLSFQQMSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAAN 119
V I+ +G + V SF++M+E FN+ + GKIP G FNS F F+G DAA
Sbjct: 306 VSLDIDFSRGKSGIEKTPVCSFEKMAEYFNERSGIEGKIPLGSFNSMFNFTGSSMVDAAA 485
Query: 120 TKSLAFDGVSITLYNIALDKTHVVL-SDHVKRAVPSSWDPAALARFIEK 167
TKSLA + + +++T V+ AVP SWDP +LA FI K
Sbjct: 486 TKSLAMGWIFHSSIRS*INETKFXP*MMKVRPAVPYSWDPXSLASFIGK 632
>TC92134 similar to GP|14209545|dbj|BAB56041. contains ESTs C72864(E2385)
AU082952(E2385)~similar to Arabidopsis thaliana
chromosome 1 F15D2.24~, partial (8%)
Length = 997
Score = 101 bits (251), Expect = 1e-21
Identities = 57/134 (42%), Positives = 78/134 (57%), Gaps = 2/134 (1%)
Frame = +3
Query: 26 DLRLKFCK--YDSKLIAIDHDNLRTVELPGRVSIPNVPKSINCDKGDRMRLCSDVLSFQQ 83
D RL +CK S+++ ID R + L V +PNV + I RL S V SFQ+
Sbjct: 594 DTRLLYCKGGSGSRVVEIDEQYQRDLFLYDDVVVPNVSRDIRSFPEPMGRLSSGVCSFQE 773
Query: 84 MSEQFNQEVSLSGKIPTGHFNSAFQFSGVWQRDAANTKSLAFDGVSITLYNIALDKTHVV 143
M + FN + S+SG P G FNSAF F+G DAA TK+L+ DG I L + L K ++
Sbjct: 774 MVDYFNHKASISGSFPLGSFNSAFSFTGSKHVDAAATKTLSSDGFYIPLAKVQLQKIDLM 953
Query: 144 LSDHVKRAVPSSWD 157
L ++VKRA+P +WD
Sbjct: 954 LQENVKRAIPVNWD 995
>BQ139397 weakly similar to PIR|T09892|T098 hypothetical protein T22A6.120 -
Arabidopsis thaliana, partial (13%)
Length = 618
Score = 68.2 bits (165), Expect = 9e-12
Identities = 32/58 (55%), Positives = 43/58 (73%)
Frame = +3
Query: 337 KPAIEELHQFLEFQLPRQWAPVFGELALGPDRKKSQSSSSLQFSFMGPKLYVNTSPVV 394
KP ++L FLEFQ+PR+WAP+F EL L RKK+ S LQFSFM PKL++N++ V+
Sbjct: 270 KPTPDDLQYFLEFQIPREWAPMFSELPLRHQRKKTY-SPPLQFSFMSPKLHINSTQVI 440
>BQ143931 similar to PIR|T02229|T022 protein BYJ15 - common tobacco (fragment),
partial (15%)
Length = 1239
Score = 30.0 bits (66), Expect = 2.7
Identities = 13/28 (46%), Positives = 18/28 (63%)
Frame = +3
Query: 534 PQKLPPPQPSDVNVNSALYPGGPPVPAQ 561
P+ LPP P DV+ SA+ P PVP++
Sbjct: 990 PRSLPPDSPPDVSE*SAIIPLPAPVPSE 1073
>TC86275 similar to PIR|E96603|E96603 unknown protein F14G9.26 [imported] -
Arabidopsis thaliana, partial (51%)
Length = 961
Score = 28.9 bits (63), Expect = 6.1
Identities = 19/57 (33%), Positives = 25/57 (43%)
Frame = -2
Query: 534 PQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTEMTRGPQDLPGYWVVSGA 590
P +LPPP P +S+ GG P+P + G LPG WV+ GA
Sbjct: 588 PDQLPPPPPPPPPSSSSR--GGNPLPPSS------------ICGDFSLPGRWVIVGA 460
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.322 0.138 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,874,669
Number of Sequences: 36976
Number of extensions: 247931
Number of successful extensions: 1263
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 1219
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1238
length of query: 618
length of database: 9,014,727
effective HSP length: 102
effective length of query: 516
effective length of database: 5,243,175
effective search space: 2705478300
effective search space used: 2705478300
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)
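The statistics block above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST report. It assumes the standard Karlin-Altschul conversion from raw score to bits, (lambda * S - ln K) / ln 2, and assumes the effective lengths are obtained by subtracting the effective HSP length from the query and from each database sequence. The function and variable names are illustrative only. All numeric inputs are copied from the report.

```python
import math

# Parameters copied from the report's statistics block.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.322, 0.138
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410

def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Lengths copied from the report (database length is in translated residues).
query_len = 618
db_len = 9_014_727
n_seqs = 36_976
hsp_len = 102

# Assumed length correction: subtract the effective HSP length once from
# the query and once per database sequence.
eff_query = query_len - hsp_len
eff_db = db_len - n_seqs * hsp_len
search_space = eff_query * eff_db

s1_bits = bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED)  # report: S1: 41 (21.9 bits)
s2_bits = bit_score(61, LAMBDA_GAPPED, K_GAPPED)      # report: S2: 61 (28.1 bits)

print(eff_query, eff_db, search_space)
print(round(s1_bits, 1), round(s2_bits, 1))
```

Under these assumptions the computed effective lengths and search space match the values printed in the report, and the S1 and S2 cutoffs convert to the bit values shown next to them.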