FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5882, 496 aa 1>>>pF1KB5882 496 - 496 aa - 496 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.9897+/-0.000746; mu= 13.5700+/- 0.045 mean_var=180.9963+/-36.403, 0's: 0 Z-trim(115.7): 15 B-trim: 166 in 1/52 Lambda= 0.095332 statistics sampled from 16209 (16224) to 16209 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.8), E-opt: 0.2 (0.498), width: 16 Scan time: 3.850 The best scores are: opt bits E(32554) CCDS10221.1 SMAD6 gene_id:4091|Hs108|chr15 ( 496) 3485 491.2 1.1e-138 CCDS11936.1 SMAD7 gene_id:4092|Hs108|chr18 ( 426) 1112 164.8 1.7e-40 CCDS59317.1 SMAD7 gene_id:4092|Hs108|chr18 ( 425) 1110 164.5 2.1e-40 CCDS54186.1 SMAD7 gene_id:4092|Hs108|chr18 ( 211) 781 118.9 5.4e-27 CCDS11934.1 SMAD2 gene_id:4087|Hs108|chr18 ( 467) 396 66.3 8e-11 >>CCDS10221.1 SMAD6 gene_id:4091|Hs108|chr15 (496 aa) initn: 3485 init1: 3485 opt: 3485 Z-score: 2603.5 bits: 491.2 E(32554): 1.1e-138 Smith-Waterman score: 3485; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496) 10 20 30 40 50 60 pF1KB5 MFRSKRSGLVRRLWRSRVVPDREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGRSEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MFRSKRSGLVRRLWRSRVVPDREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGRSEV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 RPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLPESD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLPESD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 CETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTYSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 CETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTYSLL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 KRLKERSLDTLLEAVESRGGVPGGCVLVPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KRLKERSLDTLLEAVESRGGVPGGCVLVPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 LKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSPRDEYKPLDLSDSTLSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSPRDEYKPLDLSDSTLSY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 TETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 TETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 PQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 GGRALVVRKVPPGYSIKVFDFERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GGRALVVRKVPPGYSIKVFDFERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQF 430 440 450 460 470 480 490 pF1KB5 ITSCPCWLEILLNNPR :::::::::::::::: CCDS10 ITSCPCWLEILLNNPR 490 >>CCDS11936.1 SMAD7 gene_id:4092|Hs108|chr18 (426 aa) initn: 1307 init1: 559 opt: 1112 Z-score: 840.4 bits: 164.8 E(32554): 1.7e-40 Smith-Waterman score: 1289; 45.7% identity (64.5% similar) in 512 aa overlap (1-494:1-425) 10 20 30 40 50 pF1KB5 MFRSKRSGLVRRLWRSRVVP---DREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGR :::.:::.:::::::::. : :.:::..::::::. :.: : .: CCDS11 MFRTKRSALVRRLWRSRA-PGGEDEEEGAGGGGGGGEL------RGEGATDSR------- 10 20 30 40 60 70 80 90 100 110 pF1KB5 SEVRPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLP :.::: .::: : :: CCDS11 -------------------AHGAG----GGGPGR------AG------------------ 50 120 130 140 150 160 170 pF1KB5 ESDCETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTY ::: . .: . . : :::. :::. : .::..:. CCDS11 --------CCLGKAVRGAKGHHHPHPPAAGAG--AAGGA------------EADLKALTH 60 70 80 90 180 190 200 210 220 pF1KB5 SLLKRLKERSLDTLLEAVESRGGVPGGCVLVP-RADLRLG-GQPA------PPQ-----L :.::.::::.:. ::.:::::::. .:.:.: : : ::: : :: ::. : CCDS11 SVLKKLKERQLELLLQAVESRGGTRTACLLLPGRLDCRLGPGAPAGAQPAQPPSSYSLPL 100 110 120 130 140 150 230 240 250 260 270 280 pF1KB5 LLGRLFRWPDLQHAVELKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSP :: ..::::::.:. :.: :: :.:.. . :::::.:.:::: ::::::::: : CCDS11 LLCKVFRWPDLRHSSEVKRLCCCESYGKI-NPELVCCNPHHLSRLCELESPPPPYSRY-P 160 170 180 190 200 210 290 300 310 320 330 340 pF1KB5 RDEYKPL-DLSDSTLSYTETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRV : :: : :.. : .:: .:: : ::: .::... . :::: :::::..::: CCDS11 MDFLKPTADCPDAVPSSAETGGTNYL--APGGLSDSQLLLEPGDRSHWCVVAYWEEKTRV 220 230 240 250 260 270 350 360 370 380 390 400 pF1KB5 GRLYAVYDQAVSIFYDLPQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAY :::: : . ...::::::::.:::::::: ...:. :...::::: :: :..: ::::.: CCDS11 GRLYCVQEPSLDIFYDLPQGNGFCLGQLNSDNKSQLVQKVRSKIGCGIQLTREVDGVWVY 280 290 300 310 320 330 410 420 430 440 450 460 pF1KB5 NRGEHPIFVNSPTLDAPGGRALVVRKVPPGYSIKVFDFERS-GLQHAPEPDAADGPYDPN ::. .:::..: ::: : .:.:.:.:: ::.:::.::.:.. .::. . . . :. CCDS11 NRSSYPIFIKSATLDNPDSRTLLVHKVFPGFSIKAFDYEKAYSLQRPNDHEFMQQPWTGF 340 350 360 370 380 390 470 480 490 pF1KB5 SVRISFAKGWGPCYSRQFITSCPCWLEILLNNPR .:.:::.:::: ::.::::.:::::::...:. CCDS11 TVQISFVKGWGQCYTRQFISSCPCWLEVIFNSR 400 410 420 >>CCDS59317.1 SMAD7 gene_id:4092|Hs108|chr18 (425 aa) initn: 1174 init1: 558 opt: 1110 Z-score: 839.0 bits: 164.5 E(32554): 2.1e-40 Smith-Waterman score: 1301; 45.8% identity (64.6% similar) in 511 aa overlap (1-494:1-424) 10 20 30 40 50 pF1KB5 MFRSKRSGLVRRLWRSRVVP---DREEGGSGGGGGGDEDGSLGSRAEPAPRAREGGGCGR :::.:::.:::::::::. : :.:::..::::::. :.: : .: CCDS59 MFRTKRSALVRRLWRSRA-PGGEDEEEGAGGGGGGGEL------RGEGATDSR------- 10 20 30 40 60 70 80 90 100 110 pF1KB5 SEVRPVAPRRPRDAVGQRGAQGAGRRRRAGGPPRPMSEPGAGAGSSLLDVAEPGGPGWLP :.::: .::: : :: CCDS59 -------------------AHGAG----GGGPGR------AG------------------ 50 120 130 140 150 160 170 pF1KB5 ESDCETVTCCLFSERDAAGAPRDASDPLAGAALEPAGGGRSREARSRLLLLEQELKTVTY ::: . .: . . : :::. :::. : .::..:. CCDS59 --------CCLGKAVRGAKGHHHPHPPAAGAG--AAGGA------------EADLKALTH 60 70 80 90 180 190 200 210 220 pF1KB5 SLLKRLKERSLDTLLEAVESRGGVPGGCVLVP-RADLRLG-GQPA------PPQ-----L :.::.::::.:. ::.:::::::. .:.:.: : : ::: : :: ::. : CCDS59 SVLKKLKERQLELLLQAVESRGGTRTACLLLPGRLDCRLGPGAPAGAQPAQPPSSYSLPL 100 110 120 130 140 150 230 240 250 260 270 280 pF1KB5 LLGRLFRWPDLQHAVELKPLCGCHSFAAAADGPTVCCNPYHFSRLCGPESPPPPYSRLSP :: ..::::::.:. :.: :: :.:.. . :::::.:.:::: ::::::::: : CCDS59 LLCKVFRWPDLRHSSEVKRLCCCESYGKI-NPELVCCNPHHLSRLCELESPPPPYSRY-P 160 170 180 190 200 210 290 300 310 320 330 340 pF1KB5 RDEYKPLDLSDSTLSYTETEATNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVG : :: : :.. : .:: .:: : ::: .::... . :::: :::::..:::: CCDS59 MDFLKPTDCPDAVPSSAETGGTNYL--APGGLSDSQLLLEPGDRSHWCVVAYWEEKTRVG 220 230 240 250 260 270 350 360 370 380 390 400 pF1KB5 RLYAVYDQAVSIFYDLPQGSGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYN ::: : . ...::::::::.:::::::: ...:. :...::::: :: :..: ::::.:: CCDS59 RLYCVQEPSLDIFYDLPQGNGFCLGQLNSDNKSQLVQKVRSKIGCGIQLTREVDGVWVYN 280 290 300 310 320 330 410 420 430 440 450 460 pF1KB5 RGEHPIFVNSPTLDAPGGRALVVRKVPPGYSIKVFDFERS-GLQHAPEPDAADGPYDPNS :. .:::..: ::: : .:.:.:.:: ::.:::.::.:.. .::. . . . :. . CCDS59 RSSYPIFIKSATLDNPDSRTLLVHKVFPGFSIKAFDYEKAYSLQRPNDHEFMQQPWTGFT 340 350 360 370 380 390 470 480 490 pF1KB5 VRISFAKGWGPCYSRQFITSCPCWLEILLNNPR :.:::.:::: ::.::::.:::::::...:. CCDS59 VQISFVKGWGQCYTRQFISSCPCWLEVIFNSR 400 410 420 >>CCDS54186.1 SMAD7 gene_id:4092|Hs108|chr18 (211 aa) initn: 750 init1: 558 opt: 781 Z-score: 598.1 bits: 118.9 E(32554): 5.4e-27 Smith-Waterman score: 781; 53.6% identity (79.1% similar) in 211 aa overlap (286-494:2-210) 260 270 280 290 300 310 pF1KB5 GPTVCCNPYHFSRLCGPESPPPPYSRLSPRDEYKPL-DLSDSTLSYTETEATNSLITAPG : :: : :.. : .:: .:: : ::: CCDS54 MDFLKPTADCPDAVPSSAETGGTNYL--APG 10 20 320 330 340 350 360 370 pF1KB5 EFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDLPQGSGFCLGQLNLE .::... . :::: :::::..::::::: : . ...::::::::.:::::::: . CCDS54 GLSDSQLLLEPGDRSHWCVVAYWEEKTRVGRLYCVQEPSLDIFYDLPQGNGFCLGQLNSD 30 40 50 60 70 80 380 390 400 410 420 430 pF1KB5 QRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPGGRALVVRKVPPGY ..:. :...::::: :: :..: ::::.:::. .:::..: ::: : .:.:.:.:: ::. CCDS54 NKSQLVQKVRSKIGCGIQLTREVDGVWVYNRSSYPIFIKSATLDNPDSRTLLVHKVFPGF 90 100 110 120 130 140 440 450 460 470 480 490 pF1KB5 SIKVFDFERS-GLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQFITSCPCWLEILLN :::.::.:.. .::. . . . :. .:.:::.:::: ::.::::.:::::::...: CCDS54 SIKAFDYEKAYSLQRPNDHEFMQQPWTGFTVQISFVKGWGQCYTRQFISSCPCWLEVIFN 150 160 170 180 190 200 pF1KB5 NPR . CCDS54 SR 210 >>CCDS11934.1 SMAD2 gene_id:4087|Hs108|chr18 (467 aa) initn: 603 init1: 145 opt: 396 Z-score: 307.8 bits: 66.3 E(32554): 8e-11 Smith-Waterman score: 473; 30.3% identity (56.8% similar) in 333 aa overlap (213-495:115-445) 190 200 210 220 230 240 pF1KB5 LKERSLDTLLEAVESRGGVPGGCVLVPRADLRLGGQPAPPQLLLGRLFRWPDLQHAVELK :... . . :... ::.:::::. ::: CCDS11 WGLSTPNTIDQWDTTGLYSFSEQTRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHELK 90 100 110 120 130 140 250 260 270 280 pF1KB5 PLCGC-HSFAAAADGPTVCCNPYHFSRLCGPESPP-------------PP---YSR---- . .: ..: : :: ::::..:. : :: :: :.. CCDS11 AIENCEYAFNLKKD--EVCVNPYHYQRVETPVLPPVLVPRHTEILTELPPLDDYTHSIPE 150 160 170 180 190 200 290 300 310 320 pF1KB5 -------LSPRDEYKPLDLSDSTLSY----TETEATNSLIT-APGEFSDASMSP------ . :...: : . .: .. . ..:. : .:.:.: ...:: CCDS11 NTNFPAGIEPQSNYIPETPPPGYISEDGETSDQQLNQSMDTGSPAELSPTTLSPVNHSLD 210 220 230 240 250 260 330 340 350 360 370 pF1KB5 ----DATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSI--FYDLPQGSGFCLGQLNLEQRS ..:. :::.::.: :::. . . . .... : : .. :::: :. .:. CCDS11 LQPVTYSEPAFWCSIAYYELNQRVGETFHASQPSLTVDGFTDPSNSERFCLGLLSNVNRN 270 280 290 300 310 320 380 390 400 410 420 430 pF1KB5 ESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPGG-RALVVRKVPPGYSI .:. :: .:: :. : :.: .. :::.::. . : . .: :.::: .. CCDS11 ATVEMTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKIPPGCNL 330 340 350 360 370 380 440 450 460 470 480 490 pF1KB5 KVFDF-ERSGLQHAPEPDAADGPYDPN---SVRISFAKGWGPCYSRQFITSCPCWLEILL :.:. : ..: .. .. :. . ..:.::.:::: : :: .:: :::.:. : CCDS11 KIFNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKGWGAEYRRQTVTSTPCWIELHL 390 400 410 420 430 440 pF1KB5 NNPR :.: CCDS11 NGPLQWLDKVLTQMGSPSVRCSSMS 450 460 496 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 18:41:11 2016 done: Mon Nov 7 18:41:12 2016 Total Scan time: 3.850 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]