FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB0472, 993 aa 1>>>pF1KB0472 993 - 993 aa - 993 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.2898+/-0.00107; mu= 5.4211+/- 0.065 mean_var=225.3272+/-44.868, 0's: 0 Z-trim(111.7): 23 B-trim: 82 in 2/49 Lambda= 0.085441 statistics sampled from 12579 (12597) to 12579 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.714), E-opt: 0.2 (0.387), width: 16 Scan time: 4.610 The best scores are: opt bits E(32554) CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11 ( 993) 6438 807.1 0 CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118) 496 74.7 1.2e-12 CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129) 496 74.7 1.2e-12 >>CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11 (993 aa) initn: 6438 init1: 6438 opt: 6438 Z-score: 4300.0 bits: 807.1 E(32554): 0 Smith-Waterman score: 6438; 99.8% identity (99.9% similar) in 993 aa overlap (1-993:1-993) 10 20 30 40 50 60 pF1KB0 MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRGVCLLKLQVSSQRTGLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRGVCLLKLQVSSQRTGLY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB0 GRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLATGILTRVTQKSVTVAFDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 GRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLATGILTRVTQKSVTVAFDE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB0 SHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHSGPASSLIEVLFGRSAPSPAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 SHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHSGPASSLIEVLFGRSAPSPAS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB0 EIHPLTFFNTCLDTSQKEAVSFALSQKELAIIHGPPGTGKTTTVVEIILQAVKQGLKVLC :::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::: CCDS81 EIHPLTFFNTCLDTSQKEAVLFALSQKELAIIHGPPGTGKTTTVVEIILQAVKQGLKVLC 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB0 CAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQQHSLDAVLARSDSAQIVADIRKDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 CAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQQHSLDAVLARSDSAQIVADIRKDI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB0 DQVFVKNKKTQDKREKSNFRNEIKLLRKELKEREEAAMLESLTSANVVLATNTGASADGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 DQVFVKNKKTQDKREKSNFRNEIKLLRKELKEREEAAMLESLTSANVVLATNTGASADGP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB0 LKLLPESYFDVVVIDECAQALEASCWIPLLKARKCILAGDHKQLPPTTVSHKAALAGLSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 LKLLPESYFDVVVIDECAQALEASCWIPLLKARKCILAGDHKQLPPTTVSHKAALAGLSL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB0 SLMERLAEEYGARVVRTLTVQYRMHQAIMRWASDTMYLGQLTAHSSVARHLLRDLPGVAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 SLMERLAEEYGARVVRTLTVQYRMHQAIMRWASDTMYLGQLTAHSSVARHLLRDLPGVAA 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB0 TEETGVPLLLVDTAGCGLFELEEEDEQSKGNPGEVRLVSLHIQALVDAGVPARDIAVVSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 TEETGVPLLLVDTAGCGLFELEEEDEQSKGNPGEVRLVSLHIQALVDAGVPARDIAVVSP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB0 YNLQVDLLRQSLVHRHPELEIKSVDGFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 YNLQVDLLRQSLVHRHPELEIKSVDGFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVA 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB0 VTRARRHVAVICDSRTVNNHAFLKTLVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 VTRARRHVAVICDSRTVNNHAFLKTLVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSS 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB0 HAATKPQGPAASTRTGSQRQEGGQEAAAPARQGRKKPAGKSLASEAPSQPSLNGGSPEGV ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 HAATKPQGPATSTRTGSQRQEGGQEAAAPARQGRKKPAGKSLASEAPSQPSLNGGSPEGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB0 ESQDGVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 ESQDGVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFI 730 740 750 760 770 780 790 800 810 820 830 840 pF1KB0 TVSKRAPRPRAALGPPAGTGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 TVSKRAPRPRAALGPPAGTGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQR 790 800 810 820 830 840 850 860 870 880 890 900 pF1KB0 VRSAQGQPASKEQQASGQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 VRSAQGQPASKEQQASGQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFA 850 860 870 880 890 900 910 920 930 940 950 960 pF1KB0 KCTAGVTTLGQFCQLCSRRYCLSHHLPEIHGCGERARAHARQRISREGVLYAGSGTKNGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 KCTAGVTTLGQFCQLCSRRYCLSHHLPEIHGCGERARAHARQRISREGVLYAGSGTKNGS 910 920 930 940 950 960 970 980 990 pF1KB0 LDPAKRAQLQRRLDKKLSELSNQRTSRRKERGT ::::::::::::::::::::::::::::::::: CCDS81 LDPAKRAQLQRRLDKKLSELSNQRTSRRKERGT 970 980 990 >>CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118 aa) initn: 382 init1: 221 opt: 496 Z-score: 340.8 bits: 74.7 E(32554): 1.2e-12 Smith-Waterman score: 690; 29.8% identity (58.8% similar) in 638 aa overlap (17-633:312-895) 10 20 30 40 pF1KB0 MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKE---LQSR :: . : ...: .. :.::... :... CCDS12 KPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEADYDKKLKESQT-QDNITVRWDLGLNKK 290 300 310 320 330 340 50 60 70 80 90 100 pF1KB0 GVCLLKL-QVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLA . . : ...:. . : . : :. : : .. : .. . : : :...: CCDS12 RIAYFTLPKTDSDMRLMQGDEICL---RYKGDLAPLWKGI---GHVIKVPD--NYGDEIA 350 360 370 380 390 110 120 130 140 150 160 pF1KB0 TGILTRVTQKSVTVAFDESHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHSGP . ..:: . . .:.::... . ..... :...:: .. ... CCDS12 IEL-----RSSVGAPVEVTHNFQVDF----------VWKSTSFDRMQSALKTFAVDETSV 400 410 420 430 170 180 190 200 210 pF1KB0 ASSLIEVLFGRSAPSPASEIH-PLTFFNTCL---DTSQKEAVSFALSQKELAIIHGPPGT .. . . :.:. . . . . : : : . :: ::. .: :. :..:.::::: CCDS12 SGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVL-QRPLSLIQGPPGT 440 450 460 470 480 490 220 230 240 250 260 270 pF1KB0 GKTTTVVEIILQAVKQGL-KVLCCAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQQ :::.: . :. . ..:: :: :::::::::.:.:.. ...:: .: :.:. CCDS12 GKTVTSATIVYHLARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSR--EAID- 500 510 520 530 540 550 280 290 300 310 320 330 pF1KB0 HSLDAVLARSDSAQIVADIRKDIDQVFVKNKKTQDKREKSNFRNEIKLLRKELKEREEAA : . :: . :: ...:.. .: : : : ... . . . ::. : CCDS12 -SPVSFLALHN--QI-----RNMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAER- 560 570 580 590 600 340 350 360 370 380 390 pF1KB0 MLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDECAQALEASCWIPL-LKARKCI : : .:.:. : .:: : .: . : ..::: .:: : : .:. : :.. : CCDS12 --ELLMNADVICCTCVGA---GDPRL-AKMQFRSILIDESTQATEPECMVPVVLGAKQLI 610 620 630 640 650 400 410 420 430 440 450 pF1KB0 LAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVRTLTVQYRMHQAIMRWASDTM :.::: :: :... .::: :::: ::.:::. : : .: : :::::: :. . :. . CCDS12 LVGDHCQLGPVVMCKKAAKAGLSQSLFERLVV-LGIRPIR-LQVQYRMHPALSAFPSNIF 660 670 680 690 700 710 460 470 480 490 500 510 pF1KB0 YLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEEDEQSKGNPGEVR : :.: ..: .. . . . :... : : .:. :.:. : CCDS12 YEGSLQNGVTAADRVKKGFD--FQWPQPDKPMFFYVTQG-------QEEIASSGTSYLNR 720 730 740 750 760 520 530 540 550 560 pF1KB0 LVSLHIQA----LVDAGVPARDIAVVSPYNLQVDLLRQSL-----VHR--HPELEIKSVD . ... :. ::. .:....::. : . : : . .: . :.:: ::: CCDS12 TEAANVEKITTKLLKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVD 770 780 790 800 810 820 570 580 590 600 610 620 pF1KB0 GFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARRHVAVICDSRTVNNHAFLKT .::::::. .::: ::.:.. .::: . ::.:::.:::: : .. . ...... . . CCDS12 AFQGREKDFIILSCVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNH 830 840 850 860 870 880 630 640 650 660 670 680 pF1KB0 LVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSSHAATKPQGPAASTRTGSQRQEGGQE :..:. .. CCDS12 LLNYYKEQKVLVEGPLNNLRESLMQFSKPRKLVNTINPGARFMTTAMYDAREAIIPGSVY 890 900 910 920 930 940 >>CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129 aa) initn: 382 init1: 221 opt: 496 Z-score: 340.7 bits: 74.7 E(32554): 1.2e-12 Smith-Waterman score: 681; 31.0% identity (59.5% similar) in 565 aa overlap (86-633:389-906) 60 70 80 90 100 110 pF1KB0 RTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLATGILTRVTQKSVT : .. . : : :...: . ..:: CCDS74 IIWLRDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPD--NYGDEIAIEL-----RSSVG 360 370 380 390 400 410 120 130 140 150 160 170 pF1KB0 VAFDESHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHSGPASSLIEVLFGRSA . . .:.::... . ..... :...:: .. ... .. . . :.:. . CCDS74 APVEVTHNFQVDF----------VWKSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEV 420 430 440 450 460 180 190 200 210 220 230 pF1KB0 PSPASEIH-PLTFFNTCL---DTSQKEAVSFALSQKELAIIHGPPGTGKTTTVVEIILQA . . . : : : . :: ::. .: :. :..:.::::::::.: . :. . CCDS74 EDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVL-QRPLSLIQGPPGTGKTVTSATIVYHL 470 480 490 500 510 520 240 250 260 270 280 290 pF1KB0 VKQGL-KVLCCAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQQHSLDAVLARSDSA ..:: :: :::::::::.:.:.. ...:: .: :.:. : . :: . CCDS74 ARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSR--EAID--SPVSFLALHN-- 530 540 550 560 570 300 310 320 330 340 350 pF1KB0 QIVADIRKDIDQVFVKNKKTQDKREKSNFRNEIKLLRKELKEREEAAMLESLTSANVVLA :: ...:.. .: : : : ... . . . ::. : : : .:.:. CCDS74 QI-----RNMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAER---ELLMNADVICC 580 590 600 610 620 360 370 380 390 400 pF1KB0 TNTGASADGPLKLLPESYFDVVVIDECAQALEASCWIPL-LKARKCILAGDHKQLPPTTV : .:: : .: . : ..::: .:: : : .:. : :.. ::.::: :: :... CCDS74 TCVGA---GDPRL-AKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVM 630 640 650 660 670 680 410 420 430 440 450 460 pF1KB0 SHKAALAGLSLSLMERLAEEYGARVVRTLTVQYRMHQAIMRWASDTMYLGQLTAHSSVAR .::: :::: ::.:::. : : .: : :::::: :. . :. .: :.: ..: CCDS74 CKKAAKAGLSQSLFERLVV-LGIRPIR-LQVQYRMHPALSAFPSNIFYEGSLQNGVTAAD 690 700 710 720 730 740 470 480 490 500 510 520 pF1KB0 HLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEEDEQSKGNPGEVRLVSLHIQA----L .. . . . :... : : .:. :.:. : . ... : CCDS74 RVKKGFD--FQWPQPDKPMFFYVTQG-------QEEIASSGTSYLNRTEAANVEKITTKL 750 760 770 780 790 530 540 550 560 570 pF1KB0 VDAGVPARDIAVVSPYNLQVDLLRQSL-----VHR--HPELEIKSVDGFQGREKEAVILS . ::. .:....::. : . : : . .: . :.:: :::.::::::. .::: CCDS74 LKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILS 800 810 820 830 840 850 580 590 600 610 620 630 pF1KB0 FVRSNRKGEVGFLAEDRRINVAVTRARRHVAVICDSRTVNNHAFLKTLVEYFTQHGEVRT ::.:.. .::: . ::.:::.:::: : .. . ...... . . :..:. .. CCDS74 CVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVE 860 870 880 890 900 910 640 650 660 670 680 690 pF1KB0 AFEYLDDIVPENYSHENSQGSSHAATKPQGPAASTRTGSQRQEGGQEAAAPARQGRKKPA CCDS74 GPLNNLRESLMQFSKPRKLVNTINPGARFMTTAMYDAREAIIPGSVYDRSSQGRPSSMYF 920 930 940 950 960 970 993 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 18:50:15 2016 done: Mon Nov 7 18:50:16 2016 Total Scan time: 4.610 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]