FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0917, 288 aa 1>>>pF1KE0917 288 - 288 aa - 288 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3432+/-0.000975; mu= 8.0726+/- 0.059 mean_var=242.3991+/-50.166, 0's: 0 Z-trim(111.6): 183 B-trim: 0 in 0/54 Lambda= 0.082378 statistics sampled from 12331 (12535) to 12331 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.713), E-opt: 0.2 (0.385), width: 16 Scan time: 2.580 The best scores are: opt bits E(32554) CCDS33905.1 TRA2B gene_id:6434|Hs108|chr3 ( 288) 1972 247.0 1.2e-65 CCDS58872.1 TRA2B gene_id:6434|Hs108|chr3 ( 188) 1319 169.2 2.1e-42 CCDS5383.1 TRA2A gene_id:29896|Hs108|chr7 ( 282) 1163 150.9 1e-36 CCDS64609.1 TRA2A gene_id:29896|Hs108|chr7 ( 181) 838 112.0 3.3e-25 CCDS75569.1 TRA2A gene_id:29896|Hs108|chr7 ( 180) 819 109.7 1.6e-24 >>CCDS33905.1 TRA2B gene_id:6434|Hs108|chr3 (288 aa) initn: 1972 init1: 1972 opt: 1972 Z-score: 1292.2 bits: 247.0 E(32554): 1.2e-65 Smith-Waterman score: 1972; 100.0% identity (100.0% similar) in 288 aa overlap (1-288:1-288) 10 20 30 40 50 60 pF1KE0 MSDSGEQNYGERESRSASRSGSAHGSGKSARHTPARSRSKEDSRRSRSKSRSRSESRSRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSDSGEQNYGERESRSASRSGSAHGSGKSARHTPARSRSKEDSRRSRSKSRSRSESRSRS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 RRSSRRHYTRSRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RRSSRRHYTRSRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 GVFGLSLYTTERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GVFGLSLYTTERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERAN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 GMELDGRRIRVDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGYDDRDYYSRSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GMELDGRRIRVDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGYDDRDYYSRSY 190 200 210 220 230 240 250 260 270 280 pF1KE0 RGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY :::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY 250 260 270 280 >>CCDS58872.1 TRA2B gene_id:6434|Hs108|chr3 (188 aa) initn: 1319 init1: 1319 opt: 1319 Z-score: 874.7 bits: 169.2 E(32554): 2.1e-42 Smith-Waterman score: 1319; 100.0% identity (100.0% similar) in 188 aa overlap (101-288:1-188) 80 90 100 110 120 130 pF1KE0 SRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCLGVFGLSLYTT :::::::::::::::::::::::::::::: CCDS58 MSTRRRHVGNRANPDPNCCLGVFGLSLYTT 10 20 30 140 150 160 170 180 190 pF1KE0 ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERANGMELDGRRIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERANGMELDGRRIR 40 50 60 70 80 90 200 210 220 230 240 250 pF1KE0 VDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGYDDRDYYSRSYRGGGGGGGGW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGYDDRDYYSRSYRGGGGGGGGW 100 110 120 130 140 150 260 270 280 pF1KE0 RAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY :::::::::::::::::::::::::::::::::::::: CCDS58 RAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY 160 170 180 >>CCDS5383.1 TRA2A gene_id:29896|Hs108|chr7 (282 aa) initn: 986 init1: 800 opt: 1163 Z-score: 772.6 bits: 150.9 E(32554): 1e-36 Smith-Waterman score: 1189; 65.2% identity (78.7% similar) in 305 aa overlap (1-288:1-282) 10 20 30 40 50 pF1KE0 MSDSGEQNYGERESRSASRSGSAHGSG-KSARHTPARSRSKEDSRRSRSKSRSRSESRSR ::: :.:. ::::: :.: .. . :: .. .:: :. :..:.:.:::::.:::: CCDS53 MSDVEENNFEGRESRSQSKSPTGTPARVKSESRSGSRSPSRV-SKHSESHSRSRSKSRSR 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 SRRSSRRHYTRSRSRSRSHRR-SRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNC ::: :.:.::::::.:.:::: ::::::. .::::.:.::::::.::::.:.::::::: CCDS53 SRRHSHRRYTRSRSHSHSHRRRSRSRSYTPEYRRRRSRSHSPMSNRRRHTGSRANPDPNT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 CLGVFGLSLYTTERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKER :::::::::::::::::::::.:::.. :..::::.. ::::::::::: .::.::: :: CCDS53 CLGVFGLSLYTTERDLREVFSRYGPLSGVNVVYDQRTGRSRGFAFVYFERIDDSKEAMER 120 130 140 150 160 170 180 190 200 210 220 pF1KE0 ANGMELDGRRIRVDFSITKRPHTPTPGIYMGRPTY--------------GSSRRRD-YYD ::::::::::::::.::::: :::::::::::::. :..:::: ::: CCDS53 ANGMELDGRRIRVDYSITKRAHTPTPGIYMGRPTHSGGGGGGGGGGGGGGGGRRRDSYYD 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE0 RGYDRGYDDRDYYSRSYRGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSY :::::::: . :. :: :::::::::::: :::::::::: CCDS53 RGYDRGYDRYEDYD--YR------------------YRRRSPSPYYSR--YRSRSRSRSY 240 250 260 270 pF1KE0 SPRRY ::::: CCDS53 SPRRY 280 >>CCDS64609.1 TRA2A gene_id:29896|Hs108|chr7 (181 aa) initn: 871 init1: 658 opt: 838 Z-score: 566.0 bits: 112.0 E(32554): 3.3e-25 Smith-Waterman score: 864; 68.5% identity (77.3% similar) in 203 aa overlap (101-288:1-181) 80 90 100 110 120 130 pF1KE0 SRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCLGVFGLSLYTT ::.::::.:.::::::: :::::::::::: CCDS64 MSNRRRHTGSRANPDPNTCLGVFGLSLYTT 10 20 30 140 150 160 170 180 190 pF1KE0 ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERANGMELDGRRIR :::::::::.:::.. :..::::.. ::::::::::: .::.::: :::::::::::::: CCDS64 ERDLREVFSRYGPLSGVNVVYDQRTGRSRGFAFVYFERIDDSKEAMERANGMELDGRRIR 40 50 60 70 80 90 200 210 220 230 pF1KE0 VDFSITKRPHTPTPGIYMGRPTY--------------GSSRRRD-YYDRGYDRGYDDRDY ::.::::: :::::::::::::. :..:::: ::::::::::: . CCDS64 VDYSITKRAHTPTPGIYMGRPTHSGGGGGGGGGGGGGGGGRRRDSYYDRGYDRGYDRYED 100 110 120 130 140 150 240 250 260 270 280 pF1KE0 YSRSYRGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY : .:: :::::::::::: ::::::::::::::: CCDS64 Y--DYR------------------YRRRSPSPYYSR--YRSRSRSRSYSPRRY 160 170 180 >>CCDS75569.1 TRA2A gene_id:29896|Hs108|chr7 (180 aa) initn: 822 init1: 661 opt: 819 Z-score: 553.8 bits: 109.7 E(32554): 1.6e-24 Smith-Waterman score: 848; 67.0% identity (75.9% similar) in 203 aa overlap (101-288:1-180) 80 90 100 110 120 130 pF1KE0 SRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCLGVFGLSLYTT ::.::::.:.::::::: :::::::::::: CCDS75 MSNRRRHTGSRANPDPNTCLGVFGLSLYTT 10 20 30 140 150 160 170 180 190 pF1KE0 ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERANGMELDGRRIR :::::::::.:::.. :..::::.. ::::::::::: .::.::: :::::::::::::: CCDS75 ERDLREVFSRYGPLSGVNVVYDQRTGRSRGFAFVYFERIDDSKEAMERANGMELDGRRIR 40 50 60 70 80 90 200 210 220 230 pF1KE0 VDFSITKRPHTPTPGIYMGRPTY--------------GSSRRRD-YYDRGYDRGYDDRDY ::.::::: :::::::::::::. :..:::: ::::::::::: . CCDS75 VDYSITKRAHTPTPGIYMGRPTHSGGGGGGGGGGGGGGGGRRRDSYYDRGYDRGYDRYED 100 110 120 130 140 150 240 250 260 270 280 pF1KE0 YSRSYRGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRSRSRSRSYSPRRY :. :: ::::::::: ::::::::::::::: CCDS75 YDYRYR---------------------RSPSPYYSR--YRSRSRSRSYSPRRY 160 170 180 288 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 23:39:29 2016 done: Sun Nov 6 23:39:30 2016 Total Scan time: 2.580 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]