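FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2636, 354 aa
  1>>>pF1KE2636 354 - 354 aa - 354 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4052+/-0.000809; mu= 15.5019+/- 0.049
 mean_var=67.2252+/-13.593, 0's: 0 Z-trim(106.8): 47 B-trim: 0 in 0/51
 Lambda= 0.156426
 statistics sampled from 9164 (9206) to 9164 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.283), width: 16
 Scan time: 2.450

The best scores are:                                      opt  bits E(32554)
CCDS47242.1 VCAN gene_id:1462|Hs108|chr5         ( 655)  2384 547.1 1.6e-155
CCDS54876.1 VCAN gene_id:1462|Hs108|chr5         (2409)  2385 547.6   4e-155
CCDS54875.1 VCAN gene_id:1462|Hs108|chr5         (1642)  2378 545.9 8.6e-155
CCDS4060.1 VCAN gene_id:1462|Hs108|chr5          (3396)  2378 546.1 1.6e-154
CCDS53971.1 ACAN gene_id:176|Hs108|chr15         (2431)  1208 282.0  3.7e-75
CCDS53970.1 ACAN gene_id:176|Hs108|chr15         (2530)  1208 282.0  3.8e-75
CCDS12397.1 NCAN gene_id:1463|Hs108|chr19        (1321)  1127 263.6    7e-70
CCDS1150.1 BCAN gene_id:63827|Hs108|chr1         ( 671)  1092 255.5  9.4e-68
CCDS1149.1 BCAN gene_id:63827|Hs108|chr1         ( 911)  1092 255.6  1.2e-67
CCDS4061.1 HAPLN1 gene_id:1404|Hs108|chr5        ( 354)   780 185.0  8.6e-47
CCDS10346.1 HAPLN3 gene_id:145864|Hs108|chr15    ( 360)   753 178.9  5.9e-45
CCDS76790.1 HAPLN3 gene_id:145864|Hs108|chr15    ( 422)   753 178.9  6.8e-45
CCDS1148.1 HAPLN2 gene_id:60484|Hs108|chr1       ( 340)   681 162.6  4.4e-40
CCDS12398.1 HAPLN4 gene_id:404037|Hs108|chr19    ( 402)   382  95.2    1e-19

>>CCDS47242.1 VCAN gene_id:1462|Hs108|chr5 (655 aa)
 initn: 2382 init1: 2382 opt: 2384 Z-score: 2905.8 bits: 547.1 E(32554): 1.6e-155
Smith-Waterman score: 2384; 99.1% identity (99.4% similar) in 352 aa overlap (1-350:1-352)

               10        20        30        40        50        60
pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS
              250       260       270       280       290       300

              310       320       330       340         350
pF1KE2 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKR--KCLIPF
       ::::::::::::::::::::::::::::::::::::::::::::::::  .:
CCDS47 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRPDRCKMNPCLNG
              310       320       330       340       350       360

CCDS47 GTCYPTETSYVCTCVPGYSGDQCELDFDECHSNPCRNGATCVDGFNTFRCLCLPSYVGAL
              370       380       390       400       410       420

>>CCDS54876.1 VCAN gene_id:1462|Hs108|chr5 (2409 aa)
 initn: 2565 init1: 2385 opt: 2385 Z-score: 2898.5 bits: 547.6 E(32554): 4e-155
Smith-Waterman score: 2385; 99.7% identity (100.0% similar) in 349 aa overlap (1-349:1-349)

               10        20        30        40        50        60
pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL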
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKR--KCLIPF :::::::::::::::::::::::::::::::::::::::::::::::: .: CCDS47 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRPDRCKMNPCLNG 310 320 330 340 350 360 CCDS47 GTCYPTETSYVCTCVPGYSGDQCELDFDECHSNPCRNGATCVDGFNTFRCLCLPSYVGAL 370 380 390 400 410 420 >>CCDS54876.1 VCAN gene_id:1462|Hs108|chr5 (2409 aa) initn: 2565 init1: 2385 opt: 2385 Z-score: 2898.5 bits: 547.6 E(32554): 4e-155 Smith-Waterman score: 2385; 99.7% identity (100.0% similar) in 349 aa overlap (1-349:1-349) 10 20 30 40 50 60 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLIPF ::::::::::::::::::::::::::::::::::::::::::::::::. 
CCDS54 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRRMSDLSVIGHPI 310 320 330 340 350 360 CCDS54 DSESKEDEPCSEETDPVHDLMAEILPEFPDIIEIDLYHSEENEEEEEECANATDVTTTPS 370 380 390 400 410 420 >>CCDS54875.1 VCAN gene_id:1462|Hs108|chr5 (1642 aa) initn: 2558 init1: 2378 opt: 2378 Z-score: 2892.5 bits: 545.9 E(32554): 8.6e-155 Smith-Waterman score: 2378; 99.7% identity (99.7% similar) in 349 aa overlap (1-349:1-349) 10 20 30 40 50 60 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLIPF ::::::::::::::::::::::::::::::::::::::::::::::: : CCDS54 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKPKEATTIDLSILA 310 320 330 340 350 360 CCDS54 ETASPSLSKEPQMVSDRTTPIIPLVDELPVIPTEFPPVGNIVSFEQKATVQPQAITDSLA 370 380 390 400 410 420 >>CCDS4060.1 VCAN gene_id:1462|Hs108|chr5 (3396 aa) initn: 2558 
init1: 2378 opt: 2378 Z-score: 2887.7 bits: 546.1 E(32554): 1.6e-154 Smith-Waterman score: 2378; 99.7% identity (99.7% similar) in 349 aa overlap (1-349:1-349) 10 20 30 40 50 60 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTLPPSYNTSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 FLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGDASLTVVKL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 LASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAVIA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 TPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQETYD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGWLS 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLIPF ::::::::::::::::::::::::::::::::::::::::::::::: : CCDS40 DASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKPKEATTIDLSILA 310 320 330 340 350 360 CCDS40 ETASPSLSKEPQMVSDRTTPIIPLVDELPVIPTEFPPVGNIVSFEQKATVQPQAITDSLA 370 380 390 400 410 420 >>CCDS53971.1 ACAN gene_id:176|Hs108|chr15 (2431 aa) initn: 2211 init1: 1140 opt: 1208 Z-score: 1462.9 bits: 282.0 E(32554): 3.7e-75 Smith-Waterman score: 1208; 49.9% identity (77.5% similar) in 355 aa overlap (5-346:1-349) 10 20 30 40 pF1KE2 MFINIKSILWMCSTL-IVTHAL------H----KVKVGKSPPVRGSLSGKVSLPCHF-ST . ..::. 
:: ..: :. : .:.. . :.: :. ....::.: . CCDS53 MTTLLWVFVTLRVITAAVTVETSDHDNSLSVSIPQPSPLRVLLGTSLTIPCYFIDP 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 MPTLPPSYNTSEFL-RIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHP : . . .:. . :::::.. . ::...::: .: ..... :. .::.:..: CCDS53 MHPVTTAPSTAPLAPRIKWSRVSKE------KEVVLLVATEGRVRVNSAYQDKVSLPNYP 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 EAVGDASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEA .::.: : .: ..:.:.:::.::.::::.. :. ..: :.:::::: ..::::.:. CCDS53 AIPSDATLEVQSLRSNDSGVYRCEVMHGIEDSEATLEVVVKGIVFHYRAISTRYTLDFDR 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 AQKACLDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGV ::.:::. .:.::::::: :::::::.::::::::::::::::..:: :::::: :: CCDS53 AQRACLQNSAIIATPEQLQAAYEDGFHQCDAGWLADQTVRYPIHTPREGCYGDKDEFPGV 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE2 RTYGFRSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAW ::::.:. .:::::::......:.::. : : ::::.:::.::. :::::.:.: :: CCDS53 RTYGIRDTNETYDVYCFAEEMEGEVFYATSPEKFTFQEAANECRRLGARLATTGQLYLAW 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE2 RNGFDQCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFK . :.:.:. :::.: :::.:.. :: .:::.::::::.: ::::.: :.::.:: :. CCDS53 QAGMDMCSAGWLADRSVRYPISKARPNCGGNLLGVRTVYVHANQTGYPDPSSRYDAICYT 300 310 320 330 340 350 350 pF1KE2 RKCLIPF CCDS53 GEDFVDIPENFFGVGGEEDITVQTVTWPDMELPLPRNITEGEARGSVILTVKPIFEVSPS 360 370 380 390 400 410 >-- initn: 1077 init1: 858 opt: 863 Z-score: 1042.1 bits: 204.1 E(32554): 1e-51 Smith-Waterman score: 863; 59.0% identity (78.0% similar) in 205 aa overlap (149-353:477-681) 120 130 140 150 160 170 pF1KE2 KLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAV ::::::: . .::.:.:: ::.::: .::: CCDS53 TPGLGPATAFTSEDLVVQVTAVPGQPHLPGGVVFHYRPGPTRYSLTFEEAQQACLRTGAV 450 460 470 480 490 500 180 190 200 210 220 230 pF1KE2 IATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQET ::.:::: :::: :.:::::::: :::::::: .::. : ::: .. 
:::::: : :: CCDS53 IASPEQLQAAYEAGYEQCDAGWLRDQTVRYPIVSPRTPCVGDKDSSPGVRTYGVRPSTET 510 520 530 540 550 560 240 250 260 270 280 290 pF1KE2 YDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGW :::::.::.:.:.:: : .:::.:: . ::...: :::.:.: ::: :.:.: :: CCDS53 YDVYCFVDRLEGEVFFATRLEQFTFQEALEFCESHNATLATTGQLYAAWSRGLDKCYAGW 570 580 590 600 610 620 300 310 320 330 340 350 pF1KE2 LSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLIPF :.:.:.:.:... : ::: ::::.: . ::::.: : :: :.::. .: CCDS53 LADGSLRYPIVTPRPACGGDKPGVRTVYLYPNQTGLPDPLSRHHAFCFRGISAVPSPGEE 630 640 650 660 670 680 CCDS53 EGGTPTSPSGVEEWIVTQVVPGVAAVPVEEETTAVPSGETTAILEFTTEPENQTEWEPAY 690 700 710 720 730 740 >>CCDS53970.1 ACAN gene_id:176|Hs108|chr15 (2530 aa) initn: 2211 init1: 1140 opt: 1208 Z-score: 1462.6 bits: 282.0 E(32554): 3.8e-75 Smith-Waterman score: 1208; 49.9% identity (77.5% similar) in 355 aa overlap (5-346:1-349) 10 20 30 40 pF1KE2 MFINIKSILWMCSTL-IVTHAL------H----KVKVGKSPPVRGSLSGKVSLPCHF-ST . ..::. :: ..: :. : .:.. . :.: :. ....::.: . CCDS53 MTTLLWVFVTLRVITAAVTVETSDHDNSLSVSIPQPSPLRVLLGTSLTIPCYFIDP 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 MPTLPPSYNTSEFL-RIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHP : . . .:. . :::::.. . ::...::: .: ..... :. .::.:..: CCDS53 MHPVTTAPSTAPLAPRIKWSRVSKE------KEVVLLVATEGRVRVNSAYQDKVSLPNYP 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 EAVGDASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEA .::.: : .: ..:.:.:::.::.::::.. :. ..: :.:::::: ..::::.:. CCDS53 AIPSDATLEVQSLRSNDSGVYRCEVMHGIEDSEATLEVVVKGIVFHYRAISTRYTLDFDR 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 AQKACLDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGV ::.:::. .:.::::::: :::::::.::::::::::::::::..:: :::::: :: CCDS53 AQRACLQNSAIIATPEQLQAAYEDGFHQCDAGWLADQTVRYPIHTPREGCYGDKDEFPGV 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE2 RTYGFRSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAW ::::.:. .:::::::......:.::. : : ::::.:::.::. 
:::::.:.: :: CCDS53 RTYGIRDTNETYDVYCFAEEMEGEVFYATSPEKFTFQEAANECRRLGARLATTGQLYLAW 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE2 RNGFDQCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFK . :.:.:. :::.: :::.:.. :: .:::.::::::.: ::::.: :.::.:: :. CCDS53 QAGMDMCSAGWLADRSVRYPISKARPNCGGNLLGVRTVYVHANQTGYPDPSSRYDAICYT 300 310 320 330 340 350 350 pF1KE2 RKCLIPF CCDS53 GEDFVDIPENFFGVGGEEDITVQTVTWPDMELPLPRNITEGEARGSVILTVKPIFEVSPS 360 370 380 390 400 410 >-- initn: 1077 init1: 858 opt: 863 Z-score: 1041.8 bits: 204.1 E(32554): 1e-51 Smith-Waterman score: 863; 59.0% identity (78.0% similar) in 205 aa overlap (149-353:477-681) 120 130 140 150 160 170 pF1KE2 KLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKACLDVGAV ::::::: . .::.:.:: ::.::: .::: CCDS53 TPGLGPATAFTSEDLVVQVTAVPGQPHLPGGVVFHYRPGPTRYSLTFEEAQQACLRTGAV 450 460 470 480 490 500 180 190 200 210 220 230 pF1KE2 IATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGFRSPQET ::.:::: :::: :.:::::::: :::::::: .::. : ::: .. :::::: : :: CCDS53 IASPEQLQAAYEAGYEQCDAGWLRDQTVRYPIVSPRTPCVGDKDSSPGVRTYGVRPSTET 510 520 530 540 550 560 240 250 260 270 280 290 pF1KE2 YDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFDQCDYGW :::::.::.:.:.:: : .:::.:: . ::...: :::.:.: ::: :.:.: :: CCDS53 YDVYCFVDRLEGEVFFATRLEQFTFQEALEFCESHNATLATTGQLYAAWSRGLDKCYAGW 570 580 590 600 610 620 300 310 320 330 340 350 pF1KE2 LSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLIPF :.:.:.:.:... : ::: ::::.: . ::::.: : :: :.::. .: CCDS53 LADGSLRYPIVTPRPACGGDKPGVRTVYLYPNQTGLPDPLSRHHAFCFRGISAVPSPGEE 630 640 650 660 670 680 CCDS53 EGGTPTSPSGVEEWIVTQVVPGVAAVPVEEETTAVPSGETTAILEFTTEPENQTEWEPAY 690 700 710 720 730 740 >>CCDS12397.1 NCAN gene_id:1463|Hs108|chr19 (1321 aa) initn: 1219 init1: 1038 opt: 1127 Z-score: 1368.1 bits: 263.6 E(32554): 7e-70 Smith-Waterman score: 1127; 49.8% identity (76.5% similar) in 327 aa overlap (21-347:37-357) 10 20 30 40 50 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMP :: :.: : :...:. :.::: :. 
.: CCDS12 WALGLLMLQMLLFVAGEQGTQDITDASERGLHMQKLG-SGSVQAALAELVALPCLFTLQP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 TLPPSYNTSEFLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAV :: . . ::::.:... .. . .. .:::... ........::::.:..:. CCDS12 R--PS-AARDAPRIKWTKVRTASGQR--QDLPILVAKDNVVRVAKSWQGRVSLPSYPRRR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE2 GDASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQK ..:.: . : :::.:::::.:. :::: :: : : : :::::::.: .::.:.: ::. CCDS12 ANATLLLGPLRASDSGLYRCQVVRGIEDEQDLVPLEVTGVVFHYRSARDRYALTFAEAQE 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE2 ACLDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTY :: .:.::.:..: ::.::::..::::::.:.:::::: : :::::. . :::.: CCDS12 ACRLSSAIIAAPRHLQAAFEDGFDNCDAGWLSDRTVRYPITQSRPGCYGDRSSLPGVRSY 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE2 GFRSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNG : :.::: :::::.. .: :.::.. ..:. : .:. : : ::.::.:. ::..: CCDS12 GRRNPQELYDVYCFARELGGEVFYVGPARRLTLAGARAQCRRQGAALASVGQLHLAWHEG 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE2 FDQCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKC .:::: :::.:.:::.:. . : .::: ::::.::: :.:::: : :::::::. CCDS12 LDQCDPGWLADGSVRYPIQTPRRRCGGPAPGVRTVYRFANRTGFPSPAERFDAYCFRAHH 310 320 330 340 350 360 pF1KE2 LIPF CCDS12 PTSQHGDLETPSSGDEGEILSAEGPPVRELEPTLEEEEVVTPDFQEPLVSSGEEETLILE 370 380 390 400 410 420 >>CCDS1150.1 BCAN gene_id:63827|Hs108|chr1 (671 aa) initn: 1230 init1: 1013 opt: 1092 Z-score: 1329.9 bits: 255.5 E(32554): 9.4e-68 Smith-Waterman score: 1092; 46.8% identity (76.3% similar) in 325 aa overlap (23-347:36-354) 10 20 30 40 50 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTL .:... . :..: :.: ...::: . CCDS11 LPLLAALVLAQAPAALADVLEGDSSEDRAFRVRIAGDAPLQGVLGGALTIPCHVHYLRPP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 PPSYNTSEFLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGD : . :.::. . . :. :. ::::.. .:... :. ::..:..: .. 
: CCDS11 PSRRAVLGSPRVKWTFL---SRGR---EAEVLVARGVRVKVNEAYRFRVALPAYPASLTD 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 ASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKAC .::.. .: .:.:.:::.:..::.:..:.: . : :::: :: ...::...: .::.:: CCDS11 VSLALSELRPNDSGIYRCEVQHGIDDSSDAVEVKVKGVVFLYREGSARYAFSFSGAQEAC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE2 LDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGF .:: :::::::.::: :.::::::::.::::::::..:: .:::: : :::.:: CCDS11 ARIGAHIATPEQLYAAYLGGYEQCDAGWLSDQTVRYPIQTPREACYGDMDGFPGVRNYGV 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 RSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFD .:.. ::::::.. :.:..: : :.:.::: :... :..::.:.: ::: .:.: CCDS11 VDPDDLYDVYCYAEDLNGELFLGDPPEKLTLEEARAYCQERGAEIATTGQLYAAWDGGLD 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 QCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLI .:. :::.:.:::.:... .::::: ::.::. : :::::: :::..:::. CCDS11 HCSPGWLADGSVRYPIVTPSQRCGGGLPGVKTLFLFPNQTGFPNKHSRFNVYCFRDSAQP 300 310 320 330 340 350 pF1KE2 PF CCDS11 SAIPEASNPASNPASDGLEAIVTVTETLEELQLPQEATESESRGAIYSIPIMEDGGGGSS 360 370 380 390 400 410 >>CCDS1149.1 BCAN gene_id:63827|Hs108|chr1 (911 aa) initn: 1230 init1: 1013 opt: 1092 Z-score: 1327.9 bits: 255.6 E(32554): 1.2e-67 Smith-Waterman score: 1092; 46.8% identity (76.3% similar) in 325 aa overlap (23-347:36-354) 10 20 30 40 50 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFSTMPTL .:... . :..: :.: ...::: . CCDS11 LPLLAALVLAQAPAALADVLEGDSSEDRAFRVRIAGDAPLQGVLGGALTIPCHVHYLRPP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 PPSYNTSEFLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPEAVGD : . :.::. . . :. :. ::::.. .:... :. ::..:..: .. : CCDS11 PSRRAVLGSPRVKWTFL---SRGR---EAEVLVARGVRVKVNEAYRFRVALPAYPASLTD 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 ASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAAQKAC .::.. .: .:.:.:::.:..::.:..:.: . 
: :::: :: ...::...: .::.:: CCDS11 VSLALSELRPNDSGIYRCEVQHGIDDSSDAVEVKVKGVVFLYREGSARYAFSFSGAQEAC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE2 LDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVRTYGF .:: :::::::.::: :.::::::::.::::::::..:: .:::: : :::.:: CCDS11 ARIGAHIATPEQLYAAYLGGYEQCDAGWLSDQTVRYPIQTPREACYGDMDGFPGVRNYGV 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 RSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAWRNGFD .:.. ::::::.. :.:..: : :.:.::: :... :..::.:.: ::: .:.: CCDS11 VDPDDLYDVYCYAEDLNGELFLGDPPEKLTLEEARAYCQERGAEIATTGQLYAAWDGGLD 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 QCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSRFDAYCFKRKCLI .:. :::.:.:::.:... .::::: ::.::. : :::::: :::..:::. CCDS11 HCSPGWLADGSVRYPIVTPSQRCGGGLPGVKTLFLFPNQTGFPNKHSRFNVYCFRDSAQP 300 310 320 330 340 350 pF1KE2 PF CCDS11 SAIPEASNPASNPASDGLEAIVTVTETLEELQLPQEATESESRGAIYSIPIMEDGGGGSS 360 370 380 390 400 410 >>CCDS4061.1 HAPLN1 gene_id:1404|Hs108|chr5 (354 aa) initn: 700 init1: 505 opt: 780 Z-score: 953.6 bits: 185.0 E(32554): 8.6e-47 Smith-Waterman score: 782; 39.5% identity (66.3% similar) in 332 aa overlap (19-347:39-351) 10 20 30 40 pF1KE2 MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPCHFST : : ... .: :: :.:.:::.: CCDS40 LISICWADHLSDNYTLDHDRAIHIQAENGPHLLVEAEQAKVFSHRG---GNVTLPCKFYR 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE2 MPTLPPSYNTSEFLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVSVPTHPE :: : . .::::.:. : :::. :.:... . : :.::: . . CCDS40 DPTAFGSGIHK--IRIKWTKLTSDY----LKEVDVFVSMGYHKKTYGGYQGRVFLKGGSD 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 AVGDASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSRYTLNFEAA . ::::... : : : :.:.:. :.:: .:.: ..:::: : .::.:::. : CCDS40 S--DASLVITDLTLEDYGRYKCEVIEGLEDDTVVVALDLQGVVFPYFPRLGRYNLNFHEA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 QKACLDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCYGDKMGKAGVR :.:::: ::::. .::. :.. :.. :.::::.: .:.::: :: : : . 
::: CCDS40 QQACLDQDAVIASFDQLYDAWRGGLDWCNAGWLSDGSVQYPITKPREPC-GGQNTVPGVR 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE2 TYGFRSPQET-YDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDARLATVGELQAAW .::: . ... :::.:......: ..: :.:.:..::.. : :. :..: ::.. ::: CCDS40 NYGFWDKDKSRYDVFCFTSNFNGRFYYLIHPTKLTYDEAVQACLNDGAQIAKVGQIFAAW 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE2 RN-GFDQCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGFPPPDSR-FDAYC . :.:.:: :::.:.:::.:.. : .:. .:: . ::: . . .:: CCDS40 KILGYDRCDAGWLADGSVRYPISRPRRRCSPTEAAVRFV-------GFPDKKHKLYGVYC 300 310 320 330 340 350 pF1KE2 FKRKCLIPF :. CCDS40 FRAYN 350 354 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 17:35:30 2016 done: Tue Nov 8 17:35:31 2016 Total Scan time: 2.450 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]
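
The rows of the "The best scores are:" table above share one layout: accession, description, library sequence length in parentheses, then the opt score, bit score, and E() value. A minimal Python sketch for reading such rows follows. The names `HIT_RE`, `parse_hit`, and `significant_hits` are hypothetical helpers introduced here for illustration; they are not part of the FASTA package, and the pattern is only assumed to fit rows shaped like the ones in this report.

```python
import re

# Hypothetical pattern for one summary row, e.g.
# "CCDS47242.1 VCAN gene_id:1462|Hs108|chr5 ( 655) 2384 547.1 1.6e-155"
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"          # accession, free-text description
    r"\(\s*(?P<length>\d+)\)\s+"                 # library sequence length, e.g. "( 655)"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+" # opt score and bit score
    r"(?P<evalue>[0-9.]+(?:[eE][+-]?\d+)?)\s*$"  # E() value
)

def parse_hit(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "desc": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }

def significant_hits(lines, max_evalue=1e-3):
    """Keep parsed rows whose E() value is at or below max_evalue (an assumed cutoff)."""
    hits = (parse_hit(line) for line in lines)
    return [h for h in hits if h is not None and h["evalue"] <= max_evalue]

if __name__ == "__main__":
    rows = [
        "CCDS47242.1 VCAN gene_id:1462|Hs108|chr5 ( 655) 2384 547.1 1.6e-155",
        "CCDS12398.1 HAPLN4 gene_id:404037|Hs108|chr19 ( 402) 382 95.2 1e-19",
    ]
    for hit in significant_hits(rows):
        print(hit["acc"], hit["length"], hit["opt"], hit["bits"], hit["evalue"])
```

The `1e-3` cutoff is an arbitrary illustration, not a value taken from this run; lines that do not look like summary rows (headers, alignment text) are skipped rather than raising an error.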