Ownership and Product Similarity

Firm Clustering from BERT embeddings of 10-K filings

I generate a time-varying measure of similarity between S&P 500 firms using a zero-shot clustering model. The model takes as input BERT embeddings of product descriptions from 10-K filings and is trained on market definitions from the European Commission. The objective is to estimate the causal effect of common ownership on product similarity.
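
As a rough illustration of what a time-varying pairwise similarity measure can look like, the sketch below computes cosine similarity between firm embedding vectors within each year. It is not the clustering model described above: cosine similarity is swapped in as a simple stand-in, and the tickers, vectors, and function names are hypothetical placeholders, not real BERT embeddings or filing data.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_u * norm_v)


def pairwise_similarity_by_year(embeddings):
    """Map {year: {ticker: vector}} to {year: {(ticker_a, ticker_b): similarity}}.

    Pairs are keyed in alphabetical ticker order, one entry per unordered pair.
    """
    result = {}
    for year, firms in embeddings.items():
        pairs = {}
        for a, b in combinations(sorted(firms), 2):
            pairs[(a, b)] = cosine_similarity(firms[a], firms[b])
        result[year] = pairs
    return result


# Hypothetical toy vectors standing in for embeddings of product descriptions.
toy_embeddings = {
    2019: {"AAA": [1.0, 0.0, 0.0], "BBB": [0.0, 1.0, 0.0], "CCC": [1.0, 1.0, 0.0]},
    2020: {"AAA": [1.0, 0.0, 0.0], "BBB": [1.0, 0.0, 0.0]},
}

similarity = pairwise_similarity_by_year(toy_embeddings)
```

Keying the result by year makes the time variation explicit: in the toy data, the hypothetical firms AAA and BBB have orthogonal vectors in 2019 and identical vectors in 2020, so their similarity changes across years.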