This dataset concerns the language used by Philip Morris international (PMI) in their external communications. The dataset was collected in order to identify patterns in language used in different corporate communications. This dataset includes Annual Reports, Investor reports, Investor day slide decks, transcripts and reports. It is a record of the inductive coding conducted for this project. The dataset contains the final codebook, which was agreed and approved by all coders.