Skip to content

Commit 31812cc

Browse files
Update 07-wordEmbed_intro.md
1 parent b02804d commit 31812cc

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

episodes/07-wordEmbed_intro.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -128,7 +128,7 @@ The learned vectors create useful and meaningful representations of words, captu
128128

129129
## Preliminary Considerations
130130
In determining whether or not Word2Vec is a suitable embedding method for your research, it's important to consider the following:
131-
* **Analysis Relevance**: Does examining the relationships and meanings among words serve as a guideline for your research? Are you able to pinpoint specific terms or clusters of terms that encapsulate the broader conceptual realms you are investigating?"
131+
* **Analysis Relevance**: Does examining the relationships and meanings among words serve as a guideline for your research? Are you able to pinpoint specific terms or clusters of terms that encapsulate the broader conceptual realms you are investigating?
132132
* **Data Quality**: Ensure that your text corpus is of high quality. Garbage or noisy data can adversely affect Word2Vec embeddings.
133133
* **Corpus Size**: Word2Vec performs better with larger corpora. Having substantial text data improves the quality of learned word vectors.
134134
* **Domain-Specific Data Availability**: Choose a dataset relevant to your DH research. If you’re analyzing historical texts, use historical documents. For sentiment analysis, domain-specific data matters.

0 commit comments

Comments
 (0)