You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/main/asciidoc/en/working-with-text-en.adoc
+31-18Lines changed: 31 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -262,6 +262,7 @@ An edge between two terms will have:
262
262
The logic is simple, and yet there are some refinements to discuss. It will be up to you to decide what's preferable:
263
263
264
264
//ST: !
265
+
[[binary-counting]]
265
266
===== If 2 terms appear several times *in a given unit of text*, should their co-occurences be counted several times?
266
267
//ST: !
267
268
@@ -283,8 +284,8 @@ ____
283
284
//ST: !
284
285
The question is:
285
286
286
-
- should I count only *one* co-occurrence between `molecular` and `nanotechnology`, because it happened on this one web page?
287
-
- or should I consider that `molecular` appears twice on this page, and `nanotechnology` three times, so *multiple* co-occurrences between these 2 terms should be counted, just on this page already?
287
+
- should I count only *one* co-occurrence between `molecular` and `nanotechnology`, because it happened on this one web page? This is called *binary counting*
288
+
- or should I consider that `molecular` appears twice on this page, and `nanotechnology` three times, so *multiple* co-occurrences between these 2 terms should be counted, just on this page already? This is called *full counting*
288
289
289
290
There is no exact response, and you can experiment with both possibilities.
290
291
@@ -356,7 +357,7 @@ The network was built from the short summaries ("abstracts") of 1484 research ar
356
357
357
358
"social neuroscience" OR "neuroeco*" OR "decision neuroscience"
358
359
359
-
-> The query can be see https://www.ncbi.nlm.nih.gov/pubmed?term=(%22social%20neuroscience%22%20OR%20%22neuroeco*%22%20OR%20%22decision%20neuroscience%22)[online here].
360
+
-> The query can be seent at https://www.ncbi.nlm.nih.gov/pubmed?term=(%22social%20neuroscience%22%20OR%20%22neuroeco*%22%20OR%20%22decision%20neuroscience%22)[online here].
360
361
(it comprises more than 1484 results, because some articles have no abstract).
361
362
362
363
We used https://github.com/seinecle/Cowo[Cowo] to create the network from these 1484 short pieces of text, based on co-occurrences.
@@ -371,36 +372,40 @@ image::semantic-import-1-en.png[align="center", title="First view of the network
0 commit comments