Skip to content

Commit 52f7dc1

Browse files
committed
updated working with text tuto with proper titling
1 parent 82fd3fc commit 52f7dc1

2 files changed

Lines changed: 31 additions & 18 deletions

File tree

src/main/asciidoc/en/working-with-text-en.adoc

Lines changed: 31 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -262,6 +262,7 @@ An edge between two terms will have:
262262
The logic is simple, and yet there are some refinements to discuss. It will be up to you to decide what's preferable:
263263

264264
//ST: !
265+
[[binary-counting]]
265266
===== If 2 terms appear several times *in a given unit of text*, should their co-occurences be counted several times?
266267
//ST: !
267268

@@ -283,8 +284,8 @@ ____
283284
//ST: !
284285
The question is:
285286

286-
- should I count only *one* co-occurrence between `molecular` and `nanotechnology`, because it happened on this one web page?
287-
- or should I consider that `molecular` appears twice on this page, and `nanotechnology` three times, so *multiple* co-occurrences between these 2 terms should be counted, just on this page already?
287+
- should I count only *one* co-occurrence between `molecular` and `nanotechnology`, because it happened on this one web page? This is called *binary counting*
288+
- or should I consider that `molecular` appears twice on this page, and `nanotechnology` three times, so *multiple* co-occurrences between these 2 terms should be counted, just on this page already? This is called *full counting*
288289

289290
There is no exact response, and you can experiment with both possibilities.
290291

@@ -356,7 +357,7 @@ The network was built from the short summaries ("abstracts") of 1484 research ar
356357

357358
"social neuroscience" OR "neuroeco*" OR "decision neuroscience"
358359

359-
-> The query can be see https://www.ncbi.nlm.nih.gov/pubmed?term=(%22social%20neuroscience%22%20OR%20%22neuroeco*%22%20OR%20%22decision%20neuroscience%22)[online here].
360+
-> The query can be seent at https://www.ncbi.nlm.nih.gov/pubmed?term=(%22social%20neuroscience%22%20OR%20%22neuroeco*%22%20OR%20%22decision%20neuroscience%22)[online here].
360361
(it comprises more than 1484 results, because some articles have no abstract).
361362

362363
We used https://github.com/seinecle/Cowo[Cowo] to create the network from these 1484 short pieces of text, based on co-occurrences.
@@ -371,36 +372,40 @@ image::semantic-import-1-en.png[align="center", title="First view of the network
371372
//ST: !
372373
Several steps will make this network intelligible
373374

375+
//ST: !
376+
==== 2. Managing labels size and colors
377+
//ST: !
378+
374379

375380
//ST: !
376-
1. Showing the labels of the nodes:
381+
a. Showing the labels of the nodes:
377382

378383
image::https://docs.google.com/drawings/d/1o-J7Uce1okDdIN_IgvZaH4MdGjn74_ChTlXo-nxS-w4/pub?w=960&h=720[align="center", title="showing node labels"]
379384

380385

381386
//ST: !
382387
[start=2]
383-
2. Making the edges (relations) invisible, because they clutter the view
388+
b. Making the edges (relations) invisible, because they clutter the view
384389

385390
image::https://docs.google.com/drawings/d/1EZuk_b2plau6HzFNkkB64Mep0I0xq5l5sJhnIsOpSMM/pub?w=1440&h=1080[align="center", title="hiding edges"]
386391

387392

388393
//ST: !
389394
[start=3]
390-
3. Reducing node size to the minimum (0.5) because we just need labels
395+
c. Reducing node size to the minimum (0.5) because we just need labels
391396

392397
image::semantic-resize-nodes1-en.png[align="center", title="Making nodes disappear"]
393398

394399

395400
//ST: !
396401
[start=4]
397-
4. Detect communities with the "modularity" function in the statistics panel
402+
d. Detect communities with the "modularity" function in the statistics panel
398403

399404
see tutorial 'simple project from A to Z' for this step
400405

401406
//ST: !
402407
[start=5]
403-
5. Give a different color to each community
408+
e. Give a different color to each community
404409

405410
-> each group of terms, distinguished by a color, will represent a topic.
406411

@@ -409,38 +414,47 @@ image::semantic-coloring-communities-1-en.png[align="center", title="Coloring no
409414

410415
//ST: !
411416
[start=6]
412-
6. We then need to assign this node color to their labels:
417+
f. We then need to assign this node color to their labels:
413418

414419
image::https://docs.google.com/drawings/d/1fAyeO4K9cv-HV4AM0YrL8yQXw8hKxqWX-1K1GlwogN0/pub?w=1440&h=1080[align="center", title="Coloring nodes - second step"]
415420

416421

417422
//ST: !
418-
[start=7]
419-
7. Spatializing the network with Force Atlas 2 will place related terms next to each other, because they co-occur:
423+
==== 3. Tuning the spatialization
424+
//ST: !
425+
426+
427+
//ST: !
428+
a. Spatializing the network with Force Atlas 2 will place related terms next to each other, because they co-occur:
420429

421430
Some parameters have been modified:
422431

423432
image::semantic-spatializing-1-en.png[align="center", title="Spatializing the network"]
424433

425434
//ST: !
426-
[start=8]
427-
8. The network so far:
435+
[start=2]
436+
b. The network so far:
428437

429438
image::semantic-spatializing-2-en.png[align="center", title="The network - colored and spatialized"]
430439

431440

432441
//ST: !
433-
[start=9]
434-
9. Apply 2 more layouts to enhance readability:
442+
[start=3]
443+
c. Apply 2 more layouts to enhance readability:
435444

436445
- "Expansion" to spread nodes (just select it and click on Run a couple of times)
437446
- "Label Adjust" to move labels around so that they don't overlap
438447

439448
image::semantic-spatializing-3-en.png[align="center", title="Spreading labels"]
440449

450+
441451
//ST: !
442-
[start=10]
443-
10. Switching to the preview panel
452+
==== 4. Exporting an image of the network
453+
//ST: !
454+
455+
456+
//ST: !
457+
a. Switching to the preview panel
444458

445459
- A number of parameters must be modified (to show Labels, hide edges, etc.)
446460

@@ -449,7 +463,6 @@ image::semantic-preview-1-en.png[align="center", title="The preview panel"]
449463
//ST: !
450464
The network is now ready to be exported to pdf, png or svg file formats.
451465

452-
453466
== More tutorials on working with semantic networks
454467
//ST: More tutorials on working with semantic networks
455468
//ST: !
1.32 MB
Binary file not shown.

0 commit comments

Comments
 (0)