Skip to content

Commit e55e4b9

Browse files
authored
Update base.R
1 parent 67b8bc1 commit e55e4b9

1 file changed

Lines changed: 2 additions & 0 deletions

File tree

  • server/preprocessing/other-scripts

server/preprocessing/other-scripts/base.R

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -140,6 +140,8 @@ get_papers <- function(query, params, limit=100,
140140
subject_cleaned = gsub(" ?\\d[:?-?]?(\\d+.)+", "", subject_cleaned) # replace residuals like 5:621.313.323 or '5-76.95'
141141
subject_cleaned = gsub("\\w+:\\w+-(\\w+\\/)+", "", subject_cleaned) # replace residuals like Info:eu-repo/classification/
142142
subject_cleaned = gsub("^; $", "", subject_cleaned) # replace residuals like Info:eu-repo/classification/
143+
subject_cleaned = gsub(",", ", ", subject_cleaned) # clean up keyword separation
144+
subject_cleaned = gsub("\\s+", " ", subject_cleaned) # clean up keyword separation
143145

144146
metadata$subject = subject_cleaned
145147

0 commit comments

Comments
 (0)