Со всеми путаницами, это вопрос:
data = readLines("file.txt")
# data reads
[1] "JESSICA [Day 1, 9:00 A.M.]: When there is sun, there was darkness."
[2] " However, nobody knew it was happening."
[3] " SAM [Day 1, 9:01 A.M.]: I thought it was not true."
[4] " But it was."
[5] " I thought it was "present" but it wasn't."
То, что я пытаюсь сделать, это:
(1) объединить текст по имени (JESSICA, SAM).
Я могу определить имена в данных
test = regexpr("^([A-Z]+ \\[)",data)
names = regmatches(data,test)
final.name = sub("\\[","",names)
[1] "JESSICA" "SAM"
Я могу определить дату и время в данных
test = regexpr("\\[(.*)\\]", data)
time = regmatches(data,test)
[1] "[Day 1, 9:00 A.M.]" "[Day 1, 9:01 A.M.]"
У меня возникли трудности с объединением разных строк для каждого имени. То есть вместо этого:
[1] "JESSICA [Day 1, 9:00 A.M.]: When there is sun, there was darkness."
[2] " However, nobody knew it was happening."
Я хочу, чтобы каждая строка была:
[1] "JESSICA [Day 1, 9:00 A.M.]: When there is sun, there was darkness. However, nobody knew it was happening."
[2] " SAM [Day 1, 9:01 A.M.]: I thought it was not true. But it was. I thought it was "present" but it wasn't."