Fixing Microsoft 365 multiline JSON logs parsing#316
Conversation
|
thank you! reviewing this in conjunction with some other sample data and the Invictus folks WRT their tool output. |
|
I'm about to send a testing VM to a small group of people for validation. if you are interested, @BrianMer, please send me an email: Phil at lewestech dot com. while I have not yet integrated this PR, it would be done on that branch. can you share which tool/workflow are you using that generated this JSON in multiline format? so far, I haven't been able to identify a source that does and all UAL test data I've used so far is in "ndjson" format (albeit with a |
|
Hi @philhagen, I'm going to send you a mail for testing the VM, thank you for the proposition! In fact, the sample that I shared with you is a CSV converted into JSON by a homemade Python script, and I didn't know that it was ndjson format ; I didn't have any other sample in the correct "ndjson" format to test with. But I tested again by converting the CSV into "ndjson" format, and it is still working. I hope it is still working as well on your side. |
|
I suspect this may be an artifact of how the logs are retrieved. The workflow we've built around is from FOR509, which uses the PowerShell |
|
FWIW, I was able to parse these records when in ndjson form using the pending release of the VM mentioned above. there is currently a blocker (upstream from filebeat) that I have to clear before the testing VM can be released but the parser does generally handle these log entries. We may want to customize those somewhat but that is pretty straightforward and will be part of the testing process that I'll send out with the VM when it's ready. |


Hi,
Idk if this pull request can be related to issue #285, but I think I fixed multiline JSON array logs for M365 UAL parsing problem.
Here is a sample (anonymised), if you want to test it by yourself:
Thanks!