Another student with bulk text editing experience- regular expressions, edit macros, delimiters, and scripting - might help until the developer returns from leave. MarcEdit MarkMaker expects its mrk text format, other shrink-wrapped "text to MARC" tools will have similar requirements. A Python developer could use Python string features for cleanup and then the PyMarc library to generate the MARC output.
-----Original Message-----
From: Code for Libraries <[log in to unmask]> On Behalf Of Hammer, Erich F
Sent: Monday, July 28, 2025 3:00 PM
To: [log in to unmask]
Subject: [EXTERNAL EMAIL] Re: [CODE4LIB] Converting image of MARC to text MARC?
Here is a random example.
Don't grind too hard on it; I think we have found a bit of success feeding these to M365 CoPilot (which is licensed to us). It's not perfect and there is still some cleanup, but that would still be true if the data came in perfectly.
Thanks,
Erich
On Monday, July 28, 2025 at 14:11, Wil Blake eloquently inscribed:
Hello Erich, Can you paste an example of the PDF text Marc record into this thread? Regards, Wil Blake
|