A friendly and correct amendment. Roy > On Jan 10, 2017, at 6:12 PM, Stuart A. Yeates <[log in to unmask]> wrote: > > That is, if the XML is completely consistent AND you're guaranteed to never > encounter MARC data with XML special characters, then Kyle's suggestion is > an excellent one. > > I really need to find an excuse to publish a document with a title starting > "<marc:datafield ..." > > cheers > stuart > > -- > ...let us be heard from red core to black sky > >> On Wed, Jan 11, 2017 at 12:06 PM, Roy Tennant <[log in to unmask]> wrote: >> >> Well, I think that's a *bit* harsh. But the "YMMV" addition was >> appreciated, because it can and will. That is, if the XML is completely >> consistent, then Kyle's suggestion is an excellent one. If it isn't, then >> Kevin's link applies, IMHO. Since it appears from what we have been told >> that the records are consistent, I think Kyle's solution is not only >> workable but the most efficient. Given the caveat stated above. >> Roy >> >>> On Jan 10, 2017, at 5:57 PM, Kevin S. Clarke <[log in to unmask]> >> wrote: >>> >>> On the mention of parsing XML with string operations, I'm compelled to >> post one of my favorite StackOverflow responses: >>> >>> http://stackoverflow.com/questions/1732348/regex-match- >> open-tags-except-xhtml-self-contained-tags/1732454#1732454 >>> >>> YMMV of course... >>> >>> Kevin >>> >>> >>> >>> -----Original message----- >>>> From:Kyle Banerjee >>>> Sent: Tuesday, January 10 2017, 5:44 pm >>>> To: [log in to unmask] >>>> Subject: Re: [CODE4LIB] MARCXML help again >>>> >>>> Howdy Julie, >>>> >>>> Depending on your specific needs, it's often easier/faster to use string >>>> rather than XML operations to work with XML. >>>> >>>> Especially if you have a large number of files and/or the files are very >>>> big, stripping the whitespace between elements and then performing a >> simple >>>> string substitution would be a fast low tech way to remove the unwanted >>>> fields. >>>> >>>> kyle >>>> >>>> On Tue, Jan 10, 2017 at 1:13 PM, Julie Swierczek < >> [log in to unmask]> >>>> wrote: >>>> >>>>> Thanks to all who responded to my earlier plea for help. I now have a >> new >>>>> problem. I'm not sure if I can do this with find and replace in >> Oxygen, or >>>>> if this requires XSLT, or what. >>>>> >>>>> I have a project of MARCXML records like this: >>>>> >>>>> <?xml version="1.0" encoding="UTF-8" ?> >>>>> <marc:collection xmlns:marc="http://www.loc.gov/MARC21/slim" >>>>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" >>>>> xsi:schemaLocation="http://www.loc.gov/MARC21/slim >>>>> http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"> >>>>> <marc:record> >>>>> <!--Lots of other datafields here --> >>>>> <marc:datafield tag="710" ind1="2" ind2=" "> >>>>> <marc:subfield code="a">Faux College</marc:subfield> >>>>> <marc:subfield code="b">Special Collections</marc:subfield> >>>>> </marc:datafield> >>>>> </marc:record> >>>>> </marc:collection> >>>>> >>>>> I want to strip out all instances of: >>>>> <marc:datafield tag="710" ind1="2" ind2=" "> >>>>> <marc:subfield code="a">Faux College</marc:subfield> >>>>> <marc:subfield code="b">Special Collections</marc:subfield> >>>>> </marc:datafield> >>>>> but I want to leave other <marc:datafield tag="710" ind1="2" ind2=" "> >>>>> instances intact. I only want to delete ones with both the Faux >> College >>>>> and Special Collections text in the subfields. >>>>> >>>>> Where would I go from here? I thought of doing an xsl:template match >> in an >>>>> XSL stylesheet, and then not providing any instructions for replacing >> the >>>>> match, but I don't know how to select for that specific text. My >> attempts >>>>> to figure that out have not worked. You can only read so much W3C >>>>> documentation and Stack Overflow before you need to just sit quietly >> and >>>>> stare at a wall for a while. >>>>> >>>>> Thanks in advance -- >>>>> >>>>> Julie >>>>> >>>> >>