Skip to content
/ muxml Public

🎷 Streaming XML parser and formatter based on sax

License

Notifications You must be signed in to change notification settings

t1st3/muxml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

285 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

muxml Build Status Travis Coverage Status

Streaming XML parser and formatter

muxml is a Transform stream that accepts XML strings, and emits small chunks containing individual XML elements.

The parser part of this module is built on sax🎷. On parsing, the stream is broken-up and reassembled so that each opening/closing tag and each text is a chunk. During this process, sax events are re-emitted and are listenable down the pipeline.

Once XML is parsed, this module can optionally format XML in the following fashions:

  • XML minification (de-indentation and removal of newlines)
  • XML beautification (indentation and usage of newlines)
  • striping of comments
  • striping of CDATA
  • striping of instruction (like <?xml foo="blerg" ?>) tags
  • striping of <!DOCTYPE declarations
  • striping of attributes
  • tag filtering based on tag name

muxml is written with ES6/ES2015 syntax, although it is also shipped as an ES5 module with UMD syntax, meaning it can also be used:

  • in Node.js >= 4
  • in the browser, as global script
  • in the browser, with an AMD loader

Also available as a gulp / Grunt plugin, or as a CLI app.

Install

$ npm install --save muxml

Usage

Suppose a file named example.xml containing the follwing XML

<a id="foo">
  <b>
    <c>d</c>
  </b>
</a>

then, muxml() returns a transform stream that accepts XML strings and emits XML elements as strings

const muxml = require('muxml');

fs.createReadStream('example.xml')
  .pipe(muxml({pretty: false}))
  .on('data', function (data) {
    console.log(data);
    //=> '<b><c>d</c></b>'
  });

Options

strict

Type: boolean
Default: true

Set sax parser strict argument

pretty

Type: boolean
Default: true

Prettify the output. If true, output has newlines and indentation.

indentStyle

Type: string
Default: spaces

When pretty is set to true, indent with either spaces or tabs.

indentSpaces

Type: integer
Default: 2

When pretty is set to true and indentStyle is set to spaces, then indent with this number of spaces.

indentTabs

Type: integer
Default: 1

When pretty is set to true and indentStyle is set to tabs, then indent with this number of tabs.

filter

Type: string
Default: null
a filter for tag names

Filter XML with tags with name matching the filter.

stripAttributes

Type: boolean
Default: false

Strip attributes from tags.

stripCdata

Type: boolean
Default: true

Strip CDATA tags.

stripComments

Type: boolean
Default: true

Strip XML comments.

stripDoctype

Type: boolean
Default: true

Strip <!DOCTYPE declarations.

stripInstruction

Type: boolean
Default: true

Strip processing instruction (like <?xml foo="blerg" ?>) tags.

saxOptions

Type: object
Default: {}

Set options of sax parser. Note that trim can not be set to false.

API

muxml(options)

options

Type: object
as described above

Related

License

MIT © t1st3

About

🎷 Streaming XML parser and formatter based on sax

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •