Skip to content

Releases: microsoft/markitdown

v0.1.1

25 Mar 06:32
c1f9a32
Compare
Choose a tag to compare

What's Changed

convert_url renamed to convert_uri, and now handles data and file URIs by @afourney in #1153

NOTE: convert_url remains an alias to convert_uri, for backward compatibility.

Both now accept file URIs and data URIs:

e.g.,

markitdown = MarkItDown()
result = markitdown.convert_uri("file:///path/to/file.txt")
print(result.markdown)

And,

markitdown = MarkItDown()
result = markitdown.convert_uri("data:text/plain;base64,SGVsbG8sIFdvcmxkIQ==")
print(result.markdown)

Full Changelog: v0.1.0...v0.1.1

v0.1.0

22 Mar 18:33
2ffe6ea
Compare
Choose a tag to compare

Overview

Version 0.1.0 (previously 0.1.0a6) is a large release, bringing many improvements over the previous 0.0.2 version.

High-level changes include:

  • Organized dependencies into feature groups — install only the converters you need, or get everything with pip install markitdown[all]
  • A new plugin-based architecture, allowing 3rd-party developers to add functionality to MarkItDown (see the sample plugin)
  • All conversions are performed in-memory — no more temporary files
  • Support for new formats including EPUB
  • Option to keep data URIs in converted Markdown
  • Option to override MIME type, extension, and charset in the command-line interface (useful when reading input from a pipe or stdin)

Breaking changes

  • As noted above, dependencies are now organized into optional feature groups. Use pip install markitdown[all] for backward-compatible behavior.
  • convert_stream() now requires a binary file-like object (e.g., a file opened in binary mode, or an io.BytesIO object). This is a breaking change from the previous version, which also accepted text file-like objects, like io.StringIO.
  • The DocumentConverter class interface has changed to read from file-like streams rather than file paths. No temporary files are created anymore. If you are the maintainer of a plugin or custom DocumentConverter, you likely need to update your code. Otherwise, if you're only using the MarkItDown class or CLI (as in these examples), you should not need to change anything.
     

Detailed list of contributions

New Contributors

Full Changelog: v0.0.2...v0.1.0

v0.1.0a6

21 Mar 16:29
efc55b2
Compare
Choose a tag to compare
v0.1.0a6 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.1.0a5...v0.1.0a6

v0.1.0a5

20 Mar 05:13
cd6aa41
Compare
Choose a tag to compare
v0.1.0a5 Pre-release
Pre-release

What's Changed

  • Consider anything with a charset as plain text-convertible. by @afourney in #1142
  • Adjust warning filters and update dependencies by @afourney in #1143

Full Changelog: v0.1.0a4...v0.1.0a5

v0.1.0a4

17 Mar 14:59
a93e056
Compare
Choose a tag to compare
v0.1.0a4 Pre-release
Pre-release

Features

Bug fixes and enhancements

New Contributors

Full Changelog: v0.1.0a1...v0.1.0a4

v0.0.2

08 Mar 00:24
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.0.1...v0.0.2

v0.1.0a1

06 Mar 07:09
6bedf6d
Compare
Choose a tag to compare
v0.1.0a1 Pre-release
Pre-release

What's Changed

This MarkItDown alpha introduces numerous bug-fixes, and the following major changes:

  • Dependencies are now organized into optional feature-groups (further details below). Use pip install markitdown[all] to have backward-compatible behavior.
  • The DocumentConverter class interface has changed to read from file-like streams rather than file paths. No temporary files are created anymore. If you are the maintainer of a DocumentConverter, you likely need to update your code. Otherwise, if only using the MarkItDown class or CLI, you should not need to change anything.
  • MarkItDown now supports extension through 3rd-party plugins. See markitdown-sample-plugin for more details!

v0.0.1

06 Mar 05:42
Compare
Choose a tag to compare

Promoting v0.0.1a5 to a full release.

For more details see the prior Release Notes.

v0.0.1a5

28 Feb 15:38
Compare
Choose a tag to compare
v0.0.1a5 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.0.1a4...v0.0.1a5

MarkItDown version v0.0.1a4

11 Feb 00:12
Compare
Choose a tag to compare
Pre-release

Some of What's Changed

New Contributors

Full Changelog: v0.0.1a3...v0.0.1a4