Changelog
All notable changes to Dangl.TextConverter are documented here.
v1.2.9:
- The class
Dangl.TextConverter.Html.HtmlTableConverter
is now public - The
InternalsVisibleToo
attribute for the assembly was removed to prevent conflicts in applications that sign their binaries
v1.2.8:
- Bugfix where bookmark elements in Rtf texts with missing closing elements where throwing a
System.InvalidOperationException
- CI tests are now also run on Linux
- Bugfix where line endings were sometimes not correctly trimmed when running on .NET Core on Linux
v1.2.7:
- Non-printable unicode escape sequences (
0x00
-0x08
,0x0b
and0x0e
-0x1f
) are now ignored when converting RTF text to plain text
v1.2.6:
- Add the
HtmlAgilityPackLegacyBehaviorHelper
utility class and fix an issue were enabling the legacy behavior in HtmlAgilityPack was not thread safe and could fail when simultaneously accessed
v1.2.5:
- Update HtmlAgilityPack dependency. The previously referenced version
1.9.2
was pulled from NuGet due to an unintended, breaking API change. Please see https://github.com/zzzprojects/html-agility-pack/issues/125 for more information about the change - The
HtmlToText
class now sets the static propertyHtmlAgilityPack.HtmlDocument.DisableBehavaiorTagP = false
in its static method calls to ensure compatible behavior. If your own code relies on different behavior, please ensure that this property is always set to its original value after invoking one of the methods onHtmlToText
. See the README for further details
v1.2.4:
- Dependencies update
v1.2.3.:
- Small internal refactoring
- Dependencies update
v1.2.2:
- Add
bool keepWhitespaceAtLineEnds
parameter toTextToHtml.TransformPlaintextToHtml()
overload which defaults to false - Bugfix where Rtf text was sometimes incorrectly read and output when segments between groups started with whitespace
v1.2.1:
- Add
StringLineStartNormalizationExtensions
v1.2.0:
- Update
HtmlAgilityPack
for huge (about 10x) performance improvements innetstandard
targets, see https://github.com/zzzprojects/html-agility-pack/releases/tag/v1.8.11 - When converting Html to plain text, it's now possible to have the result split by Html class names. Please see the README or https://docs.dangl-it.com/Projects/Dangl.TextConverter for further details
public static SegmentedRtf ConvertRtfToSegmentedText(string rtfInput)
was added toRtfToText
. This will return text segments that contain plain text representations of the texts as well as tags to indicate the opening and closing of bookmarks. This is used, for example, in the GAEB & AVA .Net Libraries by DanglIT to work with text additions in GAEB 2000 files. Please see the README or https://docs.dangl-it.com/Projects/Dangl.TextConverter for further details- Added
TextToRtf.ConvertPlainTextToRtf(SegmentedRtf segmentedRtf)
to convert back to Rtf from segmented texts while preserving bookmarks - Dropped tests for
netcoreapp2.0
, added tests fornetcoreapp2.2
v1.1.5:
- Bugfix: Some empty tables caused a
NullReferenceException
when converting them to plaintext via theHtmlToText
class - Update of
HtmlAgilityPack
andSystem.Text.Encoding.CodePages
(the latter only fornetstandard
targets)
v1.1.4:
- Dependencies update
v1.1.3:
- Update of HtmlAgilityPack dependency to include latest bugfixes
- Internal refactoring of the CI/CD pipeline
v1.1.2:
- Small performance improvements for parsing Rtf text
v1.1.1:
- Update HtmlAgilityPack to latest stable version 1.8.4
v1.1.0:
- Switch to HtmlAgilityPack. The HtmlAgilityPack.Core fork is no longer required since the original supports now netstandard
v1.0.8
- Add
netstandard2.0
target - Switch build system to NUKE
v1.0.7
- Fix bug in RtfToText where Rtf annotations were read as plain text
v1.0.6
- Performance improvements for Rtf texts that contain pictures
v1.0.5
- Update Html encoding/decoding to preserve correctness in roundtrip scenarios
v1.0.4
- Downgrade to netstandard1.3 and net45 for broader compatibility
v1.0.3
- Performance improvements
v1.0.2
- Update ANTLR4 dependencies to latest stable version
v1.0.1
- Target NETStandard 1.3