Docxtemplater read a word file with line break #640

Angelk90 · 2022-02-22T10:05:55Z

Environment

Version of docxtemplater : ^3.29.0
Runner : Browser

How to reproduce my problem :

Lines 10 to 16 in 6d3883b

    
           function getFullText(content, tagsXmlArray) { 
        
           	const matcher = xmlMatcher(content, tagsXmlArray); 
        
           	const result = matcher.matches.map(function (match) { 
        
           		return match.array[2]; 
        
           	}); 
        
           	return wordToUtf8(convertSpaces(result.join(""))); 
        
           }

I'm looking for a way to be able to read a document even if it has a line wrap.

At the moment the only way to be able to read a document seems to be to use the following function: doc.getFullText().

So I went to see how the function is done, it looks like a join is done, so all the text is seen without wrapping.

So if the function were changed like this what would happen:

 function getFullText(content, tagsXmlArray, separator = '') { 
 	const matcher = xmlMatcher(content, tagsXmlArray); 
 	const result = matcher.matches.map((match) => match.array[2]); 
 	return wordToUtf8(convertSpaces(result.join(separator))); 
 }

The text was updated successfully, but these errors were encountered:

edi9999 · 2022-02-23T21:31:57Z

Yes, doc.getFullText() is there for historical purposes (I added it 7 years ago and didn't want to break backwards-compatibility).

Adding a correct algorithm for this would require quite some work, I'm not considering adding it myself but would accept a pull request.

However the separator thing would not work.

edi9999 · 2022-02-24T08:45:14Z

I have just added to the roadmap to remove the getFullText in v4 to avoid the confusion for future users.

#340

I don't want to have code that I know is not core to the library and will not have the proper care it deserves.

Angelk90 · 2022-02-24T13:01:54Z

@edi9999 : I personally wouldn't remove it.

edi9999 closed this as completed Feb 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Docxtemplater read a word file with line break #640

Docxtemplater read a word file with line break #640

Angelk90 commented Feb 22, 2022

edi9999 commented Feb 23, 2022

edi9999 commented Feb 24, 2022

Angelk90 commented Feb 24, 2022

Docxtemplater read a word file with line break #640

Docxtemplater read a word file with line break #640

Comments

Angelk90 commented Feb 22, 2022

Environment

How to reproduce my problem :

edi9999 commented Feb 23, 2022

edi9999 commented Feb 24, 2022

Angelk90 commented Feb 24, 2022