What character can be used to parse for paragraphs with Java? -
I'm sure people laugh a lot from this one, but I do not find a separator for my life. Is that when a new paragraph in a string of text has started word, and the line? Easy peasy, but the article seems to be more difficult to find. I have tried two line breaks in a row, Unicode representation of paragraph breaks and line breaks, there is no luck.
EDIT: I apologize for the ambiguity of my original question. To answer some questions, this is basically the original text file created on Windows. I'm testing some code for opening and analyzing its content with BlackBerry JDE 4.5 using RIM Eclipse Plugin. While the source of the file will be the window (at least for the near future) and will be the basic text, I have no control over how they have been created (this is the source of a third party which I am designed to do
There is no such paragraph break character in normal use.
Maybe you are able to overcome two or more line breaks in one line (with optional horizontal whites) pointing to a paragraph break but many exceptions to this "rule" For example, when a paragraph is interrupted by
- a floating shape, or
- in bullet points
and Continues after that. .. This way. For that thing, there is probably no solution.
Edit below @Eden's comment (now it is clear that this is not relevant to OP, but it may be relevant to those who are using Google Ask questions etc.)
Instead of reverse engineer text paragraphs, you might consider specifying that you should have the input (for example) syntax; This is supported by the Stack Overflow as it includes links to Java LinkedIn Parser implementation in many languages with Java.
(It assumes that you have some control over the input format of the text that you are trying to parse in paragraph, etc.)
Comments
Post a Comment