@tarzanek
Contributor

No description provided.

@oracle-contributor-agreement bot added the "OCA Verified" label (All contributors have signed the Oracle Contributor Agreement.) on Apr 23, 2024
public BitIntsHolder reduce(Collection<SuggestResultCollector> collectors) {
BitIntsHolder reduced = documentIds;
for (SuggestResultCollector collector : collectors) {
documentIds.or(collector.documentIds); //TODO fix as per https://github.com/apache/lucene/pull/766/files
Contributor Author

this needs fixing

@tarzanek
Contributor Author

It seems SuggestResultCollector.java has no test coverage.

@tarzanek
Contributor Author

@ahornace any idea how to properly fix reduce() for that collector? Is it even used?
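
For illustration only, here is a minimal sketch of the general shape a reduce step can take when partial results are merged into a fresh object instead of mutating one collector's state. It is not the fix adopted in this PR and does not reproduce the linked Lucene change. `DocIdCollector` and `ReduceSketch` are hypothetical names, and `java.util.BitSet` stands in for `BitIntsHolder`, whose API is not shown in this thread.

```java
import java.util.BitSet;
import java.util.Collection;

final class ReduceSketch {
    private ReduceSketch() {
    }

    /** Hypothetical stand-in for a per-slice collector that records matching document IDs. */
    static final class DocIdCollector {
        final BitSet documentIds = new BitSet();
    }

    /** Merges all partial results into a new BitSet and leaves every collector untouched. */
    static BitSet reduce(Collection<DocIdCollector> collectors) {
        BitSet reduced = new BitSet();
        for (DocIdCollector collector : collectors) {
            // OR into the fresh result, never into a collector's own field
            reduced.or(collector.documentIds);
        }
        return reduced;
    }
}
```

In the snippet under review, the equivalent change would be to OR each collector's documentIds into a newly created holder and return that one. Whether BitIntsHolder offers a suitable constructor would need to be checked against the code.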


private void searchIndex(IndexSearcher searcher, boolean paging) throws IOException {
collector = TopScoreDocCollector.create(hitsPerPage * cachePages, Short.MAX_VALUE);
collectorManager = new TopScoreDocCollectorManager(hitsPerPage * cachePages, Short.MAX_VALUE);
Contributor Author

perhaps redesign to reuse the object?

@tarzanek changed the title from "lucene 9.10.0" to "lucene 9.11.0" on Jun 10, 2024
@tarzanek changed the title from "lucene 9.11.0" to "lucene 9.11.1" on Jun 27, 2024
@tarzanek changed the title from "lucene 9.11.1" to "lucene 9.12.0" on Oct 1, 2024
(the suppressions should be reviewed; the scorer returns bad objects, so it should break the scorer tests)
@tarzanek changed the title from "lucene 9.12.0" to "lucene 10.0.0" on Oct 23, 2024
@tarzanek
Contributor Author

Lucene 10 support needs to be moved into a new PR,
and the JDK 21 change should also be separate, though it is a prerequisite for Lucene 10.

url.getHost(), url.getPort(),
url.getPath(), url.getQuery(), url.getRef());
public static String encodeURL(String urlStr) throws URISyntaxException {
// URL url = new URL(urlStr); - this call
Member

@vladak vladak Mar 25, 2025

I took a longer look at the departure from the URL() constructors yesterday. The URL class by itself does not support any encoding/decoding; it merely breaks the URL down into pieces. The encoding is then supplied by the URI class.

The problem is that URI(String) accepts only already encoded URIs and refuses any non-compliant characters. I think the way to go is to propagate the URL object as far as possible, i.e. change the String url parameter type of the various linkify() and buildLink() methods to URL url, and stop passing the url in the map of attributes as a String; pass it as an extra argument with the URL type instead.

It is still not clear to me where the unescaped characters could originate from.
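
The difference described above can be shown with a small sketch that uses only `java.net.URI`. It is an illustration, not the code from this PR: `UriEncodingSketch`, `isStrictlyValid` and `encodeFromParts` are hypothetical names. The single-argument constructor rejects a string containing an unescaped space, while the multi-argument constructor quotes illegal characters in each component.

```java
import java.net.URI;
import java.net.URISyntaxException;

final class UriEncodingSketch {
    private UriEncodingSketch() {
    }

    /** Returns true if new URI(String) accepts the text as-is, i.e. it is already encoded. */
    static boolean isStrictlyValid(String text) {
        try {
            new URI(text);
            return true;
        } catch (URISyntaxException e) {
            return false;
        }
    }

    /** Builds a URI from separate components; this constructor quotes illegal characters. */
    static String encodeFromParts(String scheme, String host, int port,
                                  String path, String query, String fragment) {
        try {
            // userInfo is passed as null because the sketch does not need it
            return new URI(scheme, null, host, port, path, query, fragment).toASCIIString();
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException("cannot build URI from the given parts", e);
        }
    }
}
```

One caveat with the multi-argument constructors: they always quote the percent character, so components that are already percent-encoded would be encoded a second time.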

@tarzanek
Contributor Author

tarzanek commented Apr 1, 2025

I will split the Lucene 10 changes out of this PR and get the Lucene 9 part done (I think only the suggester test is missing before it can be merged).
For Lucene 10, we can make the change once JDK 21 is supported.

@tarzanek tarzanek mentioned this pull request Oct 7, 2025
