Skip to content

stormcrawler-3.2.0

Latest
Compare
Choose a tag to compare
@tballison tballison released this 03 Dec 14:22
· 9 commits to main since this release
fbb52f6

What's Changed

  • Release 3.1.0 by @rzo1 in #1316
  • Bump Apache Storm from 3.1.1 to 2.6.4 & archetype 3.0 to 3.1.0 by @kunalpal97 in #1319
  • #1299 - Add DISCLAIMER to JAR files by @rzo1 in #1320
  • #1300 - Fix "files in jars have odd dates" by @rzo1 in #1321
  • Bump org.yaml:snakeyaml from 2.2 to 2.3 by @dependabot in #1307
  • Bump org.awaitility:awaitility from 4.2.0 to 4.2.2 by @dependabot in #1310
  • Bump org.jacoco:jacoco-maven-plugin from 0.8.11 to 0.8.12 by @dependabot in #1305
  • Bump org.netpreserve:jwarc from 0.29.0 to 0.30.0 by @dependabot in #1304
  • Bump org.apache.maven.plugins:maven-surefire-plugin from 3.2.1 to 3.5.0 by @dependabot in #1308
  • Bump aws.version from 1.12.663 to 1.12.772 by @dependabot in #1302
  • Bump org.apache.solr:solr-solrj from 9.6.1 to 9.7.0 by @dependabot in #1309
  • Bump com.microsoft.playwright:playwright from 1.46.0 to 1.47.0 by @dependabot in #1306
  • Bump org.wiremock:wiremock from 3.5.4 to 3.9.1 by @dependabot in #1311
  • Bump selenium.version from 4.24.0 to 4.25.0 by @dependabot in #1314
  • #1323 Update archetype Storm version from 2.6.4 by @mvolikas in #1325
  • Regenerated License file after dependency upgrades by @github-actions in #1322
  • Bump OpenSearch to 2.17 + fix archetype version in README by @jnioche in #1324
  • Bump org.mockito:mockito-core from 5.13.0 to 5.14.0 by @dependabot in #1334
  • Bump junit.version from 5.11.0 to 5.11.1 by @dependabot in #1333
  • Bump org.apache.maven.plugins:maven-archetype-plugin from 3.2.1 to 3.3.0 by @dependabot in #1332
  • Bump org.apache.maven.archetype:archetype-packaging from 3.2.1 to 3.3.0 by @dependabot in #1330
  • Regenerated License file after dependency upgrades by @github-actions in #1326
  • Regenerated License file after dependency upgrades by @github-actions in #1335
  • Bump log4j2.version from 2.23.0 to 2.24.1 by @dependabot in #1328
  • Regenerated License file after dependency upgrades by @github-actions in #1337
  • Bump org.jetbrains:annotations from 24.1.0 to 25.0.0 by @dependabot in #1331
  • Regenerated License file after dependency upgrades by @github-actions in #1338
  • Bump com.github.crawler-commons:urlfrontier-API from 2.3.1 to 2.4 by @dependabot in #1327
  • Regenerated License file after dependency upgrades by @github-actions in #1340
  • Store metadata as WARC Metadata records by @jnioche in #1341
  • Improve robustness of WARC generation by @jnioche in #1342
  • Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.0 to 3.5.1 by @dependabot in #1350
  • Bump junit.version from 5.11.1 to 5.11.2 by @dependabot in #1345
  • Fix configuration for Github's linguist by @mvolikas in #1344
  • Bump testcontainers.version from 1.20.1 to 1.20.2 by @dependabot in #1346
  • Bump org.mockito:mockito-core from 5.14.0 to 5.14.1 by @dependabot in #1349
  • Bump aws.version from 1.12.772 to 1.12.773 by @dependabot in #1351
  • Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.10.0 to 3.10.1 by @dependabot in #1347
  • Regenerated License file after dependency upgrades by @github-actions in #1352
  • #1354 Fix: fix some typos in project by @psxjoy in #1355
  • Fix #1312 "Sha512 hash of source release is missing the file part " by @rzo1 in #1356
  • Bump de.thetaphi:forbiddenapis from 3.7 to 3.8 by @dependabot in #1359
  • Bump org.jetbrains:annotations from 25.0.0 to 26.0.0 by @dependabot in #1358
  • Regenerated License file after dependency upgrades by @github-actions in #1360
  • Trivial: version number in warc/README fix #1317 by @jnioche in #1363
  • Bugfix nofollow instructions in rel tags ignored by @jnioche in #1362
  • Bump org.jetbrains:annotations from 26.0.0 to 26.0.1 by @dependabot in #1368
  • Bump com.microsoft.playwright:playwright from 1.47.0 to 1.48.0 by @dependabot in #1366
  • Connect to a remote instance using web sockets by @jnioche in #1361
  • Bump aws.version from 1.12.773 to 1.12.776 by @dependabot in #1367
  • Bump org.mockito:mockito-core from 5.14.1 to 5.14.2 by @dependabot in #1369
  • Regenerated License file after dependency upgrades by @github-actions in #1370
  • Bump tika.version from 2.9.2 to 3.0.0 by @dependabot in #1365
  • Apache Storm 2.7.0 by @rzo1 in #1371
  • Regenerated License file after dependency upgrades by @github-actions in #1372
  • #1353 Fix for URLFrontier spout not taking into account the crawl ID by @klockla in #1373
  • Bump junit.version from 5.11.2 to 5.11.3 by @dependabot in #1375
  • Bump com.ibm.icu:icu4j from 75.1 to 76.1 by @dependabot in #1376
  • Bump aws.version from 1.12.776 to 1.12.777 by @dependabot in #1377
  • Bump org.wiremock:wiremock from 3.9.1 to 3.9.2 by @dependabot in #1378
  • Bump testcontainers.version from 1.20.2 to 1.20.3 by @dependabot in #1379
  • Remove references to ES in OpenSearch module by @jnioche in #1374
  • Regenerated License file after dependency upgrades by @github-actions in #1380
  • Fix #1313 "Exclude "__files" from Source Release Artifacts"" by @rzo1 in #1384
  • #1301 - add build doc for the source release by @rzo1 in #1383
  • [1385] bugfix - check for null before the for-each loop by @jnioche in #1386
  • Sync conf files in root and archetype + explicit values for sniff conf by @jnioche in #1388
  • Detect multi addresses separated by ; in a single String. Fixes #1382 by @jnioche in #1387
  • Bump org.apache.maven.plugins:maven-archetype-plugin from 3.3.0 to 3.3.1 by @dependabot in #1390
  • Bump selenium.version from 4.25.0 to 4.26.0 by @dependabot in #1393
  • Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.1 to 3.5.2 by @dependabot in #1392
  • Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.10.1 to 3.11.1 by @dependabot in #1394
  • Bump org.apache.maven.archetype:archetype-packaging from 3.3.0 to 3.3.1 by @dependabot in #1395
  • Regenerated License file after dependency upgrades by @github-actions in #1398
  • #620 Add support for shards - SolrSpout by @mvolikas in #1343
  • #1403 - Downgrade log4j2 to Storm's version. Fixes #1403 by @tballison in #1404
  • #1401 Drop Java-based Topologies by @mvolikas in #1402
  • #1405 - bump development version to 3.2.0-SNAPSHOT by @tballison in #1406
  • #1409 - remove wrapper element by @tballison in #1410
  • Fixes Issues mentioned in IPMC Vote by @rzo1 in #1417
  • Bump aws.version from 1.12.777 to 1.12.778 by @dependabot in #1415
  • Bump com.microsoft.playwright:playwright from 1.48.0 to 1.49.0 by @dependabot in #1419
  • Bump opensearch.version from 2.17.0 to 2.18.0 by @dependabot in #1400
  • Bump testcontainers.version from 1.20.3 to 1.20.4 by @dependabot in #1422
  • Bump org.netpreserve:jwarc from 0.30.0 to 0.31.1 by @dependabot in #1420
  • Regenerated License file after dependency upgrades by @github-actions in #1424
  • Update to Storm 2.7.1 by @rzo1 in #1425
  • Regenerated License file after dependency upgrades by @github-actions in #1426
  • Adress IPMC feedback by @rzo1 in #1423
  • Prevent Dependabot from suggesting dependency updates for Jackson by @jnioche in #1433
  • Bump org.jsoup:jsoup from 1.18.1 to 1.18.3 by @dependabot in #1431
  • Bump aws.version from 1.12.778 to 1.12.779 by @dependabot in #1427
  • Bump org.codehaus.mojo:license-maven-plugin from 2.4.0 to 2.5.0 by @dependabot in #1430
  • Bump org.wiremock:wiremock from 3.9.2 to 3.10.0 by @dependabot in #1432
  • Bump selenium.version from 4.26.0 to 4.27.0 by @dependabot in #1428
  • Regenerated License file after dependency upgrades by @github-actions in #1434
  • [MINOR] update URLs in tests by @pjfanning in #1435

New Contributors

Full Changelog: stormcrawler-3.1.0...stormcrawler-3.2.0