Distributed Decision-Tree Induction in Peer-to-Peer Systems

This paper proposes a scalable and robust distributed algorithm for decision-tree induction in large peer-to-peer (P2P) environments. Computing a decision tree in such large distributed systems with standard centralized algorithms can be very communication-expensive and impractical because of the synchronization requirements. The problem becomes even more challenging in the distributed stream-monitoring scenario, where the decision tree must be updated in response to changes in the data distribution. The paper presents an alternative solution that works completely asynchronously in distributed environments and has low communication overhead, a necessity for scalability. It also seamlessly handles changes in the data and peer failures. Extensive experimental results are presented to corroborate the theoretical claims.

Data and Resources

DTree.pdf (PDF)
Paper

Additional Info

Field	Value
Maintainer	Kanishka Bhaduri
Last Updated	February 19, 2025, 10:57 (UTC)
Created	February 19, 2025, 10:57 (UTC)
accessLevel	public
accrualPeriodicity	irregular
bureauCode	{026:00}
catalog_@context	https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_@id	https://data.nasa.gov/data.json
catalog_conformsTo	https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy	https://project-open-data.cio.gov/v1.1/schema/catalog.json
harvest_object_id	f34e376c-d9ca-444f-934c-92cebf5e3675
harvest_source_id	b37e5849-07d2-41cd-8bb6-c6e83fc98f2d
harvest_source_title	DNG Legacy Data
identifier	DASHLINK_178
issued	2010-09-22
landingPage	https://c3.nasa.gov/dashlink/resources/178/
modified	2020-01-29
programCode	{026:029}
publisher	Dashlink
resource-type	Dataset
source_datajson_identifier	true
source_hash	53f946c2760c23245a313a4deed906f9b713573c73484e9cb30f5885d9fed0cb
source_schema_version	1.1