HTML Import Pro
Introduction
HTML Import Pro is a powerful Joomla component for importing HTML content into Joomla articles. Import from local files, fetch from URLs, or parse entire sitemaps for bulk content migration. Intelligent extraction algorithms find the right content while filtering out navigation, ads, and other noise.
What It Does
File Import - Upload HTML files individually or batch import via ZIP archives. Drag and drop support for quick imports. Process .html, .htm, and .xhtml files.
URL Import - Enter URLs and fetch content directly from the web. AJAX-based processing with real-time progress feedback. Automatic alias generation from source URLs.
Sitemap Import - Parse XML sitemaps to discover pages automatically. Filter URLs before import. Process hundreds of pages with progress tracking.
Smart Extraction - CSS selector-based content targeting or automatic detection using Readability-style algorithms. Strip unwanted elements like scripts, navigation, and sidebars.
Image Handling - Download images from source pages automatically. Save to your Joomla media folder. Optional WebP conversion for better performance.
Import Sources
HTML Files - Upload individual HTML files or batch import multiple files via ZIP archives. Supports .html, .htm, and .xhtml formats. Drag and drop for quick uploads.
Web URLs - Import content directly from any publicly accessible web page. Enter multiple URLs for batch processing. AJAX-powered with real-time status updates.
XML Sitemaps - Parse standard XML sitemaps to discover all pages on a website. Filter URLs by pattern. Upload local sitemap files or fetch from URL.
Content Extraction
CSS Selectors - Define precise selectors to target content:
- Content selector (e.g., article, .post-content, #main)
- Title selector (e.g., h1, .entry-title)
- Elements to strip (e.g., nav, .sidebar, .comments)
Auto-Detection - Smart algorithm automatically finds main content area by analyzing page structure, text density, and HTML patterns. Works like Readability.
Metadata Extraction - Automatically captures:
- Page title from content or meta tags
- Meta description for SEO
- Open Graph data
- Publication dates when available
Image Features
Automatic Download - Images referenced in imported content are downloaded and saved locally. External URLs replaced with local paths.
WebP Conversion - Optional conversion to WebP format:
- Smaller file sizes (25-35% reduction)
- Better page performance
- Automatic fallback if conversion fails
Organized Storage - Images saved in structured folders by article alias for easy management.
Key Features
✅ 3 Import Sources - Files, URLs, Sitemaps
✅ Batch Import - ZIP archives and sitemap parsing
✅ AJAX Processing - Real-time progress and status
✅ CSS Selectors - Precise content targeting
✅ Auto-Detection - Smart content extraction
✅ Image Download - Automatic with path updates
✅ WebP Conversion - Optional image optimization
✅ Import Profiles - Save and reuse configurations
✅ Alias Generation - Automatic from source URLs
✅ Metadata Extraction - Titles, descriptions, OG data
✅ Strip Elements - Remove nav, ads, sidebars
✅ Import Logging - Track all operations
✅ CLI Support - Command line automation
✅ Rate Limiting - Respectful URL fetching
✅ Joomla 4, 5 & 6 - Full compatibility
Perfect For
- Website Migration - Move content from old HTML sites to Joomla
- Content Aggregation - Import articles from multiple sources
- Blog Migration - Bring posts from other platforms
- Documentation Import - Convert HTML docs to articles
- Archive Creation - Save web pages as Joomla content
- Content Backup - Import external content for preservation
- Site Redesign - Migrate content during redesign projects
- Bulk Content Creation - Populate sites with existing content
Import Profiles
Save your import configurations for repeated use:
- Content extraction selectors
- Title extraction settings
- Elements to strip
- Target category
- Article state and access
- Image import preferences
- WebP conversion setting
Load profiles during import or save current settings as new profiles. Perfect for importing from multiple source websites with different structures.
AJAX-Powered Processing
URL and sitemap imports feature real-time feedback:
- Progress Bar - Visual completion percentage
- Current URL - Shows which page is being processed
- Status Indicators - Pending, fetching, completed, error
- Continue on Error - Keeps processing if individual URLs fail
- Configurable Delay - Set interval between requests
Configuration Options
Basic Settings
- Default category for imports
- Default author assignment
- Default access level
- Default publishing state
Extraction Settings
- Content CSS selectors
- Title CSS selectors
- Elements to strip/remove
- Auto-detection toggle
Image Settings
- Enable/disable image import
- Destination folder
- WebP conversion toggle
URL Settings
- User agent string
- Request timeout
- Rate limit delay
- robots.txt respect
Permissions
- Configure - Component options
- Access - Basic component access
- Import - Permission to import
- Profiles - Manage import profiles
- Logs - View import history
Technical Features
Modern Architecture
- Joomla 4, 5 & 6 native compatibility
- PHP 8.1+ with type declarations
- Proper namespace implementation
- MVC architecture
- Service-based design
Robust Processing
- DOMDocument HTML parsing
- XPath content extraction
- cURL-based URL fetching
- GD-based image processing
- Batch processing support
Security
- CSRF token validation
- Permission checks
- Input sanitization
- Secure file handling
- Firewall-friendly design
Requirements
- Joomla 4.0 or later (4.x, 5.x, 6.x)
- PHP 8.1 or later
- PHP Extensions: cURL, DOM, libxml
- PHP GD with WebP support (for conversion)
- MySQL 5.7+ / MariaDB 10.3+
Support
- Support Portal: https://support.joomlax.com
- Email: [email protected]
- Website: https://www.joomlax.com
- Documentation: Comprehensive docs included
Why Choose HTML Import Pro?
Multiple Sources - Import from files, URLs, or sitemaps in one component
Smart Extraction - CSS selectors and auto-detection find the right content
Real-Time Feedback - AJAX processing shows progress for every URL
Image Handling - Automatic download and optional WebP conversion
Reusable Profiles - Save configurations for different source websites
Full Logging - Track every import with detailed history
Joomla Native - Built specifically for Joomla following all standards
Migrate your HTML content to Joomla with intelligent extraction, automatic image handling, and powerful batch processing. Perfect for website migrations, content aggregation, and bulk imports.
Joomla 4, 5 & 6 Compatible | 3 Import Sources | Smart Content Extraction | WebP Conversion
HTML Import Pro
- Version:
- 1.0
- Developer:
- Infyways Solutions
- Last updated:
-
Feb 08 2026
18 hours ago - Date added:
- Jan 25 2026
- License:
- GPLv2 or later
- Type:
- Paid download
- Includes:
- c
- Compatibility:
- J4 J5 J6
Share