This package provides a Guzzle 6/7 middleware that transparently converts documents obtained by Guzzle from its native encoding to UTF-8 (or any other specified encoding). It supports the following features:
- Detection of charset from
Content-TypeHTTP header. - Detection of charset from
metaelement in HTML document. - Detection of charset from XML declaration in RSS and other XML documents.
- Updating the
Content-Typeheader in theResponseobject according to target encoding. - Updating the metadata in the
Responsebody according to target encoding (not enabled by default).
It is recommended to install the library using Composer:
composer require fossar/guzzle-transcoderuse Fossar\GuzzleTranscoder\GuzzleTranscoder;
use GuzzleHttp\Client;
use GuzzleHttp\HandlerStack;
$stack = HandlerStack::create();
$stack->push(new GuzzleTranscoder);
$client = new Client(['handler' => $stack]);
$url = 'https://www.myseosolution.de/scripts/encoding-test.php?enc=iso'; // request website with iso-8859-1 encoding
$req = $client->get($url);
echo $req->getBody();use Fossar\GuzzleTranscoder\GuzzleTranscoder;
use GuzzleHttp\Client;
use GuzzleHttp\HandlerStack;
$stack = HandlerStack::create();
$stack->push(new GuzzleTranscoder([
'targetEncoding' => 'windows-1252',
// Swap the default settings:
'replaceHeaders' => false,
'replaceContent' => true,
]));
$client = new Client(['handler' => $stack]);
$url = 'https://www.myseosolution.de/scripts/encoding-test.php?enc=iso'; // request website with iso-8859-1 encoding
$req = $client->get($url);
echo $req->getHeaderLine('Content-Type') . "\n"; // HTTP header will remain unchanged
echo $req->getBody();It is largely based on Pascal Landau’s guzzle-auto-charset-encoding-subscriber and web-utility libraries.
We are using Transcoder library. This allows us to fall back to iconv when mbstring is not available or an encoding is not supported by it.
The source code is available under the terms of MIT license