Skip to content

Commit 0001a33

Browse files
fix: pass extract image args to all partitioners (#3950)
This is needed in order for the user to specify whether to extract the base64 for images, which are now parsed by the html partitioner. ## Testing Adds test that validates this by calling the auto-partitioner with appropriate arguments partitioning an html file with base64 embedded image.
1 parent c0457c1 commit 0001a33

File tree

7 files changed

+28
-3
lines changed

7 files changed

+28
-3
lines changed

Diff for: CHANGELOG.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
## 0.16.26-dev1
1+
## 0.16.26-dev2
22

33
### Enhancements
44

Diff for: example-docs/html-with-base64-image.html

+6
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
<div>
2+
<p>Test page</p>
3+
<img src="data:image/jpeg;base64,/9j/4QDKRXhpZgAATU0AKgAAAAgABgESAAMAAAABAAEAAAEaAAUAAAABAAAAVgEbAAUAAAABAAAAXgEoAAMAAAABAAIAAAITAAMAAAABAAEAAIdpAAQAAAABAAAAZgAAAAAAAABIAAAAAQAAAEgAAAABAAeQAAAHAAAABDAyMjGRAQAHAAAABAECAwCgAAAHAAAABDAxMDCgAQADAAAAAQABAACgAgAEAAAAAQAAAgCgAwAEAAAAAQAAAgCkBgADAAAAAQAAAAAAAAAAAAD/2wCEAAEBAQEBAQIBAQIDAgICAwQDAwMDBAUEBAQEBAUGBQUFBQUFBgYGBgYGBgYHBwcHBwcICAgICAkJCQkJCQkJCQkBAQEBAgICBAICBAkGBQYJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCQkJCf/dAAQACP/AABEIAH0AfQMBIgACEQEDEQH/xAGiAAABBQEBAQEBAQAAAAAAAAAAAQIDBAUGBwgJCgsQAAIBAwMCBAMFBQQEAAABfQECAwAEEQUSITFBBhNRYQcicRQygZGhCCNCscEVUtHwJDNicoIJChYXGBkaJSYnKCkqNDU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6g4SFhoeIiYqSk5SVlpeYmZqio6Slpqeoqaqys7S1tre4ubrCw8TFxsfIycrS09TV1tfY2drh4uPk5ebn6Onq8fLz9PX29/j5+gEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoLEQACAQIEBAMEBwUEBAABAncAAQIDEQQFITEGEkFRB2FxEyIygQgUQpGhscEJIzNS8BVictEKFiQ04SXxFxgZGiYnKCkqNTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqCg4SFhoeIiYqSk5SVlpeYmZqio6Slpqeoqaqys7S1tre4ubrCw8TFxsfIycrS09TV1tfY2dri4+Tl5ufo6ery8/T19vf4+fr/2gAMAwEAAhEDEQA/AP0Aooor/YA/4nwooooAKKKKACiiigAooooAKKKKACiiigAooor4/j3/AJFNX/t3/wBKR+2fR1/5LLB/9v8A/pqZ/9D9AKKKK/2AP+J8+d/jX+1D8J/2f/FHhDwf8RpruG98cX403SRbWzzq9wWVAsjIMRjLDk8YyegNN/aY/aj+Ev7JXg2z8efGWe7t9Nvb9dNiaytZLtvtDozqCkYJCkKQD0zgdSK/NT/grDIlv8dv2ZbiYhIx43hXceBkzxACov8AgvVIh/Zd8L2YI82fxpapGn8TMIZThR3OOwr8m4h43xmFo5pOko3w6hyad4p666+Wx/Y3ht4DZJm2M4SoYpz5cydZVbSSt7OpKK5PdfLole9/kfttE4ljWRejAMPoRkV84ftNftXfBz9kbwpp3jX41XV1aadqd2bKF7S1kum81YXnO5IwWCiONmJxgAV9E2QItIQe0Uf/AKCK/ET/AILrWkOpfBT4ZaPcM0cd/wCNobN3jxvRbiyuISy54yobIzxkV9TxznlfLsnrY7DJc8UrX23S8j8n+j74fZdxNxtguH80clQqykpcrSlZQk9HZrp2Z+s3jv45/DX4ffBa9/aC1a/+1eFLLT01Q3tiv2gSWcihkliCffUqQRjt9K6D4WfErwr8Y/hxonxU8DSSTaP4gtI76yeVDE7QyfcLIeVJx0Nfy/ePfib8Rv2JPgb8ZP8Agm7+0bdNd6Vd6RdXHgHVQhZZbaZiVtztA2pLj5V2hYLgPFkI8O798v8AgnZ/yYp8I/8AsVrH/wBBNfN8IcfVczzL6nKPJy07zj1jUUuVr0tqvKx+neNP0ccLwnwus7p1XWVXE8tCqn+7q4aVH2kJJW+O/uyV/dacbdT7HubiCzt3url1jiiUu7udqqqjJJPQAAZJ7Cvyj1X/AILVfsDaV4pl8PHxBqNxawXBtX1SDTpnsfMBI+VwMunGQyjBXkcV+l3xH8Fx/Eb4fa78Ppp3tk13TrnTzLGMtGLiIx7gO+3Ocelfze+HP+Gu/wDgnF8Dbj4IfHH4L+H/AIqfBqylaSbV7WSGTdb3U++SSYkE7Qx3DzYljjGEMhA3VXiJxNmWXVKTwto0rScp+zlUSatZNRacY2veWtjP6NHhVwrxLhsVHNr1cWpU40qCxFLDOcZc3NKEqsZRnUTUVGleN779D+lvwz4l8PeM/Dtj4u8JXsOpaXqcEd1aXdsweGeGVQySRsOCrDkGvB/2n/2sPgz+yB4IsfiH8cby4sdL1C+XTYXtrd7ljO0UkoBRBkDZE3PTjFd98DPil4K+Nnwf8N/Fj4dIYdE1+whu7OJ0WNoo2GPKZE+RWjIKMF4yOOK9D1HRdH1pEt9ZtILyNHDKk8SyKG6ZAYEA44r7ydSriMEp4OpHmkk1K14+trrTyufzxQwuBy3PHQzvD1HSpzlGdNSUKnu3XLz8kkpJ7vka02Pyek/4Le/8E/oYEuZNd1dY5c7GOk3IDbeDtO3Bx09q/U/wZ4s0bx54R0rxv4dZn0/WLSG9tWddrGGdQ6Er2O09O1fil+wn4W8LXn/BS39qTT7zTLOa3t5dMEUTwRskYJbOxSuFz7V+5lvbwWkCW1sixxxgKiIAqqo6AAcADsBXyHh5muaY6hPE5hODXNKKUYuPwScb6ye9traH7T9JbhDhLh/H0Mq4cw9aE3To1ZSqVYzTVajCqoqMaVOzhzWbu+a20diaiiivV49/5FNX/t3/ANKR8j9HX/kssH/2/wD+mpn/0f0Aooor/YA/4nz44/bY/Y28G/to/CyDwH4gv5dF1TSbr+0NH1SBfMa0ughTLR7kLxsDyodGBwyMrAEfDngv/gld8X/FPxO8N+Mv2xvjLefEvR/CM0dxpulPavEvmwurx75JJpMLlF8whDI4GzzAhdW/aqivjs24ByrHYr65iad5aX1kk+XbmimlK3S6P27g36RXF2QZR/YmV4lRorm5b06cp0+dWn7KpKLnT5uvJKPfcOpye9fC/wC3Z+x1e/tleE/CPhiy8QReHm8LeIIdd8yW1a6E3kxPGIgqyxbc7s7snGOlfdFFe/m+UYfHYaWExUbwlutvy9D864K4zzHh7M6Wc5TPkr0r8rsna6cdpJrZvdHxD+3L+xH4E/ba+GUPhPXJYtL13S5Gk0rVzD5zW4lwtxC6K0bPBOgw6B1wwV1IZFI9y/Zw+EcvwE+A3hD4LT366o/hbSoNNN2sZhWfyBjeIyz7M/3dxx617ZRXPS4ewdPHSzKELVZLlb7pbabdN7XtpsejjfErO8Tw/R4Xr13LCUpupCDS92TVnZ2ulq/dvy3bdrs5Xxv4R0vx94O1TwRrbzx2er2stnM9pM9vOqSrtJimjKvG46qykEGvxGg/4JP/ALV+jeBrj9njw9+0Rdj4VXDNF/Y91polmWzL7xblvNwQABlQViYjmLadlfvHRXn8Q8GZfmkozxcXdJrSUo6PeL5Wrp9mfReGvjjxFwnSqUMmqQUJyjO06VKqlOF+WcVUhLknG+ko2f4HlvwS+EfhX4CfCPw78GfBPmtpfhqwisLd5ypmkEY5kkKqq75Gy7bVAyeABgV6lnaQ3oQfyoor6PDYanRpxo0laMUkl2S2R+Y5rmeIxuJqYzFy5qlRuUm9227t/Nnw1+z/APseXnwS/ah+K/7Rk/iCPUk+Jj2rpYJatE1l9mJIBlMrCXcD2RMV9y0UVyZVlGHwVL2OGjaN2/nJ3f4s9vjDjPMc+xax2aT56ihTpp2S92lCNOCtFJaQilfd2u9QooorwePf+RTV/wC3f/Skfof0df8AkssH/wBv/wDpqZ//0v0Aooor/YA/4nzjvFfxD8A+AxbnxxrdhowuyywfbriO380oMsE3kbto646VJ4X8eeB/G8ckvgvWbDV1h/1hsriOfZ/vbCcV+IH/AAW0svCF/wCIfgPa/EBYToMnid01Rp8BF09in2tmbHyhYdzbhyuMjkCvkmx0b9lrwB/wUf8Ag5H/AMEwL43n2+7lh8SpYXE1xZ/YiR5iGRyAy/Z1Z5U+ZEdYm+VmG78czvxOrYLNKmC9lBwhKnH4rTfPbWMba2vrtof25wF9FDA55wlhs8+tVYVq1HEVV+5Tw8Pq7n7tWrzp0+dQtF8r10sf1RVzHinxr4O8DWUepeNNVs9HtppBFHLezJAjSYJCKzkAtgE49BXTnaCQvTt9K/C7/gvvJs/Zc8DpsRxJ43tEYSIrjabG8zw4I/SvveMs/eVZZWzCMebkV7bdkfzx4H+HNPi7izA8N1avs415cvMle3ut7adj9svEPirwz4Q0Z/EXizUbbS9Pi2h7m7lWGFd5AXLthRuJAFalhf2OqWMOpaZMlxbXCLJFLEwZHRhlWVhwVI6EV+Vf/BZzH/DtPxpj/npof/pfbV93fsyqqfs5eAUjAVV8O6aAAAAALdeABwKWGz+U83nlnLpGnGd/8UpK3y5TXNfDalh+DMPxSqr5qmJq0OS2iVOnSnzX8/aWtbSx7j7CvINe/aB+BHhbxMvgvxL400LT9Xc7RZXF/BHPn08stmvj3/gq98d/Gv7Pf7Enijxp8Orh7LWL2W10qC6iJWS3S8k2TSRsOUk8oEIw+6xz2rwb4Of8Eaf2OJ/gTYad8VtHm8QeJdbsorvUdbF1LFMLmdBITbYOFSMnC7txcDMhYlifIzniXMP7QeW5VRjKUYqUnOTikm2opWTd3Z+SPseBvCvhtcNw4q4vxlSjRq1ZUaUKNONSbcIwlOcuaUIqEFOKt8UntZan7LRyxTRrLEwZXAKlSCCD0II4I9MVheJfFnhbwXpv9s+L9StdKtN6xefeSrDHvfhV3OQMnsK+bf2M/wBmnXv2Tvg7H8HdX8YXni+3tLmSWye7gSD7HbucrbQqvPlL1APCkkIAm1V9L+PH7P8A8KP2lPAZ+Gnxm0w6topuYbs24leHMsBzGS0eDgd16EcHjivpaWKxlTAqqqSjVt8LeifZuN9PNL5H5XjMpyPDZ/LBvFurgozt7WnC0pU/5o05uPvW+y2lfrbUsD9oP4DMcL420HI7f2hB/wDFV6fpOr6Vr2mQa1odzFeWd0gkhngcPFIh6MjLwVPYjiv5k/gB+wf+yt43/wCClnxn/Z38TeFY7jwj4QsLObSrMSujQvLDAzkzLiR8l24cnH4V/SR8Ofh94U+FHgLRvhl4FtjZ6LoFnFYWMDO0pjghG1FLvlmwO7cmvmeCOI8yzL2lTF0oQhGUoe7Jt80Jcr0cVppp+R+p+PPhjwzwvLDYbKMXWrVatOlWtOnCEVTrU1UjrGpJ86TSatZa2kztaKKK7+Pf+RTV/wC3f/SkeD9HX/kssH/2/wD+mpn/0/0Aooor/YA/4nz8rf8AgpD+yn8S/wBp7xj8HV8G6JDrWi+HPEX2rxAk80UaDT3ZBMrJJ/rFePcpRRyOOM5HJeEv2MPiF+yz/wAFCYfjN+zJ4ctD8K/Fumf2drumW8kMH9lOHJElpE+CIt+JNkZ/ikyM7a/X7APUUYB6ivhcb4fYGtjZZg7qq5QkmrXXIrWWnwtbrqf0Bkf0keIMBkVPhqmoSwkaNai4NS5ZxrS53KSUkueErOnJW5bdVcOnFfCn/BRX9kOb9tL9mu8+FOkXENrrNneQappclwSsRubfcpid1DFBLE7x7wrbCQdpAwfuyivp85yihj8LUwWJV4TVn6H5RwRxlj+Hc3w2eZVPlrUJKcH0uu67dGu2h/OV8Vfg5/wV+/bM+HFj+yf8bfDWg+E/DKTWw1bxGJ4XN6lq6GKTyopXdigXeUVIvMlCnci5U/0JeD/DGm+CvCel+DtGBFppFnBZQbuvlwRiNSfc4zXQhFHIFOrw+GuDqOW1Z4j2s6k5JLmm03yx2irJKyv2P0DxR8bsZxPhMPlqwlHC4ejKc1ToQcIupU5eeb5pSbbUYpa2ilaKSPmz9rj9m/w9+1l+z74i+BPiK4+xLrESNa3ezzPs13bsJLeYpld6q4G9MjcuRX5QeDW/4Lb/AAJ8E2v7Pnhjwj4d8V2+nQiy0zxTLdxOLe3QbY/MEskUknljiMtEzABQ5kILN++VN2L6Cs8+4KpY3ErGU606VS3K3BpXj2d01p06o6fDzx2xmRZXLI8TgqGMwvP7SNOvByUKllFzhyyg1dJKUW3F2V4nyZ+xh8H/AI3/AAX+DEPhz9oTxlc+NPE1zcSXc800glitBJ0tYJPLjZ0Tqzsoyx+UKoVR9ZsMjAp1FfSZZl1PCYeGGpX5Yqyu7v72flvFfEuIzjMq2aYpRU6snJqEVCKv0jGKSSXRI/Lj4Cfs0fGHwL/wUp+Mv7RXiTTEh8JeMbC0h0u7E8bNI8MMCMDEp3ryjdQOlfqPSYHXFLXLkeR0sBTnSot2lKU9e8nzPtp2PY4+4/xnEWJo4rHRinSpUqK5U0uWjBU43u3ryxV9lfZIKKKK8jj3/kU1f+3f/SkfafR1/wCSywf/AG//AOmpn//U/QCiiiv9gD/ifAcjIHAoAY/dGfpX4/fthePfHfh//gpj+zb4N0HWr+x0fWRqP2+xt7mWK2uvLXKefCjBJNvbcDiqX/BWPx/488C+JvgJD4J1vUNGTVfHVtaXq2NzLbC5t2kiBimETKJIz/dbI9q+Dx/HdKhRxlV03/s8owfnzKG3b4vwP6J4c+jzisxxuSYKGJjF5jRqVovlfuKm60eV97+xe3ddj9jQR2pQCeFGfpTpBiVwPWvxj/4LV+NPHng34NfDuL4f6/qXhy51jxhFpstzpd3PZyeXPaTqNzQPGzKj7X2E4JUV7vFOfRyvL6mPlHmUFtt1SPz7wi8OanFvEeF4do1VSdZtczV0rRctlbsfs0cr94Y/Civ5rPino37T3/BML44/DLVtJ+NuqfErRvHOuW+k6joOvzSXEjQmRY5JI45JZioxKNssZjKSBAwkVto/pUZdjsn90kc+xrg4X4qeYTrYerRdKpSaUotp7q6s46bfcfQ+Lng9HhmjgcwweNhisLi4ydOpGM4fw5ck1KE0pRae26a2Ep2yT+6fyrl/GkkkHg/V5oWKMlhdsrKcEFbeQggjoQemOlfyP/sd6TpX7QHwisvF/wAYv2wNf8B+JLm7ltP7Fk16fzSqMFik2y3Yf95ngAAelefxbxw8sxNHCQo88qik178YJcvKt5adT6PwY8AYcV5Tjc4xGN9hSw0qcHajUrNuopte7STaS9m7u1ttj+wb2oUFjhRn6Vw3wy8IXvw9+HGgeAtS1S41u40TTrWwk1G8YtcXb28SxmeZiSTJIV3OSTya/LD/AILYeNfHPgP9mHwzqvw+1zUNAvJvF1lbyXGm3c9nK0MkE4aNnt3jYoeDtJxkA4yBj2+IeIP7NyypmNWHwRu4/dpfY+B8NPDf/WfirD8MYSuo+2nyRqOLtbW0uXR6pbdD9iDlfvDFFfzj/HH9nD9u39gP4ZXH7UXwc+Oeu+N7PQ47e51bR/EBkuYntyVZmCXEsqPHzslVDDKEJdH3KFP7v/Aj4r6Z8dPgv4V+MmjwNa23ijS7bUkgfBaHz0DNGSOPkORXn8O8XSxmJngcTQdGrFKVm004vS6cdPJ7WPpvE3wWp5HldDPspzCnjcHUnKlzwjODhUilJwnCaTV4u8WrppdND1miiir49/5FNX/t3/0pHP8AR1/5LLB/9v8A/pqZ/9X9AKKKK/2AP+J8/Ff/AIKi+DPi94F+Nnwb/bY+GPh2fxXY/Da6uo9X0+zV5LgQ3O3bIiRqzFSu9dwGFbaWwmSPl/4ofGfx7/wVL/aR+EHg/wCEPw98Q+GvDPgbXYvEGsalr9r5AQQyRuyq6FowAibFXdvd3BACqxr+kgEqcqcH2pqrtXYOB1wAAPyGK/MM38OJYrE1ZxxLjRqyjKcOVauNlpLdJ8quf1jwT9J6llGVYKlUyyFTG4KnVpYeu5yXJCq5t81Ne7NxdSfK7rdXWg9judm9TX4p/wDBb3wZ4h8d/BX4daD4fsbu98zxgi3P2KCSaSG3ksp4pJcRKxXaH4YjAOK/augZHKnH0r67irII5pl9TL5S5VNJXt2ae2nY/GfB7xHqcIcSYXiOjS9pKg21G9r3i472drX7H8zU/wCxZ4U/Yc/4KUfDbxdr+kap47+H+twtFaanqCTX82j6qpVUnuDFG6lInMbReYQih3fgx5P9MgzkhuoPPfmlBZfunFIBiuDhPg3D5N7eGF+CcuZK3w6JWv1WmnbY+h8ZfG/MeN1gK+bpuvh6XsnPmuppSbjLkslCVnaXLpKydkzmPG6s3gvWVQZJ0+8AAGf+XeTGAK/kE/Y08Rfsy/BX4QWGhftD/s3eK/GXjC3vJLv+1rXSpcKhYNCgMjRNujx2XHoTX9kdO8yX++a87i/gZ5piqOKhUUXTUlrCM173L0eityn03gp9ICHCWU47J6+FlVp4mVOTdOvPDyXs1NJc1NXafPqttEcP8NfGsfxJ+HmhfESGwutLTXtPttQWzvk8u5txcxLIIZ0H3ZI921x2YEV+UH/Bb7w94h8SfsueGLLw1p91qU8fi+xmaKzt5rlwkdvcMWKQI7BeMZIxkgdxX7K/WgEj7px9K93iPh/+0crqZdUnbnja9vTW36H574Y+I8eF+K8NxNhqHMqM+eNPm6a2jzWeye9uh/Ob+0P+2n8f/wBuz4Tz/smfs1fBrxXo914qjisdR1bXrX7Pb21sgXf83+rVX24dmYMqZKKzYFfub+zx8JYPgP8AArwh8Gbe4+2L4Y0m204z4x5rwoA7gdgzZx7V7MWcrtZiR6dvypvSvP4e4TqYXFTx+MrurVlFRvZRSitbKK892fSeJXjHhc3yihw9kmXxweEpzlVcVOVSU6kko80py10iuWMUklqLRRRWnHv/ACKav/bv/pSOf6Ov/JZYP/t//wBNTP/W/QCiiiv9gD/ifCiiigAooooAKKKKACiiigAooooAKKKKACiiivj+Pf8AkU1f+3f/AEpH7Z9HX/kssH/2/wD+mpn/2Q==
4+
5+
" alt="Unstructured Logo" />
6+
</div>

Diff for: test_unstructured/partition/test_auto.py

+13
Original file line numberDiff line numberDiff line change
@@ -627,6 +627,19 @@ def test_auto_partition_pdf_element_extraction(extract_image_block_to_payload: b
627627
)
628628

629629

630+
def test_auto_partition_html_element_extraction():
631+
extract_image_block_types = ["Image"]
632+
633+
with tempfile.TemporaryDirectory() as tmpdir:
634+
elements = partition(
635+
example_doc_path("html-with-base64-image.html"),
636+
extract_image_block_types=extract_image_block_types,
637+
extract_image_block_to_payload=True,
638+
)
639+
640+
assert_element_extraction(elements, extract_image_block_types, True, tmpdir)
641+
642+
630643
def test_partition_pdf_does_not_raise_warning():
631644
# NOTE(robinson): This is the recommended way to check that no warning is emitted,
632645
# per the pytest docs.

Diff for: test_unstructured_ingest/src/local.sh

+1-1
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,6 @@ PYTHONPATH=${PYTHONPATH:-.} "$RUN_SCRIPT" \
3232
--input-path example-docs \
3333
--work-dir "$WORK_DIR"
3434

35-
"$SCRIPT_DIR"/check-num-files-output.sh 13 $OUTPUT_FOLDER_NAME
35+
"$SCRIPT_DIR"/check-num-files-output.sh 14 $OUTPUT_FOLDER_NAME
3636

3737
"$SCRIPT_DIR"/evaluation-ingest-cp.sh "$OUTPUT_DIR" "$OUTPUT_FOLDER_NAME"

Diff for: unstructured/__version__.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = "0.16.26-dev1" # pragma: no cover
1+
__version__ = "0.16.26-dev2" # pragma: no cover

Diff for: unstructured/partition/auto.py

+2
Original file line numberDiff line numberDiff line change
@@ -283,6 +283,8 @@ def augment_metadata(elements: list[Element]) -> list[Element]:
283283
partitioning_kwargs["languages"] = languages
284284
partitioning_kwargs["starting_page_number"] = starting_page_number
285285
partitioning_kwargs["strategy"] = strategy
286+
partitioning_kwargs["extract_image_block_types"] = extract_image_block_types
287+
partitioning_kwargs["extract_image_block_to_payload"] = extract_image_block_to_payload
286288

287289
partition = partitioner_loader.get(file_type)
288290
elements = partition(filename=filename, file=file, **partitioning_kwargs)

Diff for: unstructured/partition/html/partition.py

+4
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,8 @@ def partition_html(
3737
detection_origin: Optional[str] = None,
3838
html_parser_version: Literal["v1", "v2"] = "v1",
3939
image_alt_mode: Optional[Literal["to_text"]] = "to_text",
40+
extract_image_block_to_payload: bool = False,
41+
extract_image_block_types: Optional[list[str]] = None,
4042
**kwargs: Any,
4143
) -> list[Element]:
4244
"""Partitions an HTML document into its constituent elements.
@@ -86,6 +88,8 @@ def partition_html(
8688
detection_origin=detection_origin,
8789
html_parser_version=html_parser_version,
8890
image_alt_mode=image_alt_mode,
91+
extract_image_block_types=extract_image_block_types,
92+
extract_image_block_to_payload=extract_image_block_to_payload,
8993
)
9094

9195
return list(_HtmlPartitioner.iter_elements(opts))

0 commit comments

Comments
 (0)