C Gibberish to English

(cdecl.org)

54 points | by warkanlock 3 days ago ago

39 comments

ashleyn 7 hours ago

   char (*(*x[3])())[5]

I'm more of the mindset that writing something like this is probably a code smell to begin with. Is there any reason I'm not thinking of right now, that this couldn't be typedef'd and refactored into something far more readable?

C gets a lot of blame for pointer gibberish like this but quite honestly you can write gibberish in any language. I don't see any fundamental or technical reason you couldn't write clean, readable C.

[-]

theanonymousone 4 hours ago

I'm not a C programmer but your comment reminds me of all the lame jokes people make about the German language.

jacoblambda 5 hours ago

the function signature certainly should be type def-ed. i.e.

    typedef char (*fn())[5];

and then you have the original as

    fn x [3];

[-]

fsckboy 5 hours ago

I don't keep up with the latest C shenanigans, cuz I like C the way it was, but have they change something where "pointer to an array[5]" is a meaningful distinction to draw?

I mean, "a pointer to an array of X" is simply "a pointer to X" and using hungarian notation, you can encode the knowledge "this pointer can be incremented" into its name.

and typedef'ing *function declarations? who has families of functions with the same type signatures that they want to point to?

[-]

jacoblambda 5 hours ago

> but have they change something where "pointer to an array[5]" is a meaningful distinction to draw

This has been a meaningful distinction since at least C99.

> "a pointer to an array of X" is simply "a pointer to X"

They aren't actually the same. The former can be decayed into the latter but they aren't actually equivalent and you'll get type mismatches if you try to treat them as the same thing.

> who has families of functions with the same type signatures that they want to point to

An example is callbacks or handlers for responding to interrupt requests. A lot of hardware interface code relies on typedef-ed function decls because very often you are passing user side functions through your interface so that you can stash those function pointers somewhere and invoke them when some event occurs.

kelnos 3 hours ago

> "a pointer to an array of X" is simply "a pointer to X"

I don't believe these two are the same. An "array of X" indeed decays to a "pointer to X". But a "pointer to an array of X" is something else. E.g.

    int foo[3];  // array of int
    int *foo;    // pointer to int
    int *foo[3]; // pointer to array of int

Perhaps the first two are what you mean, though, and this is just a terminology issue.

[-]

fsckboy an hour ago

  int *foo[3]; // pointer to array of int

that's an array of 3 pointers to ints. if you pass foo as an argument you get a pointer to a pointer to an int (with knowledge if you can hang onto it that there are more pointers to ints lined up in memory)

pavlov 11 hours ago

Use typedef?

Granted, the function pointer syntax is forever confusing (to me anyway). The rest is easily tackled by naming things.

Even for function pointers, it’s just one lookup and then you can copy-paste the typedef for any other function pointer types in the project.

[-]

khrbtxyz 7 hours ago

Function typedefs make this less confusing by removing awkward parentheses. e.g.

  typedef int read_block_fn(void *context, u8 *buf, unsigned int block, size_t len);

https://github.com/torvalds/linux/blob/0a9b9d17f3a781dea03ba...

dapperdrake 10 hours ago

Reading C Type Declarations: http://unixwiz.net/techtips/reading-cdecl.html

(NOT the author. It simply helped me.)

sebstefan 10 hours ago

>Use typedef?

What if you're given somebody else's code and you need to understand it to put a typedef there

[-]

jacoblambda 5 hours ago

The trick is to just assert equal types. Most compilers have extensions that allow you to easily compare type equality (and C23 actually standardizes the `typeof()` operator).

So you basically take your ugly type, put it in a #define and then create a static assertion that matches the type against said ugly type.

Now the compiler will throw a shit fit if the types don't match. Have fun breaking the type up into smaller pieces until you have something legible.

DaiPlusPlus 10 hours ago

Welp, if it’s that bad then shrug and use a #define

(Yes, this is a joke)

vzaliva 5 hours ago

Is it just a web wrapper around good old `cdecl` command? Or it does something different/better?

elcritch 8 hours ago

Handy site!

Next I want one to explain some of Rust’s more cryptic pointer gibberish. Usually I just hit “use suggested fix X” until the compiler’s happy.

[-]

Thiez 6 hours ago

Rust isn't so bad, is it? The example of `char ((x[3])())[5]` would translate to `[fn() -> [fn() -> u8; 5]; 3]`. It's inherently an ugly type, but I think it's easier to read than the C version.

card_zero 10 hours ago

Is there a language that's substantially free of gibberish?

[-]

chongli 8 hours ago

Racket Beginning Student [1] language.

[1] https://docs.racket-lang.org/htdp-langs/beginner.html

pjmlp 8 hours ago

Those of Algol/Wirth linage, or influenced by then.

Then again people complain that they are too verbose, and they rather write in hieroglyph friendly languages.

[-]

Joker_vD 8 hours ago

Compare

   char (*(*x())[5])();

and

   var x: pointer to func() pointer to array[5] of pointer to func() char;

or if you wish to replace some keywords with glyphs:

   var x: ^func() ^[5] ^func() char;

And it's always a nice puzzle for the reader to explain why there are three "pointer" in cdecl output and three carets in the ALGOL-like declaration, but only two asterisks in the C declaration.

[-]

trealira 8 hours ago

In this case, the C declaration doesn't match the other two. The variable x is a function that returns a pointer to an array of 5 pointers to functions returning char. Indeed, that's what cdecl.org says:

  declare x as function returning pointer to array 5 of pointer to function returning char

Using the notation you did, that would be:

  var x: func() ^[5] ^func() char

There are only two arrows there now.

If you wanted a pointer to a function like this, you would need a third asterisk in the declaration:

  char (*(*(*x)())[5])();

[-]

Joker_vD 7 hours ago

Oh, good catch, thank you! But I remember an example with some other tricky C expression/type declarator where the number of actual dereferences differed from the amount of asterisks in the code.

> Using the notation you did, that would be:

Well, it'd be

    func x() ^[5] ^func() char; ...

because it's a function declaration, after all, not a variable.

unrealhoang 6 hours ago

for declaring function pointer?

Any language with type after name :

    // c
    char (*(*x[3])())[5];
    // golang
    var x [3]func() *[5]byte

dogleash 8 hours ago

Depends on the features you want.

marginalia_nu 9 hours ago

Yeah sure, COBOL is great in that regard. Basically reads as English.

pwilson7 8 hours ago

nope

ngcc_hk 3 days ago

May need some bracket first then English.

djmips 10 hours ago

Is this largely supplanted by LLMs?

[-]

veltas 9 hours ago

The difference is I trust this website, and wouldn't trust an LLM.

marginalia_nu 9 hours ago

LLMs are mostly correct with regards to this stuff.

cdecl is always correct with regards to this stuff.

I don't know why you'd choose the former.

[-]

Y_Y 8 hours ago

Because if you don't do things exactly the way cdecl wants you get a Syntax Error

[-]

marginalia_nu 6 hours ago

Wouldn't any explanation given, apart from syntax error, be wrong in the case you provide it with an invalid syntax?

[-]

Y_Y 6 hours ago

I was thinking of the issue of going from English to C. Even when you out in malformed C it's nice to get either a compiler error which explains the problem, or even better an LLM that can suggest how to fix it.

andix 9 hours ago

Probably.

Output for the example I got on opening the website:

  char (*(*x())[5])()

cdecl.org: declare x as function returning pointer to array 5 of pointer to function returning char

ChatGPT: x is a function that, when called, gives us access to 5 functions that each return a character. (TL;DR, it gave a full explanation too)

Like mentioned before the error rate of LLMs is probably much higher on complex expressions.

makach 8 hours ago

I can read that.

I don't think it is gibberish. It's code and in order to read that code you need to understand the language, and to understand language you need learning and experience.

Maybe it can be useful for learning, but if you have to use such tool, I suspect you won't understand it anyway - so in a way it is more a gibberish-to-gibberish translator.

[-]

tuveson 6 hours ago

I disagree. The conventions for declaring arrays, pointers, and function pointers are all idiosyncratic. In C, the type is always to the left of the variable being declared. Except for arrays, which have part of the declaration to the right. And except for pointers, which need to be affixed to every item if there are multiple declarations. And except for function pointers where you need to wrap the variable name like (*name). Individually I can wrap my head around these exceptions, but putting all of them together, it's just hard to read.

Brian_K_White 5 hours ago

It takes a pretty smart person to do that. Which is pretty confusing.

How can such a smart person not not understand how all things that are possible are not all equally good?

The fact that both the compiler and you can parse that doesn't make it a good way to document or convey meaning or intent.

C is chock full of inconsistencies and ambiguities that are only disambiguated by essentially being a human compiler and maintaining the same parsing state-machine manually in your head to know what any given "(" actually means or does. As a self-proclaimed fluent C linguist, you know this better than most.

All coding involves that of course but all implimentations are not equally unhelpful.

The cpu and some people can read the binary itself. They just need to know the cpu's opcodes, documented right in the datasheet that anyone can read.

lynx23 10 hours ago

Why is it that the first thing I try tends to uncover shortcomings?

typedef uint64_t qbb_t __attribute__((vector_size(sizeof(uint64_t) * 4)))

Syntax error

OK, its an extension, meh.

[-]

veltas 9 hours ago

Looking at the project issues, it seems it supports only 1989-era C, and some Apple stuff, before you even get to attributes.