XMAS CTF 2021

This capture the flag competition was the first I ever competed in (in 2020), and contains some of the most creative and varied challenges I’ve seen thus far. I competed individually and had other commitments during the competition times, so I didn’t invest as much time as I would have liked, however the experience was still quite entertaining and educational.

Discord

Sanity Check

The Sanity Check flag in the discord was hidden in the #general channel’s description (as it was last year). People have a surprisingly hard time finding this, overthinking and believing it has something to do with a bot, which created quite a bit of spam in the channels (something the hosts might benefit from mitigating in subsequent years).

Bot

Upon seeing the Mem-X challenge, I tentatively identified as based on a file inclusion vulnerability, and proceeded as such. The basis of the challenge was bot functionality to make notes using the !note [name of note] command, which returned a hash of the note. The user could then read the note by passing the hash to the !remember command.

After playing with different parameters, trying variations of flag.txt (!note flag.txt, !note ../flag.txt, !note flag) and using the received hashes as parameters to !remember, I realised I may have been going about this the wrong way. I considered that the notes may have been saved with their hashes as filenames and tried !remember flag, receiving “We couldn’t find a note called flag.txt”. I then sent !remember /flag and received the flag! X-MAS{f0rgEtt1nG_EvEry7h1Ng_abf91b10e019c}

Bioinformatics

Having a BLAST

Given a DNA sequence of an enzyme, the goal was to find the enzyme encoded.

CGCTTCCTCCCCAAATTGCTCAGCGCCACCGGTATGCAGGGGCCAGCGGGCAGCGGCTGGGAGGAGGGGAGTGGGAGCCCGCCAGGTGTAACCCCTCTCTTCTCCCCCTAGCCTCGGAGGCTCCCAGCACCTGCCCAGGCTTCACCCATGGGGAGGCTGCTCGGAGGCCCGGCCTCCCCCTGCCCCTCCTCCTCCTCCACCAGCTTCTCCTCCTCTTCCTCTCCCACCTCCGGCGGCTGTGAACACGGCCTCTTCCCCTACGGCCACAGGGGCCCCTCCTCTAATGAGTGGTCGGACCGTGGGGAAGGGCCCCACTCAGGGATCTCAGACCTAGTGCTCCCTTCCTCCTCAAACCGAGAGACTCACACTGGACAGGGCAGGAGGAGGGGGCCGTGCCTCCCACCCTTCTCAGGGACCCCCACGCCTTTGTTGTTTGAATGGAAATGGAAAAGCCAGTATTCTTTTTATAAAATTATCTTTTTGGAACCTGAGCCTGACATTGGGGGGAAGTGGGAGGCCGGACGGGTAGCACCCC

My first approach was plain OSINT, and after looking through search results and databases (far too many databases), I reread the challenge title and focused on the EMBL-EBI BLAST database. After looking around the website for a bit, I ended up just querying all of the relevant databases and after around 15 minutes of “panicking” that the query would take days, I got a match, receiving my flag: X-MAS{acetylcholinesterase}.

A Putative Sequence

The request in this challenge was to find spike glycoprotein (as a polypeptide) from the SARS‑CoV‑2 B.1.617.1 lineage, not including the stop codon. This was done with more OSINT (I see why it’s called bioinformatics now), initially with the website of European Centers of Disease Control, then to pleasantly reliable wikipedia. From there I recovered the main mutations: L452R, D614G, P681R, E154K, Q1071H and E484Q. I then had to painfully try to figure out how to apply them manually (I found this site useful). By manually I mean in a python3 REPL using string indices…

After doing that, I submitted the flag. FAIL. Turns out I was missing some mutations (G142D and T95I), using the right column of the below wikipedia table, which was missing the mutations shown in the stanford-sourced visualisation.

	Nucleotide	Amino acid
Spike	T21895C	-
	T21895C	E154K
	T22917G	L452R
	G23012C	E484Q
		D614G
	C23604G	P681R
		Q1071H

After finally getting all 8 mutations and substituting them into the given amino acid, I was left with the flag:

X-MAS{
MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVT
WFHAIHVSGTNGTKRFDNPVLPFNDGVYFASIEKSNIIRGWIFGTTLDSKTQSLLIVNNATNV
VIKVCEFQFCNDPFLDVYYHKNNKSWMKSEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNL
REFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPG
DSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQT
SNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFK
CYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNL
DSKVGGNYNYRYRLFRKSNLKPFERDISTEIYQAGSTPCNGVQGFNCYFPLQSYGFQPTNGVG
YQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGR
DIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLT
PTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSRRRARSVASQSII
AYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQY
GSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIE
DLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTI
TSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASAL
GKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYV
TQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAH
EKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVN
NTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESL
IDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDD
SEPVLKGVKLHYT}

Forensics

Given a pcap, find the flag. Pretty typical forensics challenge, honestly. I opened the pcap in termshark (I like minimalism and terminal apps and in this case wasn’t sacrificing functionality), and did a quick search for X-MAS{ because sometimes it’s that easy. Spoiler: this time it wasn’t.

So I conducted a somewhat deeper (but still quite shallow) analysis, noting that the pcap consited of a TCP stream between 2 IPs, and that there was no readable ascii. I decided to try reading the pcap without termshark (so that certain parts aren’t hidden by text encoding and the program deciding whether or not it wants to show me data - I wanted more control), so I ran it through strings, redirecting into a file which I scrolled through quickly to see if there was anything suspicious (doing some cleanup of duplicate lines as well)… and there was in fact something.

At the very end of the file, a few lines caught my eye

[12;1HPlease enter the flag fragment (flagment) to continue (10/10). 
Your hint is "ef" >> 
#(vI
#(vI
|S@vI
Thank you!
You sent: ###################### (redacted for security reasons).
Goodbye.

Through a quick search of /10) I found the rest of the fragments, and was left with this:

[12;1HPlease enter the flag fragment (flagment) to continue (1/10). 
Your hint is "X-" >> z
[12;1HPlease enter the flag fragment (flagment) to continue (2/10). 
Your hint is "AS" >>
[12;1HPlease enter the flag fragment (flagment) to continue (3/10). 
Your hint is "u_" >>
[12;1HPlease enter the flag fragment (flagment) to continue (4/10). 
Your hint is "K4" >>
[12;1HPlease enter the flag fragment (flagment) to continue (5/10). 
Your hint is "th" >>
[12;1HPlease enter the flag fragment (flagment) to continue (6/10). 
Your hint is "d3" >>
[12;1HPlease enter the flag fragment (flagment) to continue (7/10). 
Your hint is "af" >>
[12;1HPlease enter the flag fragment (flagment) to continue (8/10). 
Your hint is "03" >>
[12;1HPlease enter the flag fragment (flagment) to continue (9/10). 
Your hint is "81" >>
[12;1HPlease enter the flag fragment (flagment) to continue (10/10). 
Your hint is "ef" >>

Putting that data together left me with an incomplete string that didn’t appear to match up with the supposed length of the flag (from You sent: ######################). I knew I was missing something, but I wasn’t sure what.

## ## ## ## ## ## ## ## ## ## ##
X- AS u_ K4 th d3 af 03 81 ef

After some head-banging-on-my-desk, I realised my lack of characters could have been caused by encoding issues (sigh) so I tried various ways of recovering data (cyberchef, hexed.it, file) but they all failed. I’m fairly certain there were intended to be pieces of data following the >> in the capture, but I couldn’t seem to get them to present as anything useful so /shrug. This challenge was unnecessarily painful for me so I tried not to waste too much time on it, which brings me to my favorite problem, one that had me thinking about design and math and o-notation … programming!

Programming

This problem was a fun Chrismas-themed prime puzzle, with stringent time constraints.

The description (paraphrased):

In the North Pole there are two elements: Naughtyium and Niceium. Naughtium has a prime number of neutrons, while Nicium’s count is composite. Given a collection of numbers (prime and composite), find the Nth possible combination of Naughtium and Niceium (one prime and one composite) [Nth with respect to ascendant value order]

My first approach involved a basic prototype in Python to get used to the infrastructure. I pulled the numbers and N from the TCP socket and ran them through an unoptimized and naive primality testing function. This got me through 5 of the ten challenges in the alloted time (120 seconds).

from pwn import *

def is_prime(n): 
    for i in range(2, int(math.sqrt(n))):
        if (n%i==0):
            return False
    return True

conn = remote("challs.xmas.htsp.ro", 5006)


others=[]
primes=[]
i = 0

conn.recvline()
conn.recvline()
conn.recvline()

while(True):
    line = conn.recvline().decode()
    #print(line)

    if "numbers" in line:
        line = line.replace("numbers = [", "").replace("]\n", "")
        nums = [int(x) for x in line.split(", ")]
        #print("NUMS="+str(nums))
    elif "queries" in line:
        line = line.replace("queries = [", "").replace("]\n", "")
        queries= [int(x) for x in line.split(", ")]
        #print("QUERIES="+str(queries))
        for num in nums:
            if is_prime(num):
                primes.append(num)
            else:
                others.append(num)

        print("DONE")
        ultimate = []
        for x in primes:
            for y in others:
                ultimate.append(x+y)

        ultimate.sort()
        ans = ""
        for x in queries:
            ans += str(ultimate[x-1]) + ", "
            
        ans = ans[:-2] 
        #print(ans)
        conn.sendline(ans)
        conn.recvline()

        others = []
        primes = []

conn.close()

I noticed that the number sets started out relatively small and in small groups, but very quickly ramped up in volume, value, and therefore difficulty, and that my program quite clearly wasn’t able to handle that. I then looked into just getting a collection of the top 100,000 primes and loading them into a dict to have O(n) prime-testing time, however I misread the constraints and I would have needed the top 1,000,000 primes, which wasn’t a viable approach (though programmatically generating them as opposed to downloading and reading them (which probably would have been faster) was something that remained in the back of my mind).

I did a fair amount of research in an attempt to find a faster prime function, but the basic improvements I implemented were still not good enough. I also benchmarked and realised that as the size of the sets increased, programmatically adding all of the primes and composites was a bottleneck, and also that it would likely become necessary to write the program in a compiled language so that interpreter overhead wasn’t making my life more difficult. I began an implementation of Erathmuses’ sieve but quickly got pulled into researching probablistic primality tests.

I also tried writing the program in Rust and improving (decreasing) the number of necessary addition instructions, but that was unsuccessful. Ultimately my completion of this task remained theoretical, as I ran out of time to get something optimized working.

I’m writing this after the challenge’s end, and looking back I should have written it in c/cpp, leveraging someone else’s optimized code (seeing as this is a CTF not a programming challenge) for the “difficult parts”, however I’m still not sure if there are any ctf or networking libraries that would simplify reading from the TCP socket.

I had yet to see a writeup for this challenge, so I messaged the creator and apparently it was based on a problem proposal for the Romanian IOI (International Olympiad of Informatics). I obtained a copy of the original problem for the purpose of perusual of different strategies and point values for each, and am hosting it here.

Overall I quite enjoyed the problem and wish I had more time to hack on it, as getting that probablistic test (and subsequent optimized addition/searching of the identified values) working would have been interesting.