Alibaba staffer offers a glimpse into building LLMs in China

Chinese tech companies are gathering all sorts of resources and talent to narrow their gap with OpenAI, and experiences for researchers on both sides of the Pacific Ocean can be surprisingly similar. A recent X post from an Alibaba researcher offers a rare glimpse into the life of developing large language models at the e-commerce firm, which is among a raft of Chinese internet giants striving to match the capabilities of ChatGPT.

Binyuan Hui, a natural language processing researcher at Alibaba's large language model team Qwen, shared his daily schedule on X, mirroring a post by OpenAI researcher Jason Wei that went viral recently.

The parallel glimpse into their typical day reveals striking similarities, with wake-up times at 9 a.m. and bedtime around 1 a.m. Both start the day with meetings, followed by a period of coding, model training and brainstorming with colleagues. Even after getting home, they continue to run experiments at night and ponder ways to enhance their models right up until bedtime.

The notable differences lie in how they characterize their leisure time. Hui, the Alibaba employee, mentioned reading research papers and browsing X to catch up on "what is happening in the world." And as one commenter pointed out, Hui doesn't have a glass of wine after he arrives home like Wei does.

This intense work regime is not unusual in China's current LLM space, where tech talent with top university degrees is joining tech companies in droves to build competitive AI models.

To a certain extent, Hui's demanding schedule seems to reflect a personal drive (or at least the social media appearance of one) to match, if not outpace, Silicon Valley companies in the AI space. That sets it apart from the involuntary "996" work hours associated with more "traditional" types of Chinese internet businesses that involve heavy operations, such as video games and e-commerce.

Indeed, even renowned AI investor and computer scientist Kai-Fu Lee puts in an incredible amount of effort. When I interviewed Lee about his newly minted LLM unicorn 01.AI in November, he admitted that late hours were the norm, but employees were willingly working hard. That day, one of his staff messaged him at 2:15 a.m. to express his excitement about being part of 01.AI’s mission.

Outward displays of intense work ethic speak to the urgency of the remits laid out by tech firms in the country, and subsequently the speed with which those firms are now rolling out LLMs.

Qwen, for example, has open sourced a series of foundation models trained on both English and Chinese data. The number of parameters -- a rough measure of how much a model has learned from its training data, which shapes its ability to generate contextually relevant responses -- is 72 billion for the largest of these. (For some context, OpenAI's GPT-3 is believed to have 175 billion parameters, while GPT-4, its latest LLM, is reported to have 1.7 trillion. However, it's arguable that what a particular LLM is designed to do matters more than a high parameter count.)

The team also has been quick to introduce commercial applications. Last April, Alibaba began integrating Qwen into its enterprise communication platform DingTalk and online retailer Tmall.

No definite leader has emerged in China's LLM space so far, and venture capital firms and corporate investors are spreading their bets across multiple contenders. Besides building its own LLM in-house, Alibaba has been aggressively investing in startups such as Moonshot AI, Zhipu AI, Baichuan and 01.AI.

Facing competition, Alibaba has been trying to carve out a niche, and its multilingual move could become a selling point. In December, the company released an LLM for several Southeast Asian languages. Called SeaLLM, the model is capable of processing information in Vietnamese, Indonesian, Thai, Malay, Khmer, Lao, Tagalog and Burmese. Through its cloud computing business and acquisition of e-commerce platform Lazada, Alibaba has established a sizable footprint in the region and can potentially introduce SeaLLM to these services down the road.