Raymond Ng

RaymondAISG

AI & ML interests

Foundation Models; Natural Language Processing; Deep Learning

Recent Activity

upvoted an article 7 days ago

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

new activity 7 months ago

common-pile/foodista_filtered: ArrowInvalid Exception: Failed to parse string: '' as a scalar of type timestamp[s]

liked a Space 11 months ago

nanotron/ultrascale-playbook

upvoted an article 7 days ago

Article

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

Sep 10, 2025

New activity in common-pile/foodista_filtered 7 months ago

ArrowInvalid Exception: Failed to parse string: '' as a scalar of type timestamp[s]

#2 opened 7 months ago by RaymondAISG

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters

New activity in aisingapore/Llama-SEA-LION-v3-70B-IT 11 months ago

Instruction and answer must be in the same language?

#1 opened 11 months ago by rub2000

New activity in aisingapore/Gemma-SEA-LION-v3-9B-IT 11 months ago

Update config.json so that it can be run by llm serving engine

#3 opened 11 months ago by Pinkgu1

Update config.json

#2 opened 11 months ago by Pinkgu1

updated a collection 12 months ago

RLMs

Collection

2 items • Updated Jan 24, 2025

upvoted a paper 12 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 433

authored a paper about 1 year ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 19

New activity in aisingapore/SEA-LION-v1-7B-IT about 1 year ago

Model doesn't run under HF's Transformers / Inference Endpoints

#9 opened over 1 year ago by gtie

New activity in aisingapore/SEA-LION-v1-7B over 1 year ago

Is SEA-LION trained on Singaporean culture?

#13 opened over 1 year ago by SBSTFRNNDZ

updated 4 models over 1 year ago

liked 3 models over 1 year ago

aisingapore/Llama-SEA-LION-v2-8B

Text Generation • 8B • Updated Apr 15, 2025 • 64 • 4

aisingapore/Llama-SEA-LION-v2-8B-IT

Text Generation • 8B • Updated Apr 15, 2025 • 578 • 17

aisingapore/SEA-LION-v1-7B-IT

Text Generation • 8B • Updated Apr 14, 2025 • 828 • 24

New activity in aisingapore/SEA-LION-v1-7B over 1 year ago

python llama.cpp/convert-hf-to-gguf.py ~/sea-liton-7b/ error

#10 opened almost 2 years ago by pacozaa

New activity in aisingapore/SEA-LION-v1-7B-IT over 1 year ago

System Prompt

#5 opened over 1 year ago by anhnh2002
