You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Apologies for the delayed response. Due to some matters in 2023, I was unable to reply in a timely manner, and I apologize once again. Regarding your questions:
Due to our lack of experience during the initial exploration, we did not consider data augmentation, which would undoubtedly enhance the diversity of the dataset.
Recently, there has been some work on data augmentation in the cybersecurity field as well. IBM’s CyberPal mentioned specific details on this topic (unfortunately, the article is closed-source, and the code and dataset were not published). We are working on reproducing it and plan to release our dataset and corresponding code. We look forward to your continued interest in our work.
很棒的工作,有两个疑惑希望作者帮助解答下:
1、类似的行业大模型会采用先增训再用指令数据集SFT的方案,请教下这里为什么考虑直接使用SFT呢?
2、SFT方案对安全领域的知识扩充是否足够,不知道作者有没有这方面的实验,多谢
The text was updated successfully, but these errors were encountered: