Anuket Project

New Time: 6AM Pacific Wednesday. 3PM CET, 2PM GMT, 1930 India time    US is on Standard Time. Pacific time is UTC+0800.

Attendees

Al Morton Sridhar Rao Luc Provoost Luc Provoost 

4 main topics today: PROX, Internship, work in BMWG, Nile release, UNH Transition.

ItemDescwhoNotes/minutes

Special Topic: Containerizing PROX   

Luc, Sridhar, Trevor, and all

Update on  

Trouble finding the old logs: maybe in mail from Daniele

SR - maybe give a presentation on AF_XDP  https://github.com/intel/afxdp-plugins-for-kubernetes  Cillium not working with DPDK - talking with Cillium group to sort this out - maybe needs telco requests.

Also - Korea group is comfortable with Trex, that may be all

Futurewei - has a solution for networking, but does it work with DPDK? https://github.com/CentaurusInfra/mizar SR is exploring teh solution and will connect if it makes sense.

Notes from  

Sridhar tested - same issue

within container, start prox, many screen feedbacks - log file has been stored and shared. -Luc will look at this.

Need to check DPDK, there may be something broken in DPDK? 

Luc will look at log file and nail down driver files, vswitch, maybe try testPMD instead of prox  on the same container? - Sridhar will test this. If works, how many cores are used? -they are equally divided across the interfaces.



Intern Update   

Shivank

Update 

ACTION: Propose to Tim that we complete the Pod 12 work at end of January.   Need time to transition to UNH AFTER the intern project is over.

Status: Status Check from Shivank - needs to be completed Shivank has an exam today...  Needs to take-over the work.

First look at status - then we will review the Intern feedback form.


Notes from 

Tim Gresham wants to know time horizon for the completion of the Intern project.

Sridhar will try UNH - Sawyer or Lincoln Lavoie can get us started. Al sends mail with requirements eventually....

IXIA was loaned to LF, intel just hosting it. Maybe it can be moved to UNH! need to investigate.

Shivank: Status:  Exams in progress.  3 tasks need to be completed.  Where are we now?

  1. Infrastructure setup: Install
    1. OS
    2. Software
    3. cloud, K8s
  2. Test Setup (DUT, Testing tools)
    1. vswitch- kernel module - switching solution
    2. CNIs
    3. TGen Pods
    4. Forwarding Pods
  3. Test Runs
    1. Run Tests
    2. Modify 2 and repeat.

Shivank's Internship page with progress: Internship 2022 - Benchmarking eBPF based solutions

Internship 2022 - Benchmarking eBPF based solutions  (pdf)

If we can complete by end of the year - stay on Intel pod 12

Otherwise, Start to move to UNH LaaS


Notes from  

Tasks assigned are complete except  —

need to change pod manifest files

test setup with node 5, trex is not generating traffic

Is cluster up? no  do cleanup and do fresh installation -

Ansible scripts are failing at the same stage.  Need a call sometime and try to fix it together.  k8sclustermanagement.yaml is not working. call scheduled to fix.

Can we think of replacing CENTOS with Ubuntu? Yes we should do - CENTOS is very old...got to Ubuntu 22.04 LTS Server Edition.  Take a backup and re-install.


UNH transition requirementsAll

SR's requirements:

1 Node for Jump Host  (50% of the HW req below should be ok)

3 Nodes Kubernetes Cluster (1 Master and 2 Worker Nodes)

Requirements on Each Single Node Configuration of the cluster

1. 1x Gigabit ethernet for control-plane/internet/external-access (management)

2. 2 to 4 x SR-IOV compatible NICs for dataplane (testing)  At least 10GigE

3. 2 socket x (Intel Xeon E/Gold, (or ARM) at least 22 cores, at least 2.4 Ghz, approx 50mb Cache) 

4. 64GB to 128Gb Memory

5. 180GB SSD and 3TB SATA HDD storage

Networking between the nodes:

  1. need separate management network and dataplane/testing network
  2. What does the dataplane network include?  ToR switch? How many hosts share the ToR switch?
  3. VLAN tags needed?  How many?
  4. Direct cabling between NICs of Master and one worker nodes would be ideal.

WHAT about COSTS?  does TSC need to approve?

ACTION Al Morton  - fill-out request ASAP, then ping Lincoln and Sawyer on this.


Discussion: Contribution on Containerized network benchmarking in BMWG session IETF-115Al

Slides: Considerations for Benchmarking Network Performance in Containerized Infrastructure  

https://datatracker.ietf.org/doc/draft-dcn-bmwg-containerized-infra/

yangun@dcn.ssu.ac.kr  Sirdhar will contact.
mipearlska1307@dcn.ssu.ac.kr  presented at IETF-115

Nile Progress

Schedule

all

Notes on  

Re-prioritize this for December, complete dev and updates... with possible shift to next release. Prioritize results for December!


Review Sridhar Rao input  

Started Release Plan for ViNePerf Nile

Sridhar has submitted 2 new patches, which will complete the dev schedule - Al must review.  New SW versions and New OS versions. Adding Al as a reviewer.

Using DPDK build with  Meson and Ninja tools now.  new tools.  The build process is different for diff each OS now, diff folders for Fedora, etc.

Need a discussion with Fulvio on the approach to use with building the CNIs.  Sridhar will consider the questions to ask and try to exchange e-mail with Fulvio.

Pod 12 is fine now!  but we are the last user of the Intel community pods...   The IXIA license has expired.

Consider collaboration with Open Programmable Interface Project (Linux OPI) - need a member to join with us and start to identify common areas of interest. Sridhar will talk to Joe White, Dell TSC Chair.

Node 3 in POD 13 has build issues - need to update that system. system dependencies.  Build was successful in other locations, so we are interested to know where other projects are doing their builds. Maybe UNH Lab.



   Sridhar presented at the Tech Discuss!


eBPF  - final slides


eBPF topic (additional details)

Sridhar presented slides, which he used to familiarize Anuket with XDP and eBPF

L3AF project in LFN - life cycle management on eBPF programs

Cillium project has done some of their own benchmarking - but uses some unknown TCP: https://docs.cilium.io/en/stable/operations/performance/benchmark/ 

Not much uptake in telco - still many operations issues. The Cillium community agrees that they have not made a good case for telco usage (in conversations with Sridhar).


Shivank's Intern update

 

Here are Shivank's updates for this week.
https://wiki.anuket.io/display/HOME/Internship+2022+-+Benchmarking+eBPF+based+solutions
repo:
https://github.com/Alt-Shivam/Benchmarking-eBPF-XDP
Sridhar Rao commented: target to complete the Topology-1 (refer to slides that I had sent) first as baseline results to compare with previous results.


 

Shivank's CPU not supported by DPDK .  Need this for most of our benchmarking tools

New T-rex traffic Gen have been added and a new DPDK folder.

Sirdhar shared https://github.com/intel/afxdp-plugins-for-kubernetes  it is a possible option to use  Shivank will investigate.

Mail to Casey Cain about Intern compensation and feedback.


Shivank's report for last week:   

https://github.com/Alt-Shivam/eBPF-CNI

Pod 18 connectivity not working for several days, impacting Shivank's work. Tim G says working... Need Pod 12 working as well. There is an active JIRA ticket and comment stream.

Shivank testing with a VM and K8s cluster - can try out locally

Need to review mid-term Mentoring report - Al and Sridhar completed in last half of the meeting.

ACTION  Al Morton invite Maryam meeting to discuss performance (when we agree it's the right time).



FedericaReview Thesis proposal

Post review of Daniele's work  

closure to Daniele's work and see if there is anything worth publishing, may consider less-selective conferences if novelty is less

Sridhar

Some issues we dig into now

No reply from simonartxavier@gmail.com  and Luc? Al and Sridhar sent new messages.

Once we containerize prox, and we assign multiple cores to single interface, it crashes.  how can we run Prox with multiple cores in a container??



Tasks and Action Plan for Shivank
  1. Get access to the Testbed (Pod18 now)
  2. Demonstrate how to use Prox and T-Rex on Node-5
  3. Start with Baremetal and eBPF: Node-5 and Node-4
  4. Kubernetes Cluster Setup - Node-1 and Node-2: Install Necessary Components.

Open Programmable Interface Project (Linux OPI)

Change CPU NIC architecture, more autonomy and CPU power goes to NIC in OPI, more than smart NIC now. New Trust boundary between Autonomous NIC and CPU.

Sridhar

New project, governing board formed and TSC meeting set.

Opportunity for Benchmarking: work with us.

Need to know planned NIC speeds, Need NIC HW to benchmark.

There will be a test bed discussion, UNH and Keysight collaboration - using Keysight tools.  We want to join the collaboration. Maybe UNH would welcome our help to some degree. Need some time to understand how the DUT HW will be obtained and installed. Also Remote access (for us, like Intel labs).



Testbed - IXIA supportSridhar/Al

2nd ACTION: Do we have support from Ancuta or other IXIA person?  Al Morton sent e-mail

ACTION: License for IXIA HW - Check with Trevor

Trevor Cooper says connected and powered on, but we still might have a license issue. Need Pierre's help.

this activity seems to have stalled ...

Pod 19 also not accessible - Dan Xu.

ACTION:      Al: can you add a comment in the Jira issue.. INFRA-7 ?   As a PTL you can mention the dev is stuck with nodes being inaccessible.   https://jira.anuket.io/browse/INFRA-7



Additional Mentoring inputSridhar/Federica

  Federica

Past meeting: python scripts.  Possible combine 1 and 3

@Sridhar will expand on the proposal descriptions and Federica will begin to ask students about their interests.

python scripting with K8s is possible for some students in 3rd year. Projects must have resume value, add skill set or new experience.

Sridhar Rao created some projects.

ACTION: See proposals & review

https://wiki.anuket.io/display/HOME/Potential+Projects+for+Student+Volunteers


 Intern (background info)Sridhar/Al/Federica/Shivank
XDP performance Studies for Cloud-Native NFV Use Cases

"Maryam Tahhan joins the crew to talk XDP, AF_XDP, and fast networking"

Operations support seems to be a big issue, and performance is currently about 0.7 of VPP and DPDK




K8s on Pod 12Sridharwill look into it


Not DONE YET??? - Progress for Moselle (summary: items 5,6,and 7 lack the necessary automated address discovery feature, defer)

6/14 moved to 6/21 but no Moselle page update?  Patches have been submitted to Releng, but have not been committed yet. Release manager must give go ahead after TSC approval.

  Patches still open. possibly earlier, VinePerf Patches have been merged.

see Moselle Release

1Epic-VINEPERF-672:Create Tools as part of moselle release1. Done (Yet to submit the patch)
2. Another - Will not implement - Idea is already stale.

CCLA for Moselle release (LF as C)

CCLA issue resolved  

Patch is submitted - Done.



2Epic-VINEPERF-671:Add support for newer software versionsCompleted testing.Done


3Epic-VINEPERF-670:Create clean workflows for Baremetal, Openstack and Kubernetes UsecasesWorkflows are drawn-up. Implementation may need few more days. Request for 1 week more.

Minor work needed for K8s Workflow

DONE???  I think so.



4Epic-VINEPERF-669:Improve Stability for moselle Release1. Bug Fixes (2 patches)Done


5Epic-VINEPERF-652:Enhance XTesting-ViNePerf IntegrationMoved to Next Releasedepends on 7


6Task-VINEPERF-658:Enhance framework for XTesting-K8s UsecasePartially done (reading results from output), Deployment tool.1 task remains

7Task-VINEPERF-654:XTesting-ViNePerf Integration Enhancement - Kubernetes Will not implement due to limitations with CNIs.
Moved to Next release - if CNIs support this.

Need CNI to add  flows automatically in Switches (Userspace-CNI, supports DPDK, OVS, VPP). Major impediment to integrate with X-Testing

Sridhar will check with Xavier if ARP resolution is supported in Prox as a switch

TBD

8Task-VINEPERF-653:XTesting-ViNePerf Enhancement - Openstack [OLD]Done



9Epic-VINEPERF-638:Dataplane performance testing for various internal (within cloud) scenariosCompleted - Yet to submit the patches - Done

Danielle will submit patches after Thesis on Friday, Monday  04/25

After review tomorrow, Daniele will submit the patch.

Al Morton will need to review (big patch with documentation) - Done  

10Task-VINEPERF-643:Pod-Pod Communication



ACTION ITEMS

  • No labels