Revealing asymmetry on midplane of proto-planetary disc through modelling of axisymmetric emission: methodology

Masataka Aizawa,1,2,3 Takayuki Muto,4,5,6 Munetake Momose3
1Tsung-Dao Lee Institute, Shanghai Jiao Tong University, Shengrong Road 520, 201210 Shanghai, P. R. China
2 RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
3College of Science, Ibaraki University, 2-1-1 Bunkyo, Mito, Ibaraki 310-8512, Japan
4Division of Liberal Arts, Kogakuin University, 1-24-2 Nishi-Shinjyuku, Shinjyuku-ku, Tokyo 163-8677, Japan
5Leiden Observatory, Leiden University, P.O. Box 9513, NL-2300 RA Leiden, The Netherlands
6Department of Earth and Planetary Sciences, Tokyo Institute of Technology, 2-12-1 Oh-okayama, Meguro-ku, Tokyo 152-8551, Japan
E-mail: masataka.aizawa.sw35@vc.ibaraki.ac.jp
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

This study proposes an analytical framework for deriving the surface brightness profile and geometry of a geometrically-thin axisymmetric disc from interferometric observation of continuum emission. Such precise modelling facilitates the exploration of faint non-axisymmetric structures, such as spirals and circumplanetary discs. As a demonstration, we simulate interferometric observations of geometrically-thin axisymmetric discs. The proposed method can reasonably recover the injected axisymmetric structures, whereas Gaussian fitting of the same data yielded larger errors in disc orientation estimation. To further test the applicability of the method, it was applied to the mock data for m=1,2𝑚12m=1,2italic_m = 1 , 2 spirals and a point source, which are embedded in a bright axisymmetric structure. The injected non-axisymmetric structures were reasonably recovered except for the innermost parts, and the disc geometric parameter estimations were better than Gasussian fitting. The method was then applied to the real data of Elias 20 and AS 209, and it adequately subtracted the axisymmetric component, notably in Elias 20, where substantial residuals remained without our method. We also applied our method to continuum data of PDS 70 to demonstrate the effectiveness of the method. We successfully recovered emission from PDS 70 c consistently with previous studies, and also tentatively discovered new substructures. The current formulation can be applied to any data for disc continuum emission, and aids in the search of spirals and circumplanetary discs, whose detection is still limited.

keywords:
radio continuum: planetary systems –methods: data analysis –protoplanetary discs
pubyear: 2024pagerange: Revealing asymmetry on midplane of proto-planetary disc through modelling of axisymmetric emission: methodology23

1 Introduction

High-resolution imaging of proto-planetary discs by the Atacama Large Millimeter/submillimeter Array (ALMA) has revolutionized our view of planet formation. The spatial resolutions, down to 0.01similar-toabsent0.01\sim 0.01\arcsec∼ 0.01 ″, have enabled the identification of rings and gap structures in the disc continuum emission (ALMA Partnership et al., 2015; Andrews et al., 2018; Huang et al., 2018a). The mechanisms for invoking such annular structures have been under debate, and multiple explanations have been provided, such as a disc-planet interaction (Lin & Papaloizou, 1979; Goldreich & Tremaine, 1980; Dong et al., 2018), snowline (Zhang et al., 2015), sintering (Okuzumi et al., 2016), secular gravitational instability (Takahashi & Inutsuka, 2014), and non-ideal MHD effects (Flock et al., 2015). Nevertheless, the disc-planet interaction is independently supported by recent discoveries of kinematic signals of embedded planets (Pinte et al., 2018, 2020, 2023). Here, the signals, usually referred to as velocity “kinks”, appear as localized deviations from the Keplerian motion of discs in velocity maps, and there have been dozens of such detections.

In contrast to multiple detections of rings, gaps, and kinks, the direct detection of embedded planets and the circumplanetary discs within gaps remains limited. Currently, there are a few cases presenting the robust detection of embedded planets in gaps, such as, PDS 70 (Keppler et al., 2018; Wagner et al., 2018; Benisty et al., 2021), AB Aur (Currie et al., 2022), HD 169142 (Reggiani et al., 2014; Hammond et al., 2023), and MWC 758 (Wagner et al., 2023). These embedded planets have been somewhat selectively identified around (pre-)transitional discs with the ages greater than 4 Myr. In addition, Andrews et al. (2021) searched for circumplanetary discs in gaps of DSHARP discs (Andrews et al., 2018), whose typical age is 1 Myr; however, no compelling case remains.

Another supportive evidence for the planetary hypothesis can be obtained from a planetary spiral, which is excited by planetary gravity. However, despite the discovery of many spirals in discs (Muto et al., 2012; Grady et al., 2013; Huang et al., 2018b), only the spirals in systems with detected embedded planets are highly likely to be produced by the planetary gravity. The other spirals can also originate from non-planetary mechanisms, for example, gravity from stellar companions, gravitational instability or stellar flyby. Indeed, the spiral in HD 100453 A is a notable example, where its M-dwarf companion is thought to be the driver for the structure (Wagner et al., 2015; Dong et al., 2016).

Recently, Speedie & Dong (2022) searched for planetary spirals in 10 discs with kinematic signatures of embedded planets; however, they did not find any conclusive case. The limited detection of circumplanetary discs and spirals might indicate that the current sensitivities are still inadequate for most of the cases, or certain annular structures may have been caused via non-planetary mechanisms.

Nevertheless, an observational effort to search for circumplanetary discs and spirals is still essential for testing the planetary hypothesis. However, one obstacle to their detection is that these signals appear as small perturbations to the bright disc emission. This is similar to the situation of searching for a needle in a haystack. Thus, the precise modelling of the background disc emission is necessary to subtract its contribution.

One reasonable approximation for the background emission is an axisymmetric disc. However, the modelling of a vertically extended axisymmetric disc is difficult because of the geometric effect (e.g., shadows) and additional parameters to be solved (e.g., radial disc heights). Such geometric effects are significant at optical to infrared wavelengths, because of their sensitivity to vertically extended small dust grains. On the other hand, the disc emission at the radio band is well approximated by a geometrically thin disc model with no vertical structure owing to its sensitivity to large grains, which are more settled to the disc midplane. This approximation is to a certain extent justified by the observations of many clear concentric structures of discs in radio continuum emissions (e.g., ALMA Partnership et al., 2015; Andrews et al., 2018).

Recently, assuming a geometrically thin and axisymmetric disc, Jennings et al. (2020) proposed an algorithm for deriving a radial surface brightness profile of continuum emission in the radio band. Their method can resolve annular structures finer than the standard imaging technique, CLEAN, using simulated and real data (Jennings et al., 2020, 2022a, 2022b). Using their method, Andrews et al. (2021) also searched for circumplanetary disc emission from DSHARP data by subtracting the axisymmetric structures of discs.

Although the method proposed in Jennings et al. (2020) can reasonably recover the brightness profile of the disc, there is still room for improvement. Specifically, in their modelling, the geometric parameters of the axisymmetric model, such as an orientation and a central position of a disc, and hyperparameters for the Gaussian Process model that is used to prevent overfitting must be fixed. This limitation can be problematic, because a slight change in geometric parameters of the disc model, for example, several mas in a central position, can introduce the apparent non-axisymmetric structures in the image with their axisymmetric structure being subtracted (Andrews et al., 2021). Such false structures can be degenerate with the real structures, and may disrupt the detection of spirals and circumplanetary discs. Till date, in the frameworks proposed for axisymmetric disc models, these geometric parameters or hyperparameters for regularization have been manually tuned (Jennings et al., 2020; Andrews et al., 2021), or estimated through parameterized models (Zhang et al., 2015; Tazzari et al., 2017; Jennings et al., 2020; Kanagawa et al., 2021) and image-based analysis (Huang et al., 2018a).

Therefore, this study proposed an analytical framework to derive all of the parameters for a geometrically-thin axisymmetric disc and hyperparameters for the Gaussian Process kernel from observations assuming a geometrically thin disc. This study employed the formulation for the inverse modelling presented in Kawahara & Masuda (2020), which recovered a planetary surface map and its spin and orbital geometry from planetary reflection light. Interestingly, the problem considered in that study is mathematically similar to the current problem. Thus, we can apply their methodology for the current problem. We demonstrate the feasibility of the current method via its application to mock and real data. In addition, we perform injection and recovery experiments for circumplanetary discs and spirals, and discuss the applicability and limitations of the method.

The remainder of this paper is organized as follows. In Sec 2, the method for estimating an axisymmetric structure from visibilities is formulated. In Sec 3, the methodology for studying non-axisymmetric features in a residual image is discussed. In Sec 4, the feasibility of the proposed method for mock observations is investigated. In Sec 5, circumplanetary discs and spirals are injected to mock data, and the ability to recover the structures is examined. In Sec 6, our method was also applied to the real data for Elias 20 and AS 209. Sec 7 presents an analysis of the PDS 70 data, demonstrating the recovery of circumplanetary emission. Finally, the conclusions and future improvements are presented in Sec 8.

2 Analytical formulation for parameters for a geometrically thin axisymmetric disc

2.1 Model

We model a geometrically thin axisymmetric disc, whose parameters are composed of the brightness profile 𝒂𝒂\bm{a}bold_italic_a and its geometry 𝒈𝒈\bm{g}bold_italic_g. Here, 𝒂={I(rk)}𝒂𝐼subscript𝑟𝑘\bm{a}=\{I(r_{k})\}bold_italic_a = { italic_I ( italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) } for k=1,2,,N𝑘12𝑁k=1,2,...,Nitalic_k = 1 , 2 , … , italic_N is a vector obtained by discretizing the radial surface brightness profile I(r)𝐼𝑟I(r)italic_I ( italic_r ), and rksubscript𝑟𝑘r_{k}italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the k𝑘kitalic_k-th collocation point of the Fourier-Bessel series:

rk=Routj0kj0N,subscript𝑟𝑘subscript𝑅outsubscript𝑗0𝑘subscript𝑗0𝑁\displaystyle r_{k}=R_{\rm out}\frac{j_{0k}}{j_{0N}},italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT divide start_ARG italic_j start_POSTSUBSCRIPT 0 italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_j start_POSTSUBSCRIPT 0 italic_N end_POSTSUBSCRIPT end_ARG , (1)

where Routsubscript𝑅outR_{\rm out}italic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT is the outer boundary of I(r)𝐼𝑟I(r)italic_I ( italic_r ) with I(r>Rout)=0𝐼𝑟subscript𝑅out0I(r>R_{\rm out})=0italic_I ( italic_r > italic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) = 0, and j0ksubscript𝑗0𝑘j_{0k}italic_j start_POSTSUBSCRIPT 0 italic_k end_POSTSUBSCRIPT is the k𝑘kitalic_k-th root of the zero-order Bessel function of the first kind J0(r)subscript𝐽0𝑟J_{0}(r)italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_r ); J0(j0k)=0subscript𝐽0subscript𝑗0𝑘0J_{0}(j_{0k})=0italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_j start_POSTSUBSCRIPT 0 italic_k end_POSTSUBSCRIPT ) = 0.

The disc geometry is specified by the disc orientation and the central position: 𝒈=(Δxcen,Δycen,cosi,PA)𝒈Δsubscript𝑥cenΔsubscript𝑦cen𝑖PA\bm{g}=(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA})bold_italic_g = ( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA ). Here, the disc centre (Δxcen,Δycen)Δsubscript𝑥cenΔsubscript𝑦cen(\Delta x_{\rm cen},\Delta y_{\rm cen})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) is assumed to be shifted from a phase centre of the observation, and the disc orientation is specified by the position angle PA and inclination i𝑖iitalic_i. PA is defined as the angle of the major axis measured counter-clockwise from the north direction, and i𝑖iitalic_i is defined as the angle between the line of sight and the axis normal to the disc plane. Further, the formulation adopts cosi𝑖\cos iroman_cos italic_i, which represents the aspect ratio for the disc ellipse.

2.2 Visibilities for axisymmetric disc

2.2.1 Face-on disc

As the simplest problem, we consider the face-on disc with no positional offset from the phase centre; 𝒈=(Δxcen=0,Δycen=0,cosi=1,PA=0)𝒈formulae-sequenceΔsubscript𝑥cen0formulae-sequenceΔsubscript𝑦cen0formulae-sequence𝑖1PA0\bm{g}=(\Delta x_{\rm cen}=0\arcsec,\Delta y_{\rm cen}=0\arcsec,\cos i=1,{\rm PA% }=0)bold_italic_g = ( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 0 ″ , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 0 ″ , roman_cos italic_i = 1 , roman_PA = 0 ). Visibilities for an azimuthally symmetric geometrically thin disc with a brightness profile I(r)𝐼𝑟I(r)italic_I ( italic_r ) are expressed as the Hankel transformation (Thompson et al., 2017; Jennings et al., 2020):

V(u,v)𝑉𝑢𝑣\displaystyle V(u,v)italic_V ( italic_u , italic_v ) =\displaystyle== I(x,y)exp(2πj(ux+vy))𝒅𝒙superscriptsubscriptsuperscriptsubscript𝐼𝑥𝑦2𝜋𝑗𝑢𝑥𝑣𝑦differential-d𝒙\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I(x,y)\exp(-2\pi j(% ux+vy))\bm{dx}∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I ( italic_x , italic_y ) roman_exp ( - 2 italic_π italic_j ( italic_u italic_x + italic_v italic_y ) ) bold_italic_d bold_italic_x (2)
=\displaystyle== 2π0J0(2πrqcosi=1)I(r)r𝑑r,2𝜋superscriptsubscript0subscript𝐽02𝜋𝑟subscript𝑞𝑖1𝐼𝑟𝑟differential-d𝑟\displaystyle 2\pi\int_{0}^{\infty}J_{0}(2\pi rq_{\cos i=1})I(r)rdr,2 italic_π ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 2 italic_π italic_r italic_q start_POSTSUBSCRIPT roman_cos italic_i = 1 end_POSTSUBSCRIPT ) italic_I ( italic_r ) italic_r italic_d italic_r , (3)

where V(u,v)𝑉𝑢𝑣V(u,v)italic_V ( italic_u , italic_v ) is the complex visibility at a spatial frequency (u,v)𝑢𝑣(u,v)( italic_u , italic_v ), qcosi=1=u2+v2subscript𝑞𝑖1superscript𝑢2superscript𝑣2q_{\cos i=1}=\sqrt{u^{2}+v^{2}}italic_q start_POSTSUBSCRIPT roman_cos italic_i = 1 end_POSTSUBSCRIPT = square-root start_ARG italic_u start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG is the deprojected spatial frequency for the face-on disc, and J0subscript𝐽0J_{0}italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the zero-order Bessel function of the first kind.

We assume M𝑀Mitalic_M observational spatial frequencies {𝒖,𝒗}j=(uj,vj)subscript𝒖𝒗𝑗subscript𝑢𝑗subscript𝑣𝑗\{\bm{u},\bm{v}\}_{j}=(u_{j},v_{j}){ bold_italic_u , bold_italic_v } start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ), where j=1,2,,M𝑗12𝑀j=1,2,...,Mitalic_j = 1 , 2 , … , italic_M. At these spatial frequencies, we compute model visibilities, denoted by (𝑽={Vj(uj,vj)}(\bm{V}=\{V_{j}(u_{j},v_{j})\}( bold_italic_V = { italic_V start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) }. Using equation (3) and assuming 𝒂𝒂\bm{a}bold_italic_a, the model visibilities 𝑽𝑽\bm{V}bold_italic_V are expressed as follows (e.g., Jennings et al., 2020):

𝑽=𝑯𝒂,𝑽𝑯𝒂\displaystyle\bm{V}=\bm{H}\bm{a},bold_italic_V = bold_italic_H bold_italic_a , (4)

where the matrix for the Hankel transformation 𝑯𝑯\bm{H}bold_italic_H is expressed as follows:

{𝑯}j,ksubscript𝑯𝑗𝑘\displaystyle\{\bm{H}\}_{j,k}{ bold_italic_H } start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT =\displaystyle== 4πRout2j0(N+1)2J12(j0k)J0(2πqj,cosi=1rk),4𝜋superscriptsubscript𝑅out2superscriptsubscript𝑗0𝑁12superscriptsubscript𝐽12subscript𝑗0𝑘subscript𝐽02𝜋subscript𝑞𝑗𝑖1subscript𝑟𝑘\displaystyle\frac{4\pi R_{\rm out}^{2}}{j_{0(N+1)}^{2}J_{1}^{2}(j_{0k})}J_{0}% \left(2\pi q_{j,\cos i=1}r_{k}\right),divide start_ARG 4 italic_π italic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_j start_POSTSUBSCRIPT 0 ( italic_N + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_j start_POSTSUBSCRIPT 0 italic_k end_POSTSUBSCRIPT ) end_ARG italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 2 italic_π italic_q start_POSTSUBSCRIPT italic_j , roman_cos italic_i = 1 end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , (5)
qj,cosi=1subscript𝑞𝑗𝑖1\displaystyle q_{j,\cos i=1}italic_q start_POSTSUBSCRIPT italic_j , roman_cos italic_i = 1 end_POSTSUBSCRIPT =\displaystyle== uj2+vj2,superscriptsubscript𝑢𝑗2superscriptsubscript𝑣𝑗2\displaystyle\sqrt{u_{j}^{2}+v_{j}^{2}},square-root start_ARG italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (6)

where J1subscript𝐽1J_{1}italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the first-order Bessel function of the first kind.

2.2.2 Inclined disc

Refer to caption
Figure 1: Schematic of a disc in observed (a) and deprojected frames (b). The disc is inclined with cosi𝑖\cos iroman_cos italic_i and oriented by PA with respect to the x𝑥xitalic_x-axis in the projected frame. The origin of (x,y)𝑥𝑦(x,y)( italic_x , italic_y ) is set to be the phase centre, and (x,y)superscript𝑥superscript𝑦(x^{\prime},y^{\prime})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) is shifted from (x,y)𝑥𝑦(x,y)( italic_x , italic_y ) by (Δxcen,Δycen)Δsubscript𝑥cenΔsubscript𝑦cen(\Delta x_{\rm cen},\Delta y_{\rm cen})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ). (x′′,y′′)superscript𝑥′′superscript𝑦′′(x^{\prime\prime},y^{\prime\prime})( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) is the coordinate for the deprojected frame, and the brightness profile is assumed to be only radially dependent on I′′(r′′)superscript𝐼′′superscript𝑟′′I^{\prime\prime}(r^{\prime\prime})italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ).

The computations of model visibilities can be simply extended to the case of the inclined disc. For the calculation, we prepare three different coordinates (x,y)𝑥𝑦(x,y)( italic_x , italic_y ), (x,y)superscript𝑥superscript𝑦(x^{\prime},y^{\prime})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), and (x′′,y′′)superscript𝑥′′superscript𝑦′′(x^{\prime\prime},y^{\prime\prime})( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ); Fig. 1 shows a schematic of the coordinates and disc. Here, (x,y)𝑥𝑦(x,y)( italic_x , italic_y ) is the observational coordinate system with the phase centre being the origin, (x,y)superscript𝑥superscript𝑦(x^{\prime},y^{\prime})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) is the shifted coordinate system with the disc centre being the origin, and (x′′,y′′)superscript𝑥′′superscript𝑦′′(x^{\prime\prime},y^{\prime\prime})( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) is the deprojected coordinate system according to (PA, i𝑖iitalic_i). (x,y)𝑥𝑦(x,y)( italic_x , italic_y ) and (x,y)superscript𝑥superscript𝑦(x^{\prime},y^{\prime})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) are related as (x,y)=(x+Δxcen,y+Δycen)𝑥𝑦superscript𝑥Δsubscript𝑥censuperscript𝑦Δsubscript𝑦cen(x,y)=(x^{\prime}+\Delta x_{\rm cen},y^{\prime}+\Delta y_{\rm cen})( italic_x , italic_y ) = ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ), and (x,y)superscript𝑥superscript𝑦(x^{\prime},y^{\prime})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) and (x′′,y′′)superscript𝑥′′superscript𝑦′′(x^{\prime\prime},y^{\prime\prime})( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) are converted as follows:

(x′′y′′)=(1/cosi001)(cos(PA)sin(PA)sin(PA)cos(PA))(xy)superscript𝑥′′superscript𝑦′′1𝑖001PAPAPAPAsuperscript𝑥superscript𝑦\displaystyle\left(\begin{array}[]{c}x^{\prime\prime}\\ y^{\prime\prime}\end{array}\right)=\left(\begin{array}[]{cc}1/\cos i&0\\ 0&1\end{array}\right)\left(\begin{array}[]{cc}\cos({\rm PA})&-\sin({\rm PA})\\ \sin({\rm PA})&\cos({\rm PA})\end{array}\right)\left(\begin{array}[]{c}x^{% \prime}\\ y^{\prime}\end{array}\right)( start_ARRAY start_ROW start_CELL italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ) = ( start_ARRAY start_ROW start_CELL 1 / roman_cos italic_i end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ) ( start_ARRAY start_ROW start_CELL roman_cos ( roman_PA ) end_CELL start_CELL - roman_sin ( roman_PA ) end_CELL end_ROW start_ROW start_CELL roman_sin ( roman_PA ) end_CELL start_CELL roman_cos ( roman_PA ) end_CELL end_ROW end_ARRAY ) ( start_ARRAY start_ROW start_CELL italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ) (15)

The brightness profiles for the coordinates I(x,y)𝐼𝑥𝑦I(x,y)italic_I ( italic_x , italic_y ), I(x,y)superscript𝐼superscript𝑥superscript𝑦I^{\prime}(x^{\prime},y^{\prime})italic_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), and I′′(x′′,y′′)superscript𝐼′′superscript𝑥′′superscript𝑦′′I^{\prime\prime}(x^{\prime\prime},y^{\prime\prime})italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) are assumed to be invariant to coordinate transformations:

I(x,y)=I(x,y)=I′′(x′′,y′′).𝐼𝑥𝑦superscript𝐼superscript𝑥superscript𝑦superscript𝐼′′superscript𝑥′′superscript𝑦′′\displaystyle I(x,y)=I^{\prime}(x^{\prime},y^{\prime})=I^{\prime\prime}(x^{% \prime\prime},y^{\prime\prime}).italic_I ( italic_x , italic_y ) = italic_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) . (16)

This assumption is a mathematical hypothesis for modeling purposes, and it is not necessarily consistent with the physical picture of what would actually occur if the disc were observed from the face-on view111Nevertheless, the assumption is physically consistent when the disc is a geometrically-thin, optically thick disc..

Assuming the brightness profile in a deprojected frame I′′(r′′)superscript𝐼′′superscript𝑟′′I^{\prime\prime}(r^{\prime\prime})italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) with r′′x′′2+y′′2superscript𝑟′′superscript𝑥′′2superscript𝑦′′2r^{\prime\prime}\equiv\sqrt{x^{\prime\prime 2}+y^{\prime\prime 2}}italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ≡ square-root start_ARG italic_x start_POSTSUPERSCRIPT ′ ′ 2 end_POSTSUPERSCRIPT + italic_y start_POSTSUPERSCRIPT ′ ′ 2 end_POSTSUPERSCRIPT end_ARG, we aimed to compute the model visibilities. First, we transform equation (2) as follows:

V(u,v)𝑉𝑢𝑣\displaystyle V(u,v)italic_V ( italic_u , italic_v ) =\displaystyle== I(x,y)exp(2πj(ux+vy))𝒅𝒙superscriptsubscriptsuperscriptsubscript𝐼𝑥𝑦2𝜋𝑗𝑢𝑥𝑣𝑦differential-d𝒙\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I(x,y)\exp(-2\pi j(% ux+vy))\bm{dx}∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I ( italic_x , italic_y ) roman_exp ( - 2 italic_π italic_j ( italic_u italic_x + italic_v italic_y ) ) bold_italic_d bold_italic_x (17)
=\displaystyle== exp(2πj(uΔxcen+vΔycen))2𝜋𝑗𝑢Δsubscript𝑥cen𝑣Δsubscript𝑦cen\displaystyle\exp(-2\pi j(u\Delta x_{\rm cen}+v\Delta y_{\rm cen}))roman_exp ( - 2 italic_π italic_j ( italic_u roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT + italic_v roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) )
I(x,y)exp(2πj(ux+vy))𝒅𝒙,superscriptsubscriptsuperscriptsubscriptsuperscript𝐼superscript𝑥superscript𝑦2𝜋𝑗𝑢superscript𝑥𝑣superscript𝑦differential-dsuperscript𝒙bold-′\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I^{\prime}(x^{% \prime},y^{\prime})\exp(-2\pi j(ux^{\prime}+vy^{\prime}))\bm{dx^{\prime}},∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_exp ( - 2 italic_π italic_j ( italic_u italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_v italic_y start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) bold_italic_d bold_italic_x start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT ,
=\displaystyle== exp(2πj(uΔxcen+vΔycen))2𝜋𝑗𝑢Δsubscript𝑥cen𝑣Δsubscript𝑦cen\displaystyle\exp(-2\pi j(u\Delta x_{\rm cen}+v\Delta y_{\rm cen}))roman_exp ( - 2 italic_π italic_j ( italic_u roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT + italic_v roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) )
I′′(x′′,y′′)exp(2πj(ux′′+vy′′))superscriptsubscriptsuperscriptsubscriptsuperscript𝐼′′superscript𝑥′′superscript𝑦′′2𝜋𝑗superscript𝑢superscript𝑥′′superscript𝑣superscript𝑦′′\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I^{\prime\prime}(x^% {\prime\prime},y^{\prime\prime})\exp(-2\pi j(u^{\prime}x^{\prime\prime}+v^{% \prime}y^{\prime\prime}))∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) roman_exp ( - 2 italic_π italic_j ( italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT + italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) )
|cosi|𝒅𝒙′′,𝑖𝒅superscript𝒙bold-′′\displaystyle|\cos i|\bm{dx^{\prime\prime}},| roman_cos italic_i | bold_italic_d bold_italic_x start_POSTSUPERSCRIPT bold_′ bold_′ end_POSTSUPERSCRIPT ,

where we define (u,v)superscript𝑢superscript𝑣(u^{\prime},v^{\prime})( italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) as follows:

(uv)=(cosi001)(cos(PA)sin(PA)sin(PA)cos(PA))(uv).superscript𝑢superscript𝑣𝑖001PAPAPAPA𝑢missing-subexpression𝑣missing-subexpression\displaystyle\left(\begin{array}[]{c}u^{\prime}\\ v^{\prime}\end{array}\right)=\left(\begin{array}[]{cc}\cos i&0\\ 0&1\end{array}\right)\left(\begin{array}[]{cc}\cos({\rm PA})&-\sin({\rm PA})\\ \sin({\rm PA})&\cos({\rm PA})\end{array}\right)\left(\begin{array}[]{cc}u\\ v\end{array}\right).( start_ARRAY start_ROW start_CELL italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ) = ( start_ARRAY start_ROW start_CELL roman_cos italic_i end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ) ( start_ARRAY start_ROW start_CELL roman_cos ( roman_PA ) end_CELL start_CELL - roman_sin ( roman_PA ) end_CELL end_ROW start_ROW start_CELL roman_sin ( roman_PA ) end_CELL start_CELL roman_cos ( roman_PA ) end_CELL end_ROW end_ARRAY ) ( start_ARRAY start_ROW start_CELL italic_u end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_v end_CELL start_CELL end_CELL end_ROW end_ARRAY ) . (26)

Using equation (3), equation (17) is reduced as follows:

V(u,v)𝑉𝑢𝑣\displaystyle V(u,v)italic_V ( italic_u , italic_v ) =2π|cosi|exp(2πj(uΔxcen+vΔycen))absent2𝜋𝑖2𝜋𝑗𝑢Δsubscript𝑥cen𝑣Δsubscript𝑦cen\displaystyle=2\pi|\cos i|\exp(-2\pi j(u\Delta x_{\rm cen}+v\Delta y_{\rm cen}))= 2 italic_π | roman_cos italic_i | roman_exp ( - 2 italic_π italic_j ( italic_u roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT + italic_v roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) ) (27)
×0J0(2πr′′q)I′′(r′′)r′′dr′′,\displaystyle\times\displaystyle\int_{0}^{\infty}J_{0}(2\pi r^{\prime\prime}q)% I^{\prime\prime}(r^{\prime\prime})r^{\prime\prime}dr^{\prime\prime},× ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 2 italic_π italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_q ) italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_d italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ,

where we define a deprojected radial spatial frequency q𝑞qitalic_q as follows:

q=u2+v2.𝑞superscript𝑢2superscript𝑣2\displaystyle q=\sqrt{u^{\prime 2}+v^{\prime 2}}.italic_q = square-root start_ARG italic_u start_POSTSUPERSCRIPT ′ 2 end_POSTSUPERSCRIPT + italic_v start_POSTSUPERSCRIPT ′ 2 end_POSTSUPERSCRIPT end_ARG . (28)

Note that qcosi=1subscript𝑞𝑖1q_{\cos i=1}italic_q start_POSTSUBSCRIPT roman_cos italic_i = 1 end_POSTSUBSCRIPT is the special case of q𝑞qitalic_q with (cosi,PA)=(1,0)𝑖PA10(\cos i,{\rm PA})=(1,0)( roman_cos italic_i , roman_PA ) = ( 1 , 0 ). In the calculation, we assume an axisymmetric disc, whose brightness profile with I′′(x′′,y′′)=I′′(r′′)superscript𝐼′′superscript𝑥′′superscript𝑦′′superscript𝐼′′superscript𝑟′′I^{\prime\prime}(x^{\prime\prime},y^{\prime\prime})=I^{\prime\prime}(r^{\prime% \prime})italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) = italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ), and we discretize I′′(r′′)superscript𝐼′′superscript𝑟′′I^{\prime\prime}(r^{\prime\prime})italic_I start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) using 𝒂𝒂\bm{a}bold_italic_a in the same manner as that presented in Sec 2.2.1.

For 𝑽realsubscript𝑽real\bm{V}_{\rm real}bold_italic_V start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT and 𝑽imagsubscript𝑽imag\bm{V}_{\rm imag}bold_italic_V start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT, which are defined as real and imaginary parts of 𝑽𝑽\bm{V}bold_italic_V, respectively, we derive a linear equation as follows:

(𝑽real𝑽imag)=(𝑪real𝑯𝑪imag𝑯)𝒂,subscript𝑽realsubscript𝑽imagsubscript𝑪realsuperscript𝑯bold-′missing-subexpressionsubscript𝑪imagsuperscript𝑯bold-′missing-subexpression𝒂\displaystyle\left(\begin{array}[]{c}\bm{V}_{\rm real}\\ \bm{V}_{\rm imag}\end{array}\right)=\left(\begin{array}[]{cc}\bm{C}_{\rm real}% \bm{H^{\prime}}\\ \bm{C}_{\rm imag}\bm{H^{\prime}}\\ \end{array}\right)\bm{a},( start_ARRAY start_ROW start_CELL bold_italic_V start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_italic_V start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ) = ( start_ARRAY start_ROW start_CELL bold_italic_C start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_italic_C start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL end_ROW end_ARRAY ) bold_italic_a , (33)

where 𝑯superscript𝑯bold-′\bm{H^{\prime}}bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT, 𝑪realsubscript𝑪real\bm{C}_{\rm real}bold_italic_C start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT, and 𝑪imagsubscript𝑪imag\bm{C}_{\rm imag}bold_italic_C start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT are defined as follows:

{𝑯}j,ksubscriptsuperscript𝑯bold-′𝑗𝑘\displaystyle\{\bm{H^{\prime}}\}_{j,k}{ bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT =\displaystyle== 4πRout2j0(N+1)2J12(j0k)J0(2πqjrk),4𝜋superscriptsubscript𝑅out2superscriptsubscript𝑗0𝑁12superscriptsubscript𝐽12subscript𝑗0𝑘subscript𝐽02𝜋subscript𝑞𝑗subscript𝑟𝑘\displaystyle\displaystyle\frac{4\pi R_{\rm out}^{2}}{j_{0(N+1)}^{2}J_{1}^{2}(% j_{0k})}J_{0}\left(2\pi q_{j}r_{k}\right),divide start_ARG 4 italic_π italic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_j start_POSTSUBSCRIPT 0 ( italic_N + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_j start_POSTSUBSCRIPT 0 italic_k end_POSTSUBSCRIPT ) end_ARG italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 2 italic_π italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , (34)
{𝑪real}j,ksubscriptsubscript𝑪real𝑗𝑘\displaystyle\{\bm{C}_{\rm real}\}_{j,k}{ bold_italic_C start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT =\displaystyle== |cosi|cos(2π(Δxcenuj+Δycenvj))δj,k,𝑖2𝜋Δsubscript𝑥censubscript𝑢𝑗Δsubscript𝑦censubscript𝑣𝑗subscript𝛿𝑗𝑘\displaystyle|\cos i|\cos(-2\pi(\Delta x_{\rm cen}u_{j}+\Delta y_{\rm cen}v_{j% }))\delta_{j,k},| roman_cos italic_i | roman_cos ( - 2 italic_π ( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ) italic_δ start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT ,
{𝑪imag}j,ksubscriptsubscript𝑪imag𝑗𝑘\displaystyle\{\bm{C}_{\rm imag}\}_{j,k}{ bold_italic_C start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT =\displaystyle== |cosi|sin(2π(Δxcenuj+Δycenvj))δj,k.𝑖2𝜋Δsubscript𝑥censubscript𝑢𝑗Δsubscript𝑦censubscript𝑣𝑗subscript𝛿𝑗𝑘\displaystyle|\cos i|\sin(-2\pi(\Delta x_{\rm cen}u_{j}+\Delta y_{\rm cen}v_{j% }))\delta_{j,k}.| roman_cos italic_i | roman_sin ( - 2 italic_π ( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ) italic_δ start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT .

2.3 Inverse modelling

On the basis of the forward modelling in Sec 2.2, we consider inverse modelling for brightness profile 𝒂𝒂\bm{a}bold_italic_a and geometric parameters 𝒈=(Δxcen,Δycen,cosi\bm{g}=(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos ibold_italic_g = ( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i, PA) from visibilities.

2.3.1 Data with Gaussian noise

Let us assume that there are M𝑀Mitalic_M visibilities 𝒅obs={dobs,i(ui,vi)}subscript𝒅obssubscript𝑑obs𝑖subscript𝑢𝑖subscript𝑣𝑖\bm{d}_{\rm obs}=\{d_{{\rm obs},i}(u_{i},v_{i})\}bold_italic_d start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT = { italic_d start_POSTSUBSCRIPT roman_obs , italic_i end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } (i=1,2,,M𝑖12𝑀i=1,2,...,Mitalic_i = 1 , 2 , … , italic_M), with the noise of each visibility data obtained via the standard deviation of 𝝈obs={σobs,i}subscript𝝈obssubscript𝜎obs𝑖\bm{\sigma}_{\rm obs}=\{\sigma_{{\rm obs},i}\}bold_italic_σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT = { italic_σ start_POSTSUBSCRIPT roman_obs , italic_i end_POSTSUBSCRIPT }. We separate the real and imaginary parts of the visibilities as 𝒅real𝔢(𝒅obs)subscript𝒅real𝔢subscript𝒅obs\bm{d}_{\rm real}\equiv\mathfrak{Re}(\bm{d}_{\rm obs})bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT ≡ fraktur_R fraktur_e ( bold_italic_d start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT ) and 𝒅imag𝔪(𝒅obs)subscript𝒅imag𝔪subscript𝒅obs\bm{d}_{\rm imag}\equiv\mathfrak{Im}(\bm{d}_{\rm obs})bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT ≡ fraktur_I fraktur_m ( bold_italic_d start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT ), respectively. We assume that observational noises for 𝒅realsubscript𝒅real\bm{d}_{\rm real}bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT and 𝒅imagsubscript𝒅imag\bm{d}_{\rm imag}bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT obey the multivariate normal distribution. Here, the multivariate normal distribution for a N𝑁Nitalic_N-dimensional random vector 𝒙𝒙\bm{x}bold_italic_x is expressed as follows:

𝒩(𝒙|𝝁,𝚺)=1(2π)N/2|det𝚺|1/2exp(12(𝒙𝝁)TΣ1(𝒙𝝁)),𝒩conditional𝒙𝝁𝚺1superscript2𝜋𝑁2superscriptdet𝚺1212superscript𝒙𝝁𝑇superscriptΣ1𝒙𝝁\displaystyle\mathcal{N}(\bm{x}|\bm{\mu},\bm{\Sigma})=\frac{1}{(2\pi)^{N/2}|{% \rm det}\bm{\Sigma}|^{1/2}}\exp\left(-\frac{1}{2}(\bm{x}-\bm{\mu})^{T}\Sigma^{% -1}(\bm{x}-\bm{\mu})\right),caligraphic_N ( bold_italic_x | bold_italic_μ , bold_Σ ) = divide start_ARG 1 end_ARG start_ARG ( 2 italic_π ) start_POSTSUPERSCRIPT italic_N / 2 end_POSTSUPERSCRIPT | roman_det bold_Σ | start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( bold_italic_x - bold_italic_μ ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Σ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_italic_x - bold_italic_μ ) ) , (37)

where 𝝁𝝁\bm{\mu}bold_italic_μ is the mean vector and 𝚺𝚺\bm{\Sigma}bold_Σ is the covariance matrix. For 𝒅realsubscript𝒅real\bm{d}_{\rm real}bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT and 𝒅imagsubscript𝒅imag\bm{d}_{\rm imag}bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT, we assume the covariance matrix to be 𝚺=𝚺obs{δi,j/σi2}𝚺subscript𝚺obssubscript𝛿𝑖𝑗superscriptsubscript𝜎𝑖2\bm{\Sigma}={\bm{\Sigma}_{\rm obs}}\equiv\{\delta_{i,j}/\sigma_{i}^{2}\}bold_Σ = bold_Σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT ≡ { italic_δ start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT / italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT }.

2.3.2 Gaussian prior on brightness profile

In interferometric observations, a brightness profile 𝒂𝒂\bm{a}bold_italic_a is generally under-constrained. If it is directly obtained by solving a linear equation, for example, equation (4), the solution can diverge, leading to overfitting. This can be avoided by introducing regularization in optimization. Specifically, we assume the Gaussian Process kernel with a set of hyperparameters 𝜽=(α,γ)𝜽𝛼𝛾\bm{\theta}=(\alpha,\gamma)bold_italic_θ = ( italic_α , italic_γ ) on 𝒂𝒂\bm{a}bold_italic_a as follows:

p(𝒂|𝜽)𝑝conditional𝒂𝜽\displaystyle p(\bm{a}|\bm{\theta})italic_p ( bold_italic_a | bold_italic_θ ) =\displaystyle== 𝒩(𝒂|𝟎,𝚺a(𝜽)),𝒩conditional𝒂0subscript𝚺𝑎𝜽\displaystyle\mathcal{N}(\bm{a}|\bm{0},\bm{\Sigma}_{a}(\bm{\theta})),caligraphic_N ( bold_italic_a | bold_0 , bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ) ) , (38)

where we adopt the radial basis function (RBF) kernel 𝚺a(𝜽)subscript𝚺𝑎𝜽\bm{\Sigma}_{a}(\bm{\theta})bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ), which is expressed as;

(𝚺a(𝜽))j,ksubscriptsubscript𝚺𝑎𝜽𝑗𝑘\displaystyle(\bm{\Sigma}_{a}(\bm{\theta}))_{j,k}( bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ) ) start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT =\displaystyle== αkRBF(|rjrk|;γ)+αϵδjk,𝛼subscript𝑘𝑅𝐵𝐹subscript𝑟𝑗subscript𝑟𝑘𝛾𝛼italic-ϵsubscript𝛿𝑗𝑘\displaystyle\alpha k_{RBF}(|r_{j}-r_{k}|;\gamma)+\alpha\epsilon\delta_{jk},italic_α italic_k start_POSTSUBSCRIPT italic_R italic_B italic_F end_POSTSUBSCRIPT ( | italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ; italic_γ ) + italic_α italic_ϵ italic_δ start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT , (39)
kRBF(|rjrk|;γ)subscript𝑘𝑅𝐵𝐹subscript𝑟𝑗subscript𝑟𝑘𝛾\displaystyle k_{RBF}(|r_{j}-r_{k}|;\gamma)italic_k start_POSTSUBSCRIPT italic_R italic_B italic_F end_POSTSUBSCRIPT ( | italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ; italic_γ ) =\displaystyle== exp(rjrk22γ2),superscriptnormsubscript𝑟𝑗subscript𝑟𝑘22superscript𝛾2\displaystyle\displaystyle\exp\left(-\frac{||r_{j}-r_{k}||^{2}}{2\gamma^{2}}% \right),roman_exp ( - divide start_ARG | | italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) , (40)

where α𝛼\alphaitalic_α determines the relative weight of the prior information, and γ𝛾\gammaitalic_γ determines the spatial scale for regularization. The second term in Eq (39) is the small identity matrix, which stabilizes the computation of the inverse matrix Σa(θ)subscriptΣ𝑎𝜃\Sigma_{a}(\theta)roman_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( italic_θ ) (Kawahara et al., 2022). We adopted ϵ=108italic-ϵsuperscript108\epsilon=10^{-8}italic_ϵ = 10 start_POSTSUPERSCRIPT - 8 end_POSTSUPERSCRIPT in this paper.

This prior tends to promote smooth solutions, and can aid in preventing overfitting. A previous study by Jennings et al. (2020) adopted parameters for controlling the Gaussian Process prior on powers of the brightness profiles in the frequency domain. However, in this study, we adopted the parameters for regulating the profile in the spatial domain to render the discussion simpler.

2.3.3 Likelihood function for geometry and hyperparameters

Following Kawahara & Masuda (2020), we derive the likelihood function p(𝒅|𝒈,𝜽)𝑝conditional𝒅𝒈𝜽p(\bm{d}|\bm{g},\bm{\theta})italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ). We briefly overview their formulation and apply it to the current problem. For further details, see equations (14)-(21) and Appendix A.1-A.3 in the previous study.

We introduce the data vector 𝒅𝒅\bm{d}bold_italic_d as follows:

𝒅=(𝒅real𝒅imag)𝒅subscript𝒅realsubscript𝒅imag\displaystyle\bm{d}=\left(\begin{array}[]{c}\bm{d}_{\rm real}\\ \bm{d}_{\rm imag}\end{array}\right)bold_italic_d = ( start_ARRAY start_ROW start_CELL bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ) (43)

Recalling Bayes’ theorem, we obtain

p(𝒂|𝒅,𝒈,𝜽)p(𝒅|𝒈,𝜽)=p(𝒅|𝒂,𝒈,𝜽)p(𝒂|𝒈,𝜽).𝑝conditional𝒂𝒅𝒈𝜽𝑝conditional𝒅𝒈𝜽𝑝conditional𝒅𝒂𝒈𝜽𝑝conditional𝒂𝒈𝜽\displaystyle p(\bm{a}|\bm{d},\bm{g},\bm{\theta})p(\bm{d}|\bm{g},\bm{\theta})=% p(\bm{d}|\bm{a},\bm{g},\bm{\theta})p(\bm{a}|\bm{g},\bm{\theta}).italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) = italic_p ( bold_italic_d | bold_italic_a , bold_italic_g , bold_italic_θ ) italic_p ( bold_italic_a | bold_italic_g , bold_italic_θ ) . (44)

This can be rewritten as follows:

p(𝒅|𝒈,𝜽)=p(𝒅|𝒂,𝒈,𝜽)p(𝒂|𝒈,𝜽)p(𝒂|𝒅,𝒈,𝜽)=p(𝒅|𝒂,𝒈)p(𝒂|𝜽)p(𝒂|𝒅,𝒈,𝜽),𝑝conditional𝒅𝒈𝜽𝑝conditional𝒅𝒂𝒈𝜽𝑝conditional𝒂𝒈𝜽𝑝conditional𝒂𝒅𝒈𝜽𝑝conditional𝒅𝒂𝒈𝑝conditional𝒂𝜽𝑝conditional𝒂𝒅𝒈𝜽\displaystyle p(\bm{d}|\bm{g},\bm{\theta})=\frac{p(\bm{d}|\bm{a},\bm{g},\bm{% \theta})p(\bm{a}|\bm{g},\bm{\theta})}{p(\bm{a}|\bm{d},\bm{g},\bm{\theta})}=% \frac{p(\bm{d}|\bm{a},\bm{g})p(\bm{a}|\bm{\theta})}{p(\bm{a}|\bm{d},\bm{g},\bm% {\theta})},italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) = divide start_ARG italic_p ( bold_italic_d | bold_italic_a , bold_italic_g , bold_italic_θ ) italic_p ( bold_italic_a | bold_italic_g , bold_italic_θ ) end_ARG start_ARG italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) end_ARG = divide start_ARG italic_p ( bold_italic_d | bold_italic_a , bold_italic_g ) italic_p ( bold_italic_a | bold_italic_θ ) end_ARG start_ARG italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) end_ARG , (45)

where we use p(𝒅|𝒂,𝒈,𝜽)=p(𝒅|𝒂,𝒈)𝑝conditional𝒅𝒂𝒈𝜽𝑝conditional𝒅𝒂𝒈p(\bm{d}|\bm{a},\bm{g},\bm{\theta})=p(\bm{d}|\bm{a},\bm{g})italic_p ( bold_italic_d | bold_italic_a , bold_italic_g , bold_italic_θ ) = italic_p ( bold_italic_d | bold_italic_a , bold_italic_g ) and p(𝒂|𝒈,𝜽)=p(𝒂|𝜽)𝑝conditional𝒂𝒈𝜽𝑝conditional𝒂𝜽p(\bm{a}|\bm{g},\bm{\theta})=p(\bm{a}|\bm{\theta})italic_p ( bold_italic_a | bold_italic_g , bold_italic_θ ) = italic_p ( bold_italic_a | bold_italic_θ ).

In the equation, p(𝒅|𝒂,𝒈)𝑝conditional𝒅𝒂𝒈p(\bm{d}|\bm{a},\bm{g})italic_p ( bold_italic_d | bold_italic_a , bold_italic_g ) is calculated using equation (33):

p(𝒅|𝒂,𝒈)𝑝conditional𝒅𝒂𝒈\displaystyle p(\bm{d}|\bm{a},\bm{g})italic_p ( bold_italic_d | bold_italic_a , bold_italic_g ) =\displaystyle== p(𝒅real|𝒂,𝒈)p(𝒅imag|𝒂,𝒈)𝑝conditionalsubscript𝒅real𝒂𝒈𝑝conditionalsubscript𝒅imag𝒂𝒈\displaystyle p(\bm{d}_{\rm real}|\bm{a},\bm{g})p(\bm{d}_{\rm imag}|\bm{a},\bm% {g})italic_p ( bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT | bold_italic_a , bold_italic_g ) italic_p ( bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT | bold_italic_a , bold_italic_g ) (46)
=\displaystyle== 𝒩(𝒅real|𝑪real𝑯𝒂,𝚺obs)𝒩(𝒅imag|𝑪imag𝑯𝒂,𝚺obs)𝒩conditionalsubscript𝒅realsubscript𝑪realsuperscript𝑯bold-′𝒂subscript𝚺obs𝒩conditionalsubscript𝒅imagsubscript𝑪imagsuperscript𝑯bold-′𝒂subscript𝚺obs\displaystyle\mathcal{N}(\bm{d}_{\rm real}|\bm{C}_{\rm real}\bm{H^{\prime}}\bm% {a},{\bm{\Sigma}_{\rm obs}})\mathcal{N}(\bm{d}_{\rm imag}|\bm{C}_{\rm imag}\bm% {H^{\prime}}\bm{a},{\bm{\Sigma}_{\rm obs}})caligraphic_N ( bold_italic_d start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT | bold_italic_C start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT bold_italic_a , bold_Σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT ) caligraphic_N ( bold_italic_d start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT | bold_italic_C start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT bold_italic_a , bold_Σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT )
=\displaystyle== 𝒩(𝒅|𝑯¯𝒂,𝚺¯d),𝒩conditional𝒅¯𝑯𝒂subscript¯𝚺𝑑\displaystyle\mathcal{N}(\bm{d}|\bar{\bm{H}}\bm{a},\bar{\bm{\Sigma}}_{d}),caligraphic_N ( bold_italic_d | over¯ start_ARG bold_italic_H end_ARG bold_italic_a , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ,

where 𝚺¯¯𝚺\bar{\bm{\Sigma}}over¯ start_ARG bold_Σ end_ARG and 𝑯¯¯𝑯\bar{\bm{H}}over¯ start_ARG bold_italic_H end_ARG are defined as follows:

𝑯¯¯𝑯\displaystyle\bar{\bm{H}}over¯ start_ARG bold_italic_H end_ARG =(𝑪real𝑯𝑪imag𝑯),absentsubscript𝑪realsuperscript𝑯bold-′subscript𝑪imagsuperscript𝑯bold-′\displaystyle=\left(\begin{array}[]{r}\bm{C}_{\rm real}\bm{H^{\prime}}\\ \bm{C}_{\rm imag}\bm{H^{\prime}}\\ \end{array}\right),= ( start_ARRAY start_ROW start_CELL bold_italic_C start_POSTSUBSCRIPT roman_real end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL bold_italic_C start_POSTSUBSCRIPT roman_imag end_POSTSUBSCRIPT bold_italic_H start_POSTSUPERSCRIPT bold_′ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ) , (49)
𝚺¯dsubscript¯𝚺𝑑\displaystyle\bar{\bm{\Sigma}}_{d}over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT =(𝚺obs𝑶𝑶𝚺obs).absentsubscript𝚺obs𝑶𝑶subscript𝚺obs\displaystyle=\left(\begin{array}[]{rr}{\bm{\Sigma}_{\rm obs}}&\bm{O}\\ \bm{O}&{\bm{\Sigma}_{\rm obs}}\\ \end{array}\right).= ( start_ARRAY start_ROW start_CELL bold_Σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT end_CELL start_CELL bold_italic_O end_CELL end_ROW start_ROW start_CELL bold_italic_O end_CELL start_CELL bold_Σ start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ) . (52)

Further, using equations (38) and (46), we obtain

p(𝒂|𝒅,𝒈,𝜽)p(𝒅|𝒂,𝒈)p(𝒂|𝜽)=𝒩(𝒅|𝑯¯𝒂,𝚺¯d)𝒩(𝒂|𝟎,𝚺a(𝜽)).proportional-to𝑝conditional𝒂𝒅𝒈𝜽𝑝conditional𝒅𝒂𝒈𝑝conditional𝒂𝜽𝒩conditional𝒅¯𝑯𝒂subscript¯𝚺𝑑𝒩conditional𝒂0subscript𝚺𝑎𝜽\displaystyle p(\bm{a}|\bm{d},\bm{g},\bm{\theta})\propto p(\bm{d}|\bm{a},\bm{g% })p(\bm{a}|\bm{\theta})=\mathcal{N}(\bm{d}|\bar{\bm{H}}\bm{a},\bar{\bm{\Sigma}% }_{d})\mathcal{N}(\bm{a}|\bm{0},\bm{\Sigma}_{a}(\bm{\theta})).italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) ∝ italic_p ( bold_italic_d | bold_italic_a , bold_italic_g ) italic_p ( bold_italic_a | bold_italic_θ ) = caligraphic_N ( bold_italic_d | over¯ start_ARG bold_italic_H end_ARG bold_italic_a , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) caligraphic_N ( bold_italic_a | bold_0 , bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ) ) . (53)

As 𝒂𝒂\bm{a}bold_italic_a appears as a quadratic form in p(𝒂|𝒅,𝒈,𝜽))p(\bm{a}|\bm{d},\bm{g},\bm{\theta}))italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) ) in the exponent, we can obtain p(𝒂|𝒅,𝒈,𝜽)𝑝conditional𝒂𝒅𝒈𝜽p(\bm{a}|\bm{d},\bm{g},\bm{\theta})italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) as follows:

p(𝒂|𝒅,𝒈,𝜽)𝑝conditional𝒂𝒅𝒈𝜽\displaystyle p(\bm{a}|\bm{d},\bm{g},\bm{\theta})italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) =𝒩(𝒂|𝝁¯a|d,𝚺¯a|d),absent𝒩conditional𝒂subscript¯𝝁conditional𝑎𝑑subscript¯𝚺conditional𝑎𝑑\displaystyle=\mathcal{N}(\bm{a}|\bar{\bm{\mu}}_{a|d},\bar{\bm{\Sigma}}_{a|d}),= caligraphic_N ( bold_italic_a | over¯ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT ) , (54)

where we define

𝝁¯a|dsubscript¯𝝁conditional𝑎𝑑\displaystyle\bar{\bm{\mu}}_{a|d}over¯ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT =\displaystyle== (𝑯¯T𝚺¯d1𝑯¯+𝚺a(𝜽)1)1𝑯¯T𝚺¯d1𝒅¯T,superscriptsuperscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1¯𝑯subscript𝚺𝑎superscript𝜽11superscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1superscript¯𝒅𝑇\displaystyle(\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bar{\bm{H}}+\bm{% \Sigma}_{a}(\bm{\theta})^{-1})^{-1}\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}% \bar{\bm{d}}^{T},( over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG + bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_d end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (55)
𝚺¯a|dsubscript¯𝚺conditional𝑎𝑑\displaystyle\bar{\bm{\Sigma}}_{a|d}over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT =\displaystyle== (𝑯¯T𝚺¯d1𝑯¯+𝚺a(𝜽)1)1.superscriptsuperscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1¯𝑯subscript𝚺𝑎superscript𝜽11\displaystyle(\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bar{\bm{H}}+\bm{% \Sigma}_{a}(\bm{\theta})^{-1})^{-1}.( over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG + bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_italic_θ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (56)

Using equations (45), (46), and (54), we obtain

p(𝒅|𝒈,𝜽)𝒩(𝒅|𝑯¯𝒂,𝚺¯d)/𝒩(𝒂|𝝁¯a|d,𝚺¯a|d).proportional-to𝑝conditional𝒅𝒈𝜽𝒩conditional𝒅¯𝑯𝒂subscript¯𝚺𝑑𝒩conditional𝒂subscript¯𝝁conditional𝑎𝑑subscript¯𝚺conditional𝑎𝑑\displaystyle p(\bm{d}|\bm{g},\bm{\theta})\propto\mathcal{N}(\bm{d}|\bar{\bm{H% }}\bm{a},\bar{\bm{\Sigma}}_{d})/\mathcal{N}(\bm{a}|\bar{\bm{\mu}}_{a|d},\bar{% \bm{\Sigma}}_{a|d}).italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) ∝ caligraphic_N ( bold_italic_d | over¯ start_ARG bold_italic_H end_ARG bold_italic_a , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) / caligraphic_N ( bold_italic_a | over¯ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT ) . (57)

Again, 𝒅𝒅\bm{d}bold_italic_d appears as a quadratic form in p(𝒅|𝒈,𝜽)𝑝conditional𝒅𝒈𝜽p(\bm{d}|\bm{g},\bm{\theta})italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) in its exponent. Thus, we can obtain the likelihood function p(𝒅|𝒈,𝜽)𝑝conditional𝒅𝒈𝜽p(\bm{d}|\bm{g},\bm{\theta})italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) as follows:

p(𝒅|𝒈,𝜽)=𝒩(𝒅|𝟎,𝚺¯d+𝑯¯𝚺a𝑯¯T).𝑝conditional𝒅𝒈𝜽𝒩conditional𝒅0subscript¯𝚺𝑑¯𝑯subscript𝚺𝑎superscript¯𝑯𝑇\displaystyle p(\bm{d}|\bm{g},\bm{\theta})=\mathcal{N}(\bm{d}|\bm{0},\bar{\bm{% \Sigma}}_{d}+\bar{\bm{H}}\bm{\Sigma}_{a}\bar{\bm{H}}^{T}).italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) = caligraphic_N ( bold_italic_d | bold_0 , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG bold_italic_H end_ARG bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) . (58)

2.3.4 Posterior distribution for geometry and hyperparameters

The posterior distribution for p(𝒈,𝜽|𝒅)𝑝𝒈conditional𝜽𝒅p(\bm{g},\bm{\theta}|\bm{d})italic_p ( bold_italic_g , bold_italic_θ | bold_italic_d ) is given by the Bayes’ theorem as follows:

p(𝒈,𝜽|𝒅)p(𝒅|𝒈,𝜽)p(𝒈,𝜽)=𝒩(𝒅|𝟎,𝚺¯d+𝑯¯𝚺a𝑯¯T)p(𝒈,𝜽),proportional-to𝑝𝒈conditional𝜽𝒅𝑝conditional𝒅𝒈𝜽𝑝𝒈𝜽𝒩conditional𝒅0subscript¯𝚺𝑑¯𝑯subscript𝚺𝑎superscript¯𝑯𝑇𝑝𝒈𝜽\displaystyle p(\bm{g},\bm{\theta}|\bm{d})\propto p(\bm{d}|\bm{g},\bm{\theta})% p(\bm{g},\bm{\theta})=\mathcal{N}(\bm{d}|\bm{0},\bar{\bm{\Sigma}}_{d}+\bar{\bm% {H}}\bm{\Sigma}_{a}\bar{\bm{H}}^{T})p(\bm{g},\bm{\theta}),italic_p ( bold_italic_g , bold_italic_θ | bold_italic_d ) ∝ italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) italic_p ( bold_italic_g , bold_italic_θ ) = caligraphic_N ( bold_italic_d | bold_0 , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG bold_italic_H end_ARG bold_Σ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) italic_p ( bold_italic_g , bold_italic_θ ) , (59)

Based on p(𝒈,𝜽|𝒅)𝑝𝒈conditional𝜽𝒅p(\bm{g},\bm{\theta}|\bm{d})italic_p ( bold_italic_g , bold_italic_θ | bold_italic_d ), we draw samples for (𝒈,𝜽)𝒈𝜽(\bm{g},\bm{\theta})( bold_italic_g , bold_italic_θ ) using an MCMC sampler. The prior distribution p(𝒈,𝜽)𝑝𝒈𝜽p(\bm{g},\bm{\theta})italic_p ( bold_italic_g , bold_italic_θ ) is assumed as follows. We assume uniform distributions 𝒰(0,1)𝒰01\mathcal{U}(0,1)caligraphic_U ( 0 , 1 ) and 𝒰(0,π)𝒰0𝜋\mathcal{U}(0,\pi)caligraphic_U ( 0 , italic_π ) for cosi𝑖\cos iroman_cos italic_i and PA, respectively. Further, we consider 𝒰(1,1)𝒰11\mathcal{U}(-1\arcsec,1\arcsec)caligraphic_U ( - 1 ″ , 1 ″ ) for ΔxcenΔsubscript𝑥cen\Delta x_{\rm cen}roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT and ΔycenΔsubscript𝑦cen\Delta y_{\rm cen}roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT. In addition, we assume the log-uniform prior from 104superscript10410^{-4}10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT to 104superscript10410^{4}10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT for α𝛼\alphaitalic_α and a uniform prior 𝒰(0.01,0.15)𝒰0.010.15\mathcal{U}(0.01\arcsec,0.15\arcsec)caligraphic_U ( 0.01 ″ , 0.15 ″ ) for γ𝛾\gammaitalic_γ. We checked that the likelihood is exceedingly small when α𝛼\alphaitalic_α and γ𝛾\gammaitalic_γ fall outside their respective prior ranges, thus confirming the sufficient broadness of the prior. As the computation for p(𝒅|𝒈,𝜽)𝑝conditional𝒅𝒈𝜽p(\bm{d}|\bm{g},\bm{\theta})italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) can be time-consuming, we transform the equation to a more efficient form (further details in Appendix A).

Moreover, the computation also relies on the number of data points. In Appendix B, we introduce and discuss the use of data binning to reduce the computation. Specifically, we prepare two-dimensional linear and log grids, apply them to the simulated visibilities, and compare the binning errors. The binning with the log grid was found to yield overall low errors. A more detailed discussion is presented in Appendix B. In the following analysis, we simply adopt the log grid for the analysis.

2.3.5 Posterior distribution for all of parameters

We can draw samples for {𝒂,𝒈,𝜽}𝒂𝒈𝜽\{\bm{a},\bm{g},\bm{\theta}\}{ bold_italic_a , bold_italic_g , bold_italic_θ } by using the conditional formula for the joint probability: p(𝒂,𝒈,𝜽|𝒅)=p(𝒂|𝒅,𝒈,𝜽)p(𝒈,𝜽|d)𝑝𝒂𝒈conditional𝜽𝒅𝑝conditional𝒂𝒅𝒈𝜽𝑝𝒈conditional𝜽𝑑p(\bm{a},\bm{g},\bm{\theta}|\bm{d})=p(\bm{a}|\bm{d},\bm{g},\bm{\theta})p(\bm{g% },\bm{\theta}|d)italic_p ( bold_italic_a , bold_italic_g , bold_italic_θ | bold_italic_d ) = italic_p ( bold_italic_a | bold_italic_d , bold_italic_g , bold_italic_θ ) italic_p ( bold_italic_g , bold_italic_θ | italic_d ). Specifically, we can draw a sample (𝒈,𝜽)(\bm{g}\dagger,\bm{\theta}\dagger)( bold_italic_g † , bold_italic_θ † ) from p(𝒈,𝜽|d)𝑝𝒈conditional𝜽𝑑p(\bm{g},\bm{\theta}|d)italic_p ( bold_italic_g , bold_italic_θ | italic_d ) as done in Sec 2.3.4, and subsequently take a sample for 𝒂𝒂\bm{a}bold_italic_a from p(𝒂|𝒅,𝒈,𝜽)p(\bm{a}|\bm{d},\bm{g}\dagger,\bm{\theta}\dagger)italic_p ( bold_italic_a | bold_italic_d , bold_italic_g † , bold_italic_θ † ) using equation (54). We can iterate this procedure to make samples, and this sampling is indeed equivalent to drawing sample from the joint distribution p(𝒂,𝒈,𝜽|𝒅)𝑝𝒂𝒈conditional𝜽𝒅p(\bm{a},\bm{g},\bm{\theta}|\bm{d})italic_p ( bold_italic_a , bold_italic_g , bold_italic_θ | bold_italic_d ). Using the samples for {𝒂,𝒈,𝜽}𝒂𝒈𝜽\{\bm{a},\bm{g},\bm{\theta}\}{ bold_italic_a , bold_italic_g , bold_italic_θ }, we can also compute statistics for parameters (e.g., mean and standard deviation for 𝒂𝒂\bm{a}bold_italic_a).

2.3.6 Difference in formulation between current study and frank

We here highlight the key differences between our approach and that of frank. Note that the direct comparison of the recovered profiles is also given in Section 6. While both methods vary in their regularization strategies, the distinct difference is that our approach solve all parameters, including brightness profiles, geometric parameters, and hyperparameters, whereas frank is limited to solving brightness profiles. In this sense, our formulation can be seen as a natural extension of that of frank.

One advantage of our method is that it can optimize geometric parmaeters and hyperparameters directly from the data, and thus it is less susceptible to human biases caused by manual tuning. Moreover, while the current paper is limited to the simplest model with single frequency and single source, the methodology itself can be easily extended to more complex problems with multi-frequency data or multiple sources. In such cases, manual tuning of optimal parameters can be both challenging and less reliable, making our approach more effective.

On the other hand, the disadvantage of our approach is that it is more computationally demanding than frank. This is because we attempted to solve all parameters, including non-linear parameters, rather than deriving only brightness profiles like frank. Furthermore, our current formulation cannot support imposing a non-negative condition on brightness profiles, a feature available in frank. This is because the analytical expression for the marginalization over 𝒂𝒂\bm{a}bold_italic_a becomes inapplicable under the non-negative condition. Although we can implement the condition by considering the same problem as frank, the current study prioritizes simultaneous fitting of all parameters, so we do not implement the function.

3 Extract asymmetric features

This section summarizes the process of extracting and quantifying the non-axisymmetric components from observations using the derived parameters in Sec 2.3. The method is applied to data analyses in Sec 4 and 5.

3.1 Making residual images in observed and deprojected frames

3.1.1 Making residual image

After subtracting the axisymmetric model from visibilities, we expect the residual visibilities to contain only non-axisymmetric components. Therefore, imaging using these residual visibilities can reveal the non-axisymmetric component of the disc (Jennings et al., 2020; Andrews et al., 2021). Based on this principle, we created images to study the non-axisymmetric components.

Drawing samples from a posterior distribution as described in Sec 2, we selected the maximum a posteriori probability (MAP) estimate for 𝒈𝒈\bm{g}bold_italic_g and 𝜽𝜽\bm{\theta}bold_italic_θ. Subsequently, we randomly drew a brightness profile using equation (54). For the chosen parameters and the brightness profile, we computed the model visibilities of an axisymmetric disc, and subtracted them from observed visibilities. Then, the phase centres of measurement sets were aligned with the disc centres using fixvis in CASA.

For the shifted and subtracted measurement set, we created images using the CLEAN algorithm, which is implemented as 𝚝𝚌𝚕𝚎𝚊𝚗𝚝𝚌𝚕𝚎𝚊𝚗{\tt tclean}typewriter_tclean in CASA. In this study, the pixel scale for the image was set as 6 mas, and the image size was 1000×1000100010001000\times 10001000 × 1000 pixels. In the imaging process, Although it is typical to adopt 0 iteration for CLEAN in making the residual map (e.g., Jennings et al., 2020), we found that employing non-zero iterations for CLEAN result in slight or moderate improvement in the quality of the residual image, especially when there remain substantial residuals. Thus, in this paper, we implemented thresholds with nsigma=3.5absent3.5=3.5= 3.5, which stops the iterations if the maximum residual on the image is below 3.53.53.53.5 times the root-mean-square error. Further, we adopted the Briggs weighting with a robust parameter of 0.5 and without UV-taper. Appendix C presents a discussion on the effects of changing robust parameters on the recovered residual images, and 0.5 was determined to be the balanced choice. The brightness of the output image from CASA was measured in Jy beam-1, where ’beam’ is the nominal area where the brightness is defined222In CASA, the synthesized beam is approximated using the Gaussian function, and its area is expressed as π4ln2bmajbmin𝜋4ln2subscript𝑏majsubscript𝑏min\frac{\pi}{4\rm{ln}2}\,b_{\rm maj}\,b_{\rm min}divide start_ARG italic_π end_ARG start_ARG 4 roman_l roman_n 2 end_ARG italic_b start_POSTSUBSCRIPT roman_maj end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, where (bmajbmin)subscript𝑏majsubscript𝑏min(b_{\rm maj}\,b_{\rm min})( italic_b start_POSTSUBSCRIPT roman_maj end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ) are the full widths at half maximum of the Gaussian in major and minor axes. . The standard deviation of the residual image was also measured outside the disc emission.

3.1.2 Deprojection of residual image

In addition to the residual image in an observational frame, we created the deprojected residual image using the disc geometry. Spatial frequencies of the shifted measurement set were converted using the geometric parameter, (u,v)(u,v)𝑢𝑣superscript𝑢superscript𝑣(u,v)\rightarrow(u^{\prime},v^{\prime})( italic_u , italic_v ) → ( italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) in equation (26). Consequently, the image was created in the same manner as Sec 3.1.1. This operation aligns the major axis of the image with the y𝑦yitalic_y-axis (north direction), and stretches the image along the x𝑥xitalic_x axis (east direction). This corresponds to the conversion in Figure 1 from a deprojected to a projected frame.

In the imaging process, we should be cautious that the transformation (u,v)(u,v)𝑢𝑣superscript𝑢superscript𝑣(u,v)\rightarrow(u^{\prime},v^{\prime})( italic_u , italic_v ) → ( italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) apparently decreases the brightness in Jy sr-1 by a factor of cosi𝑖\cos iroman_cos italic_i, while brightness in Jy beam-1 remains unchanged before and after deprojection. This is because the transformation increases the disc area (and beam) while the total flux is unchanged. Throughout the paper, we consistently presented the brightness of images in Jy beam-1, which remains constant, and we did not apply any correction by a factor of cosi𝑖\cos iroman_cos italic_i.

Although widely used, the current simple deprojection methods provide the true image of the disc viewed face-on only in the case of an infinitesimally thin disc. The other geometric effects such as the vertical structures or shadows cannot be incorporated with our simple operations.

3.2 Mode decomposition of residual image in polar direction

Spirals are often characterized by dominant modes in a polar direction. We can decompose I(r,ϕ)𝐼𝑟italic-ϕI(r,\phi)italic_I ( italic_r , italic_ϕ ) in a polar coordinate (r,ϕ)𝑟italic-ϕ(r,\phi)( italic_r , italic_ϕ ) as follows (e.g., Binney & Tremaine, 2008):

I(r,ϕ)=m=0Im(r)cos(m(ϕϕm(r))).𝐼𝑟italic-ϕsubscript𝑚0subscript𝐼𝑚𝑟𝑚italic-ϕsubscriptitalic-ϕ𝑚𝑟\displaystyle I(r,\phi)=\sum_{m=0}I_{m}(r)\cos(m(\phi-\phi_{m}(r))).italic_I ( italic_r , italic_ϕ ) = ∑ start_POSTSUBSCRIPT italic_m = 0 end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) roman_cos ( italic_m ( italic_ϕ - italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) ) ) . (60)

where Im(r)subscript𝐼𝑚𝑟I_{m}(r)italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) is the amplitude at the m𝑚mitalic_m-th mode, and ϕm(r)subscriptitalic-ϕ𝑚𝑟\phi_{m}(r)italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) is a phase shift for the m𝑚mitalic_m-th component. Note that I0(r)subscript𝐼0𝑟I_{0}(r)italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_r ) corresponds to the brightness profile. The amplitudes and phases can be derived from the residual images I(x,y)𝐼𝑥𝑦I(x,y)italic_I ( italic_x , italic_y ) as follows:

Im(r)subscript𝐼𝑚𝑟\displaystyle I_{m}(r)italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) =\displaystyle== 2Cm2+Sm2,2superscriptsubscript𝐶𝑚2superscriptsubscript𝑆𝑚2\displaystyle 2\sqrt{C_{m}^{2}+S_{m}^{2}},2 square-root start_ARG italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (61)
ϕm(r)subscriptitalic-ϕ𝑚𝑟\displaystyle\phi_{m}(r)italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) =\displaystyle== atan2(Sm,Cm),atan2subscript𝑆𝑚subscript𝐶𝑚\displaystyle{\rm atan2}(S_{m},C_{m}),atan2 ( italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , (62)

where atan2atan2{\rm atan2}atan2 is the 2-argument arctangent, and (Cm,Sm)subscript𝐶𝑚subscript𝑆𝑚(C_{m},S_{m})( italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) is defined as follows:

Cm12π02πI(r,ϕ)cos(mϕ)𝑑ϕ,subscript𝐶𝑚12𝜋superscriptsubscript02𝜋𝐼𝑟italic-ϕ𝑚italic-ϕdifferential-ditalic-ϕ\displaystyle C_{m}\equiv\frac{1}{2\pi}\int_{0}^{2\pi}I(r,\phi)\cos(m\phi)d\phi,italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ≡ divide start_ARG 1 end_ARG start_ARG 2 italic_π end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_π end_POSTSUPERSCRIPT italic_I ( italic_r , italic_ϕ ) roman_cos ( italic_m italic_ϕ ) italic_d italic_ϕ , (63)
Sm12π02πI(r,ϕ)sin(mϕ)𝑑ϕ.subscript𝑆𝑚12𝜋superscriptsubscript02𝜋𝐼𝑟italic-ϕ𝑚italic-ϕdifferential-ditalic-ϕ\displaystyle S_{m}\equiv\ \frac{1}{2\pi}\int_{0}^{2\pi}I(r,\phi)\sin(m\phi)d\phi.italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ≡ divide start_ARG 1 end_ARG start_ARG 2 italic_π end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_π end_POSTSUPERSCRIPT italic_I ( italic_r , italic_ϕ ) roman_sin ( italic_m italic_ϕ ) italic_d italic_ϕ . (64)

The values of (Cm,Sm)subscript𝐶𝑚subscript𝑆𝑚(C_{m},S_{m})( italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) can be computed for the real data. In this study, we computed them by interpolating a residual image.

3.3 Extraction of odd and even symmetric components of images

Odd-symmetric and even-symmetric components of an image can be decomposed by using either of each imaginary or real part of the data. We start with decomposing an image into even- and odd-symmetric components.

I(x,y)=Ieven(x,y)+Iodd(x,y),𝐼𝑥𝑦subscript𝐼even𝑥𝑦subscript𝐼odd𝑥𝑦\displaystyle I(x,y)=I_{\rm even}(x,y)+I_{\rm odd}(x,y),italic_I ( italic_x , italic_y ) = italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) + italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) , (65)

where we define

Ieven(x,y)subscript𝐼even𝑥𝑦\displaystyle\displaystyle I_{\rm even}(x,y)italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) =\displaystyle== 12(I(x,y)+I(x,y))12𝐼𝑥𝑦𝐼𝑥𝑦\displaystyle\frac{1}{2}(I(x,y)+I(-x,-y))divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_I ( italic_x , italic_y ) + italic_I ( - italic_x , - italic_y ) ) (66)
=\displaystyle== m=evenIm(r)cos(m(ϕϕm(r))),subscript𝑚evensubscript𝐼𝑚𝑟𝑚italic-ϕsubscriptitalic-ϕ𝑚𝑟\displaystyle\sum_{m={\rm even}}I_{m}(r)\cos(m(\phi-\phi_{m}(r))),∑ start_POSTSUBSCRIPT italic_m = roman_even end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) roman_cos ( italic_m ( italic_ϕ - italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) ) ) , (67)
Iodd(x,y)subscript𝐼odd𝑥𝑦\displaystyle I_{\rm odd}(x,y)italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) =\displaystyle== 12(I(x,y)I(x,y))12𝐼𝑥𝑦𝐼𝑥𝑦\displaystyle\frac{1}{2}(I(x,y)-I(-x,-y))divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_I ( italic_x , italic_y ) - italic_I ( - italic_x , - italic_y ) ) (68)
=\displaystyle== m=oddIm(r)cos(m(ϕϕm(r))),subscript𝑚oddsubscript𝐼𝑚𝑟𝑚italic-ϕsubscriptitalic-ϕ𝑚𝑟\displaystyle\sum_{m={\rm odd}}I_{m}(r)\cos(m(\phi-\phi_{m}(r))),∑ start_POSTSUBSCRIPT italic_m = roman_odd end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) roman_cos ( italic_m ( italic_ϕ - italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) ) ) , (69)

which satisfy Ieven(x,y)=Ieven(x,y)subscript𝐼even𝑥𝑦subscript𝐼even𝑥𝑦I_{\rm even}(x,y)=I_{\rm even}(-x,-y)italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) = italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( - italic_x , - italic_y ), and Iodd(x,y)=Iodd(x,y)subscript𝐼odd𝑥𝑦subscript𝐼odd𝑥𝑦I_{\rm odd}(x,y)=-I_{\rm odd}(-x,-y)italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) = - italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( - italic_x , - italic_y ). However, using equation (2), the real and imaginary parts are connected to Ieven(x,y)subscript𝐼even𝑥𝑦I_{\rm even}(x,y)italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) and Iodd(x,y)subscript𝐼odd𝑥𝑦I_{\rm odd}(x,y)italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) as follows:

𝔢(V(u,v))𝔢𝑉𝑢𝑣\displaystyle\displaystyle\mathfrak{Re}(V(u,v))fraktur_R fraktur_e ( italic_V ( italic_u , italic_v ) ) =\displaystyle== Ieven(x,y)exp(2πj(ux+vy))𝒅𝒙,superscriptsubscriptsuperscriptsubscriptsubscript𝐼even𝑥𝑦2𝜋𝑗𝑢𝑥𝑣𝑦differential-d𝒙\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I_{\rm even}(x,y)% \exp(-2\pi j(ux+vy))\bm{dx},∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) roman_exp ( - 2 italic_π italic_j ( italic_u italic_x + italic_v italic_y ) ) bold_italic_d bold_italic_x ,
𝔪(V(u,v))𝔪𝑉𝑢𝑣\displaystyle\mathfrak{Im}(V(u,v))fraktur_I fraktur_m ( italic_V ( italic_u , italic_v ) ) =\displaystyle== Iodd(x,y)exp(2πj(ux+vy))𝒅𝒙.superscriptsubscriptsuperscriptsubscriptsubscript𝐼odd𝑥𝑦2𝜋𝑗𝑢𝑥𝑣𝑦differential-d𝒙\displaystyle\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}I_{\rm odd}(x,y)% \exp(-2\pi j(ux+vy))\bm{dx}.∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) roman_exp ( - 2 italic_π italic_j ( italic_u italic_x + italic_v italic_y ) ) bold_italic_d bold_italic_x .

Thus, we can create Ieven(x,y)subscript𝐼even𝑥𝑦I_{\rm even}(x,y)italic_I start_POSTSUBSCRIPT roman_even end_POSTSUBSCRIPT ( italic_x , italic_y ) and Iodd(x,y)subscript𝐼odd𝑥𝑦I_{\rm odd}(x,y)italic_I start_POSTSUBSCRIPT roman_odd end_POSTSUBSCRIPT ( italic_x , italic_y ) using either the real or imaginary part of the data. Notably, previous studies have already used the imaginary part of the data to extract an odd-symmetric component (Hashimoto et al., 2021; Kanagawa et al., 2021).

4 Application to simulated data for geometrically thin axisymmetric disc

4.1 Injection and recovery test

As a test case, we applied our method to simulated data for an axisymmetric disc to examine the ability to recover input parameters. We used two radial profiles for our test. The continuum brightness profiles for AS 209 and WaOph 6 of DSHARP observations333https://bulk.cv.nrao.edu/almadata/lp/DSHARP(Andrews et al., 2018) were used. AS 209 is chosen as the most structured disc in a radial direction, and WaOph 6 is among m=2𝑚2m=2italic_m = 2 spiral discs in the DSHARP sample. Subsequently, we assume the model discs have the geometric parameters of (Δxcen,Δycen,cosi,PA,)=(0,0,0.75,45)(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA},)=(0\arcsec,0\arcsec,0.% 75,45^{\circ})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA , ) = ( 0 ″ , 0 ″ , 0.75 , 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT ). We normalize the brightness profiles to render the total fluxes of the AS 209 and WaOph 6 models as 0.288 and 0.161 Jy, respectively.

To produce mock data with realistic UV-coverage and noise, we incorporated them using the actual data of DSHARP observations. We downloaded the self-calibrated continuum measurement set for AS 209 and WaOph 6. The data were averaged at each spectral window to produce a single channel data. Thereafter, time averaging was applied for 30 s. A new measurement set was created using the averaged data to obtain the UV-coverage and the weight at each spatial frequency. The signals of the model disc at each spatial frequency were calculated via Fourier transforms of the model surface brightness distribution. Consequently, the Gaussian noises were added to the model visibility according to the the weights. During the analysis, we used 𝚝𝚋.𝚐𝚎𝚝𝚌𝚘𝚕formulae-sequence𝚝𝚋𝚐𝚎𝚝𝚌𝚘𝚕\tt{tb.getcol}typewriter_tb . typewriter_getcol in CASA to export the measurement sets to handle them numerically in python444We used 𝑡𝑏.𝑔𝑒𝑡𝑐𝑜𝑙formulae-sequence𝑡𝑏𝑔𝑒𝑡𝑐𝑜𝑙\it{tb.getcol}italic_tb . italic_getcol in CASA to export (u,v)𝑢𝑣(u,v)( italic_u , italic_v ) from measurement sets, but we found that signs of the outputted arrays for (u,v)𝑢𝑣(u,v)( italic_u , italic_v ) were reversed. Specifically, they have opposite signs with respect to those obtained with 𝑝𝑙𝑜𝑡𝑚𝑠𝑝𝑙𝑜𝑡𝑚𝑠\it{plotms}italic_plotms. In the analysis, we simply flipped signs of (u,v)𝑢𝑣(u,v)( italic_u , italic_v ) output from 𝚝𝚋.𝚐𝚎𝚝𝚌𝚘𝚕formulae-sequence𝚝𝚋𝚐𝚎𝚝𝚌𝚘𝚕\tt{tb.getcol}typewriter_tb . typewriter_getcol. .

The data weights recorded in the measurement sets were overestimated for the entire DSHARP data. Following Appendix E in Hashimoto et al. (2021), we deprojected the real and imaginary parts of visibilities, and compared the recorded standard deviations and the deviations of observed visibilities from the binned visibilities in the deprojected spatial frequencies. Consequently, we confirmed that the overestimation existed in both the real and imaginary parts of the visibilities, and the degree of overestimation was within 10similar-toabsent10\sim 10∼ 10% for both. Therefore, we manually reduced the recorded weights by 3.44 and 3.66 for AS 209 and WaOph 6, respectively.

To reduce the computational time, we binned the data using a logarithmically spaced grid with Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500, where the number of grid points was (2Nbin+3)2superscript2subscript𝑁bin32(2N_{\rm bin}+3)^{2}( 2 italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 3 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The boundaries of the grids are defined by the parameters (xmin,xmax)subscript𝑥minsubscript𝑥max(x_{\rm min},x_{\rm max})( italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ), which are introduced in Appendix B. We set (xmin,xmax)subscript𝑥minsubscript𝑥max(x_{\rm min},x_{\rm max})( italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) to (102λ,107λ)superscript102𝜆superscript107𝜆(10^{2}\lambda,10^{7}\lambda)( 10 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_λ , 10 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_λ ) in our analyses. This binning decreased the number of data points by an approximate factor of 10. Appendix B presents the details of data gridding using a log grid, and we show that the choice of Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500 is acceptable.

Using equation (59), we drew samples from the posterior distribution for (γ,log10α,Δxcen,Δycen,cosi,PA)𝛾subscript10𝛼Δsubscript𝑥cenΔsubscript𝑦cen𝑖PA(\gamma,\log_{10}\alpha,\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA})( italic_γ , roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_α , roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA ). In sampling, we used emcee, which implemented the affine-invariant ensemble sampler for Markov Chain Monte Carlo (Foreman-Mackey et al., 2013). The initial parameters for emcee were set to be close to the input parameters. Here, 16 walkers were prepared and then evolved for at least 8,000 steps. For the drawn samples, we discarded the initial 1,250 steps during the burn-in phase. The convergence is predominantly achieved within 2,000-4,000 steps according to the auto-correlation time analysis. Specifically, we have verified that all samples meet the condition N>50τ𝑁50𝜏N>50\tauitalic_N > 50 italic_τ, where N𝑁Nitalic_N represents the step size and τ𝜏\tauitalic_τ represents the integrated autocorrelation time for the chain555 https://emcee.readthedocs.io/en/stable/tutorials/autocorr/.

The total computational time for the single disc was approximately 12 hours using 8 cores. This represents a significantly higher computational demand in contrast to frank’s approach. This stems from our strategy to retrieve all parameters in a simultaneous manner, as opposed to frank’s approach of fixing parameters except for the brightness profiles.

Figs. 2 and 3 show the recovery and injection test results for the simulated data. The upper-left panels shows the posterior distribution of the parameters for the two simulated cases. All the geometric parameters were reasonably recovered by the current method.

The analyses resulted in two different length parameters; γ0.04similar-to-or-equals𝛾0.04\gamma\simeq 0.04\arcsecitalic_γ ≃ 0.04 ″ for the simulated case of AS 209, and γ0.1similar-to-or-equals𝛾0.1\gamma\simeq 0.1\arcsecitalic_γ ≃ 0.1 ″ for the case of WaOph 6. The differences in γ𝛾\gammaitalic_γ were largely due to the differences in the injected brightness profiles; the injected profile for WaOph 6 was smoother than that of AS 209. The UV-coverage would also affect the length scale, but it remains unclear in the current two cases. The length scale parameter for WaOph 6 was larger than the beam size (0.091,0.037)0.0910.037(0.091\arcsec,0.037\arcsec)( 0.091 ″ , 0.037 ″ ), whereas it was smaller than the beam size (0.076,0.040)0.0760.040(0.076\arcsec,0.040\arcsec)( 0.076 ″ , 0.040 ″ ) for AS 209. Note that the beam sizes reflect the UV-coverage, but they vary with assumed parameters in CLEAN; for example, robust parameter and UV-taper. In Appendix D, we further study the variations in γ𝛾\gammaitalic_γ by stretching the injected brightness profiles and UV-coverage. In conclusion, the optimal length scales γ𝛾\gammaitalic_γ appear to be determined by multiple factors, including the brightness profiles and UV-coverage (beam size).

The upper-right panels in Figs. 2 and 3 compare the injected brightness profiles with 10 samples for 𝒂𝒂\bm{a}bold_italic_a using the method presented in Sec 2.3.5. The radial profiles were also adequately recovered by the current method. However, the residuals for the brightness profiles were not perfectly consistent with the zero line for both cases, although an oscillating feature was observed in the radial direction. The oscillating length scales for the observed residuals indeed corresponded to the derived length scales γ𝛾\gammaitalic_γ, which determined the characteristic scales that could be resolved. Moreover, we also identified the large residuals at the very innermost parts owing to the steep changes in the fluxes in the radial direction. The oscillating features have been also reported in Jennings et al. (2020), which adopted regularization in the frequency domain, and such residuals might be inevitable in the Gaussian Process.

The lower-left panels in Figs. 2 and 3 show the simulated and model visibilities. The model visibilities were computed by choosing the MAP solution for the geometry and hyperparameters, and the brightness profile was randomly sampled. The residual visibilities are shown in the lower panels. The models were well consistent with the simulated visibilities from the low to high spatial frequencies. However, we observed that the powers of the model visibilities at high frequencies were forcibly suppressed. This cut-off scale was determined by the optimized length scale γ𝛾\gammaitalic_γ, which determined the scales below which the oscillating features appeared in the brightness profiles. At the low spatial frequencies, particularly at q<0.3𝑞0.3q<0.3italic_q < 0.3 Mλ𝜆\lambdaitalic_λ, we also found oscillating residuals with a length scale of 0.05similar-toabsent0.05\sim 0.05∼ 0.05 Mλ𝜆\lambdaitalic_λ. This length scale corresponded to the radius for the outer boundary Rout=2subscript𝑅out2R_{\rm out}=2\arcsecitalic_R start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT = 2 ″, which limited the model’s applicability to the large-scale structure.

In the lower-right panels in Figs. 2 and 3, we show the residual images, which are produced by the method presented in Sec 3.1. The residual images did not exhibit noticeable features, demonstrating that we reasonably estimated the injected parameters.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 2: Recovery and injection test for mock data of AS 209. (upper-left) Posterior distribution for (Δxcen,Δycen,cosi,PA,α,γ)Δsubscript𝑥cenΔsubscript𝑦cen𝑖PA𝛼𝛾(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA},\alpha,\gamma)( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA , italic_α , italic_γ ). Dotted lines indicate the input values. (upper-right) Injected brightness profile, denoted by the black line, and 10 samples drawn from the posterior distribution of brightness profiles, denoted by red lines. At the bottom, the residuals between injected and recovered models are shown. (lower-left) Simulated visibilities denoted by blue points and model visibilities denoted by an orange line. The visibilities are binned with a logarithmic grid with N=2000𝑁2000N=2000italic_N = 2000. In the lower panels, the residuals indicating the difference between the simulated and model visibilities are shown. (lower-right) Residual image produced with the MAP estimate in the observed frame. The synthesized beam size (0.076,0.040)0.0760.040(0.076\arcsec,0.040\arcsec)( 0.076 ″ , 0.040 ″ ) is shown at the bottom left.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 3: Recovery and injection test for mock data of WaOph 6 in Sec 4. The format of the figure is same as that of Figure 2. The synthesized beam size (0.091,0.037)0.0910.037(0.091\arcsec,0.037\arcsec)( 0.091 ″ , 0.037 ″ ) for the residual image is shown at the bottom left.

4.2 Residual images constructed from shifted geometric parameters

To demonstrate how incorrect determination of geometric parameters affects the resultant residual images, we shifted each geometric parameter before subtracting axisymmetric models from the visibilities. For the simulated case of AS 209, we shifted each geometric parameter from the MAP estimate as follows: Δxcen=5masΔsubscript𝑥cen5mas\Delta x_{\rm cen}=5\;{\rm mas}roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 5 roman_mas, Δycen=5masΔsubscript𝑦cen5mas\Delta y_{\rm cen}=5\;{\rm mas}roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 5 roman_mas, Δcosi=0.01Δ𝑖0.01\Delta\cos i=0.01roman_Δ roman_cos italic_i = 0.01, ΔPA=2degΔPA2deg\Delta{\rm PA}=2\;{\rm deg}roman_Δ roman_PA = 2 roman_deg. For the simulated case of WaOph 6, we shifted the parameters as follows: Δxcen=2masΔsubscript𝑥cen2mas\Delta x_{\rm cen}=2\;{\rm mas}roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 2 roman_mas, Δycen=2masΔsubscript𝑦cen2mas\Delta y_{\rm cen}=2\;{\rm mas}roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT = 2 roman_mas, Δcosi=0.03Δ𝑖0.03\Delta\cos i=0.03roman_Δ roman_cos italic_i = 0.03, ΔPA=3degΔPA3deg\Delta{\rm PA}=3\;{\rm deg}roman_Δ roman_PA = 3 roman_deg. To simulate the realistic situation as much as possible, we assume these incorrect geometric parameters with the hyperparamters from the MAP solution, and draw a brightness profile from the posterior distribution, which nevertheless gives the profile similar to our best-fit model. With the subtracted visibilities made with the new models, we created the residual images using the CLEAN following the method presented in Sec 3.1. We also created images from unsubtracted visibilities for comparison. For that, we adopted 0.04 mJy as the CLEAN threshold and used the multiscale CLEAN algorithm with scale parameters of [0, 30, 120, 360, 720, 1440] mas (Rau & Cornwell, 2011).

Fig. 4 and 5 show the unsubtracted and residual images with their geometric parameters being shifted. They are all shown in observed frames. The residual images based on the best-fitting models did not exhibit any noticeable structure. This demonstrates the feasibility of the current method. However, the residual images with shifted geometric parameters exhibited coherent patterns. These residual images obtained from the current study were overall consistent with those presented in Andrews et al. (2021), wherein the dependence of residual images was studied by shifting the geometric parameters. As shown in the figures, the difference in the injected brightness profiles changed the residual images. We highlight two outer rings for AS 209 by black dashed ellipses in Fig. 4, and we find that the ring locations indeed corresponded to the boundaries, where the signs of the residuals were flipped. However, the brightness profile for WaOph 6 was more continuous than AS 209, and the residual images were less structured.

Fig. 6 shows Im=1,2(r)subscript𝐼𝑚12𝑟I_{m=1,2}(r)italic_I start_POSTSUBSCRIPT italic_m = 1 , 2 end_POSTSUBSCRIPT ( italic_r ) for the residual images in the simulated case for WaOph6. The shifts in the central positions introduced residuals with the m=1𝑚1m=1italic_m = 1 component, whereas those in cosi𝑖\cos iroman_cos italic_i and PA introduced the residuals with m=2𝑚2m=2italic_m = 2 component. As discussed in the later sections, these residuals can be degenerate with real signals (e.g., spirals); thus they can introduce biases for estimating such non-axisymmetric structures in discs.

As a comparison with the proposed method, we also applied the Gaussian fitting to the visibilities to estimate the geometric parameters. Here, the Gaussian fitting of visibilities corresponds to the fitting of the Gaussian function on the image plane, and it is frequently used for estimating a geometry of a disc. The model was specified by six parameters, where four parameters, (Δxcen,Δycen,cosi,PA)Δsubscript𝑥cenΔsubscript𝑦cen𝑖PA(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA ) are common to our model. We implemented a posterior sampling for the Gaussian model using emcee (Foreman-Mackey et al., 2013).

The first and second data from the left in each panel of Fig. 7 show the injected and recovered geometric parameters obtained from the current method and the Gaussian fitting. The central positions were well recovered in both of cases. However, the parameters cosi𝑖\cos iroman_cos italic_i and PA obtained from the Gaussian fitting deviated from the injected values; Δcosi0.010.05similar-to-or-equalsΔ𝑖0.010.05\Delta\cos i\simeq 0.01-0.05roman_Δ roman_cos italic_i ≃ 0.01 - 0.05 and ΔPA15similar-to-or-equalsΔPA15\Delta{\rm PA}\simeq 1-5roman_Δ roman_PA ≃ 1 - 5 deg. These deviations can be problematic when searching for faint signals (e.g., circumplanetary discs and spirals) because they introduce fake non-axisymmetric structures in the residual images as shown in Figs. 4 and 5.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Unsubtracted image and residual images produced with incorrect geometric parameters in simulated case for AS 209. The standard deviation for the brightness σ𝜎\sigmaitalic_σ is computed in the region outside the disc. (upper-left panel) The image produced with CLEAN for unsubtracted data. (upper-middle panel) The residual image produced with residual visibilities produced from the MAP estimate. (other panels) The residual images with each of the geometric parameters being shifted. The synthesized beam size (0.076,0.040)0.0760.040(0.076\arcsec,0.040\arcsec)( 0.076 ″ , 0.040 ″ ) is shown at the bottom left.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: Same as Fig. 4 albeit for simulated data of WaOph 6. The synthesized beam size (0.091,0.037)0.0910.037(0.091\arcsec,0.037\arcsec)( 0.091 ″ , 0.037 ″ ) is shown at the bottom left.
Refer to caption
Refer to caption
Figure 6: Im=1,2(r)subscript𝐼𝑚12𝑟I_{m=1,2}(r)italic_I start_POSTSUBSCRIPT italic_m = 1 , 2 end_POSTSUBSCRIPT ( italic_r ) computed for residual images in Fig. 5. The left and right panels show the results for the shifted central position and the shifted disc orientation, respectively.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: Injected and recovered geometric parameters in simulations in Sec 4 and 5. Each panel compares the geometric parameter: the central position and the disc orientation. The geometry is estimated using the proposed method (black) and the Gaussian fitting (red). The injected parameters are indicated by the dashed blue lines.

5 Injection and recovery test for spirals and circumplanetary disc embedded into axisymmetric emission

As a simple extension to Sec 4, we considered an additional non-axisymmetric perturbation to the symmetric disc. We injected spirals and a point source into an axisymmetric model and simulated the visibilities incorporating noises. Subsequently, we applied our method to extract the axisymmetric model and construct residual images, which were then compared against the injected structures.

5.1 Axisymmetric structure +++ spirals

5.1.1 Injection of spirals

We injected spiral structures into an axisymmetric structure assuming the same observational setup as the simulated case for WaOph6. The brightness profile was assumed to be the same as the case for WaOph6. In this simulation, we assume zero brightness outside r=1.11𝑟1.11r=1.11\arcsecitalic_r = 1.11 ″, beyond which the brightness in the literature can become negative. Three different models were considered; an odd-symmetric spiral with m=1𝑚1m=1italic_m = 1 component, an even-symmetric spiral with m=2𝑚2m=2italic_m = 2 component, and the combined image for both spirals. Specifically, we assumed perturbations in the form of ΔI(m,r,ϕ)=ΔIm(r)cos(m(ϕϕm(r)))Δ𝐼𝑚𝑟italic-ϕΔsubscript𝐼𝑚𝑟𝑚italic-ϕsubscriptitalic-ϕ𝑚𝑟\Delta I(m,r,\phi)=\Delta I_{m}(r)\cos(m(\phi-\phi_{m}(r)))roman_Δ italic_I ( italic_m , italic_r , italic_ϕ ) = roman_Δ italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) roman_cos ( italic_m ( italic_ϕ - italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) ) ), where we adopted an Archimedean spiral ϕm(r)=ar+bsubscriptitalic-ϕ𝑚𝑟𝑎𝑟𝑏\phi_{m}(r)=ar+bitalic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) = italic_a italic_r + italic_b. We assumed (m,a,b)=(1,20,1)𝑚𝑎𝑏1201(m,a,b)=(1,20,1)( italic_m , italic_a , italic_b ) = ( 1 , 20 , 1 ) for the odd-symmetric spiral and (m,a,b)=(2,10,π/6)𝑚𝑎𝑏210𝜋6(m,a,b)=(2,-10,-\pi/6)( italic_m , italic_a , italic_b ) = ( 2 , - 10 , - italic_π / 6 ) for the even-symmetric mode. The amplitudes Im(r)subscript𝐼𝑚𝑟I_{m}(r)italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) were assumed as follows:

I1(r)subscript𝐼1𝑟\displaystyle I_{1}(r)italic_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r ) =\displaystyle== {5×105exp(((r0.4)/0.1)2)[Jybeam1]cases5superscript105superscript𝑟0.40.12missing-subexpressiondelimited-[]Jysuperscriptbeam1missing-subexpression\displaystyle\left\{\begin{array}[]{cc}\displaystyle 5\times 10^{-5}\exp(-((r-% 0.4\arcsec)/0.1\arcsec)^{2})\\ {\rm[Jy\;beam^{-1}]}\end{array}\right.{ start_ARRAY start_ROW start_CELL 5 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT roman_exp ( - ( ( italic_r - 0.4 ″ ) / 0.1 ″ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL [ roman_Jy roman_beam start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ] end_CELL start_CELL end_CELL end_ROW end_ARRAY (74)
I2(r)subscript𝐼2𝑟\displaystyle I_{2}(r)italic_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_r ) =\displaystyle== {5×105[Jybeam1](0<r<0.6)0(0.6<r),cases5superscript105delimited-[]Jysuperscriptbeam10𝑟0.600.6𝑟\displaystyle\left\{\begin{array}[]{cc}\displaystyle 5\times 10^{-5}\;\;{\rm[% Jy\;beam^{-1}]}&\;\;\;\;(0<r<0.6\arcsec)\\ 0&\;\;\;\;(0.6\arcsec<r)\\ \end{array}\right.,{ start_ARRAY start_ROW start_CELL 5 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT [ roman_Jy roman_beam start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ] end_CELL start_CELL ( 0 < italic_r < 0.6 ″ ) end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ( 0.6 ″ < italic_r ) end_CELL end_ROW end_ARRAY , (77)

where the beam area was assumed to be 0.00354[arcsec]20.00354superscriptdelimited-[]arcsec20.00354\;[{\rm arcsec}]^{2}0.00354 [ roman_arcsec ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For the calculation of model visibilities, we used functions for the non-uniform fast Fourier transform from PyNUFFT (Lin & Chung, 2017; Lin, 2018). We adopted the size of the image grid Nd=1024subscript𝑁𝑑1024N_{d}=1024italic_N start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = 1024, the size of the oversampled Fourier grid Kd=2048subscript𝐾𝑑2048K_{d}=2048italic_K start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = 2048, and the size of the interpolator Jd=6subscript𝐽𝑑6J_{d}=6italic_J start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = 6.

Fig. 8 shows the amplitudes of the injected brightness profile and spirals. The amplitudes of the injected spirals were at most 30similar-toabsent30\sim 30∼ 30 per cent of that of the axisymmetric part of the surface brightness, and they were comparable to the amplitudes of the observed spirals with m=2𝑚2m=2italic_m = 2 mode (e.g., WaOph6 and IM Lup) such that the current simulation indeed considered the realistic situation. The left panels in Fig. 9 show the injected images for the three cases.

Refer to caption
Figure 8: Amplitudes of injected brightness profile and spirals in simulations in Sec 5.

5.1.2 Comparison of injected and recovered models

After simulating the visibilities, we applied our method to estimate the axisymmetric structures. We selected the MAP estimate on geometric parameters, drew a brightness profile, and subtracted their contributions from the observations. The residual visibilities were then used to construct the residual images using CLEAN.

For the comparison, we also created the residual images by adopting geometric parameters estimated from the Gaussian fitting to the visibilities. Specifically, we assumed geometry with Gaussian fitting and hyperparameters from our method, and we drew a brightness profile. Subsequently, the brightness model was subtracted from the visibilities, and the residual visibilities were imaged with CLEAN.

The residual images are shown in Fig. 9. In the second column, we show the recovered residual images obtained via the current method. The last column shows the residual images created from the geometry given by the Gaussian fitting. The residual images from our method were reasonably consistent with the injected maps, whereas those based on the Gaussian fitting exhibited larger residuals. However, the innermost structures of the discs at r<0.20.3𝑟0.20.3r<0.2-0.3\arcsecitalic_r < 0.2 - 0.3 ″ were not perfectly recovered even with our method. Specifically, the excess in the amplitude of the spiral was observed at r<0.3𝑟0.3r<0.3\arcsecitalic_r < 0.3 ″ for the odd-symmetric spiral. Moreover, no clear spiral structure was observed at r<0.2𝑟0.2r<0.2\arcsecitalic_r < 0.2 ″ for the even-symmetric spiral. These inconsistencies arise from the biases in the estimated geometric parameters, as discussed next.

The comparison of geometric parameters for three cases is shown in Fig. 7. For every parameter, larger deviations from the injected parameters were observed in the case of the Gaussian modelling than that using the proposed method. However, our method was still affected with biases; (Δxcen,Δycen)Δsubscript𝑥cenΔsubscript𝑦cen(\Delta x_{\rm cen},\Delta y_{\rm cen})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) for the odd-symmetric spiral, and (PA,cosi)PA𝑖({\rm PA},\cos i)( roman_PA , roman_cos italic_i ) for the even-symmetric spiral. In the case of the combined spiral, the shifts in all the geometric parameters roughly corresponded to the summation of shifts in the odd and even-symmetric spirals. These shifts are reasonable because, as discussed in Sec 4, the shifts in (Δxcen,Δycen)Δsubscript𝑥cenΔsubscript𝑦cen(\Delta x_{\rm cen},\Delta y_{\rm cen})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) introduce the residuals with m=1𝑚1m=1italic_m = 1 mode, which can be degenerate with the injected odd-symmetric spiral. Similarly, the shifts in (cosi,PA)𝑖PA(\cos i,{\rm PA})( roman_cos italic_i , roman_PA ) introduce the residuals with m=2𝑚2m=2italic_m = 2 mode, which can be degenerate with the even-symmetric spiral. These degeneracies produce the biases in estimating geometric parameters. Consequently, these biases impede recovery of injected non-axisymmetric structures, as shown in Fig. 9.

The left panels in Fig. 10 compare the residual image with the injected odd-symmetric spiral. The phase for the spiral was reasonably recovered in the residual image. The upper right and lower panels show the recovered radial phases ϕ1(r)subscriptitalic-ϕ1𝑟\phi_{1}(r)italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r ) and the amplitudes I1(r)subscript𝐼1𝑟I_{1}(r)italic_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r ). The amplitudes and phases were reasonably recovered in the range of 0.3<r<0.50.3𝑟0.50.3\arcsec<r<0.5\arcsec0.3 ″ < italic_r < 0.5 ″, where the spiral amplitude was large. However, the amplitudes were clearly overestimated at inner radii r<0.3𝑟0.3r<0.3\arcsecitalic_r < 0.3 ″. This overestimation was due to the biased central position, and is consistent with the failure of the recovery of the spiral at the innermost part in Fig. 9.

The right panels in Fig. 10 show the comparison in the case of the even-symmetric spiral. The phases were reasonably recovered in most of the radial range; however, the deviations were large at the innermost part r<0.1𝑟0.1r<0.1\arcsecitalic_r < 0.1 ″. The amplitudes were well recovered at 0.3<r<0.60.3𝑟0.60.3\arcsec<r<0.6\arcsec0.3 ″ < italic_r < 0.6 ″, whereas they were underestimated in the inner regions. The deviations in the phases and amplitudes originated from the biases in the estimation of (cosi𝑖\cos iroman_cos italic_i, PA), which are associated with m=2𝑚2m=2italic_m = 2 feature.

Finally, Fig. 11 shows the residual images obtained with either real or imaginary parts of images. As evident, the imaging with only the real part successfully extracted the even-symmetric spiral, whereas that with only the imaginary part extracted the odd-symmetric spiral. This is consistent with the discussion in Sec 3.3. Notably, we successfully extracted odd- and even-symmetric spirals for the case of the combined image. Thus, in a realistic situation, where odd- and even-symmetric components are mixed, the imaging with either the real or imaginary part is indeed useful for separating them. In addition, the noise level decreased by a factor of 2similar-toabsent2\sim\sqrt{2}∼ square-root start_ARG 2 end_ARG, or the signal-to-noise ratio increased by a factor of 2similar-toabsent2\sim\sqrt{2}∼ square-root start_ARG 2 end_ARG because of the assumed symmetry I(x,y)=I(x,y)𝐼𝑥𝑦𝐼𝑥𝑦I(x,y)=I(-x,-y)italic_I ( italic_x , italic_y ) = italic_I ( - italic_x , - italic_y ) or I(x,y)=I(x,y)𝐼𝑥𝑦𝐼𝑥𝑦I(x,y)=-I(-x,-y)italic_I ( italic_x , italic_y ) = - italic_I ( - italic_x , - italic_y ). Thus, this method would be helpful in the search for faint signals.

Refer to caption
Refer to caption
Refer to caption
Figure 9: Injected and recovered residual images for spirals in deprojected frames; (top rows) Even-symmetric spiral (middle rows) Odd-symmetric spiral (bottom rows) Combined spirals. (left columns) Injected perturbations (middle columns) Recovered residual images based on the proposed method (right columns) Recovered residual images based on the geometry obtained with Gaussian fitting. The synthesized beam size for the deprojected image (0.11,0.041)0.110.041(0.11\arcsec,0.041\arcsec)( 0.11 ″ , 0.041 ″ ) is shown at the bottom left, and the beam size for the image before deprojection is (0.091,0.037)0.0910.037(0.091\arcsec,0.037\arcsec)( 0.091 ″ , 0.037 ″ ).
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 10: Phases and amplitudes for injected and recovered odd- (left) and even- (right) symmetric spirals in Sec 5. (upper panels) The residual images in deprojected frame overplotted with the locations of injected (black) and recovered phases (blue). (middle panels) Input and recovered phases in the radial direction. (lower panels) Input and recovered amplitudes of spirals.
Refer to caption
Refer to caption
Refer to caption
Figure 11: Demonstration of extraction of odd- and even-symmetric spirals with real and imaginary parts of the visibilities; (top rows) Residual images for even-symmetric spiral in deprojected frames (middle rows) Residual images for odd-symmetric spiral (bottom rows) Residual images for combined spirals (left columns) Injected spirals (middle column) Image produced with only the real part of the data (right column) Image produced with only the imaginary part of the data.

5.2 Axisymmetric structure +++ circumplanetary disc emission

We injected a point source to the simulated data assuming circumplanetary disc emission and tested the recovering ability. The flux for the point source was assumed to be 0.10.10.10.1 mJy, which is approximately 10 times more significant than the noise level but 10 times lower than the ambient emission at the inner disc. The planetary location was set to be at 0.5 \arcsec in the deprojected frame (x′′,y′′)=(0.5,0)superscript𝑥′′superscript𝑦′′0.50(x^{\prime\prime},y^{\prime\prime})=(0.5\arcsec,0)( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) = ( 0.5 ″ , 0 ), such that the source resided in a large gap of a disc. We considered the same brightness profile and geometric configuration as the simulated case for AS209 in Sec 4 and computed the visibilities and added noises to the data assuming the same observational setup.

Using our method, we similarly recovered the brightness profile and geometric parameters from the simulated data. With the MAP estimate of geometric parameters, we randomly drew one brightness profile and subtracted the model from the simulated visibilities. Fig. 12 shows the residual images, which are generated from the residual visibilities. The position of the injected point source was reasonably recovered, and the flux density was well recovered as well.

Moreover, the central position of the disc was slightly biased, suggesting that the point source might introduce another bias. To check this possibility, we injected the brighter sources with fluxes of 1 and 2 mJy, and attempted to estimate the central positions of the discs by repeating the same analyses. Resultantly, the central positions were estimated to be (Δxcen,Δycen)=(0.580.07+0.07mas,0.500.07+0.07mas)Δsubscript𝑥cenΔsubscript𝑦censubscriptsuperscript0.580.070.07massubscriptsuperscript0.500.070.07mas(\Delta x_{\rm cen},\Delta y_{\rm cen})=(0.58^{+0.07}_{-0.07}{\rm mas},-0.50^{% +0.07}_{-0.07}{\rm mas})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) = ( 0.58 start_POSTSUPERSCRIPT + 0.07 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.07 end_POSTSUBSCRIPT roman_mas , - 0.50 start_POSTSUPERSCRIPT + 0.07 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.07 end_POSTSUBSCRIPT roman_mas ) for the 1 mJy source and (Δxcen,Δycen)=(0.970.07+0.07mas,0.820.07+0.07mas)Δsubscript𝑥cenΔsubscript𝑦censubscriptsuperscript0.970.070.07massubscriptsuperscript0.820.070.07mas(\Delta x_{\rm cen},\Delta y_{\rm cen})=(0.97^{+0.07}_{-0.07}{\rm mas},-0.82^{% +0.07}_{-0.07}{\rm mas})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ) = ( 0.97 start_POSTSUPERSCRIPT + 0.07 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.07 end_POSTSUBSCRIPT roman_mas , - 0.82 start_POSTSUPERSCRIPT + 0.07 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.07 end_POSTSUBSCRIPT roman_mas ) for the 2 mJy source. However, the positions of the flux centre with respect to the disc centre were calculated as (0.9mas,0.9mas)0.9mas0.9mas(0.9{\rm mas},-0.9{\rm mas})( 0.9 roman_mas , - 0.9 roman_mas ) and (1.8mas,1.8mas)1.8mas1.8mas(1.8{\rm mas},-1.8{\rm mas})( 1.8 roman_mas , - 1.8 roman_mas ) for the 1 and 2 mJy sources, respectively, and they were comparable to the biases in the estimation. Thus, even with our method, the central positions of the disc can be biased by approximately half of the positional difference between the disc and flux centres. This holds true not only for the point source but also for localized emission; for example, a crescent-shaped emission. In the data analysis, in case of bright localized emission, it is recommended that such an additional emission be modelled or removed because it affects the residual image creation process. Specifically, in the case of the point source, we can directly include the point source model in the axisymmetric model. However, if the emission is more complicated, we can model them on the image plane and remove their visibilities from the observations, as presented in Andrews et al. (2021).

Refer to caption
Refer to caption
Refer to caption
Figure 12: Residual images for the simulated data that include a simulated circumplanetary disc emission, whose position is indicated by a red dashed circle. The central and right panels show the maps in an observed and deprojected frames, respectively. The synthesized beam sizes before and after deprojection are (0.076″, 0.040″) and (0.089″, 0.045″) are shown at the bottom left, respectively.

6 Application to real data: Elias 20 and AS 209

We applied the proposed method to the real DSHARP data of two discs, Elias 20 and AS 209, to demonstrate its feasibility. AS 209 is one of most structured discs in the DSHARP sample, and Elias 20 shows a non-axisymmetric feature in the residual map as shown later. More systematic studies on the DSHARP discs will be presented in future studies.

We downloaded the measurement sets of the DSHARP data for the two discs, and applied time and spectral averaging in the same manner as that of Sec 4.1. We manually reduced the recorded weights by 3.50 and 3.44 for Elias 20 and AS 209, respectively, to match the observations. The data were then binned with a log grid with Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500, and they were analyzed with the current method. After sampling the posterior distribution for the parameters, we created the residual images following the method presented in Sec 3.1.1. To create residual images, we used two different geometries: one is obtained from our method, and the other obtained from Huang et al. (2018a), who estimated the geometric parameters through ellipse fitting of annular substructures on image planes.

Table 1 presents the derived geometries for Elias 20 and AS 209. For a reference, we also show the geometric parameters from Huang et al. (2018a). The length scale parameters γ𝛾\gammaitalic_γ for Elias 20 and AS 209 are comparable to their beam sizes (0.076,0.040)0.0760.040(0.076\arcsec,0.040\arcsec)( 0.076 ″ , 0.040 ″ ) and (0.048,0.028)0.0480.028(0.048\arcsec,0.028\arcsec)( 0.048 ″ , 0.028 ″ ), respectively. This result is consistent with the discussion in Sec 4.1 and Appendix D; γ𝛾\gammaitalic_γ was determined by the combination of the intrinsic brightness profile and the UV-coverage.

The derived geometries from our method were mostly consistent with the previous estimates from Huang et al. (2018a), although there are slight or moderate discrepancies. The central positional estimates for AS209 are offset by approximately 1111 mas for both directions. Similarly, in the case of Elias 20, the positions also show a difference of 1-2 mas, and notably, there is a 0.07 difference in cosi𝑖\cos iroman_cos italic_i.

These small discrepancies are important for creating the residual images. Figs. 13 and 14 show the brightness profiles, visibilities, and residual images. For comparison, we also show the brightness profiles and model visibilities obtained by Jennings et al. (2022a), who systematically analysed the DSHARP data. Here, they used frank (Jennings et al., 2020), which reconstructs the radial brightness profile by fitting the real part of the deprojected visibilities. Note that Jennings et al. (2022a) adopted the non-negativity condition on the brightness profile in case of AS 209, and assumed additional point-source emission with flux of 0.66 mJy at the disc centre to suppress the artificial oscillating features for Elias 20 (detailed discussion on point-source correction is presented in Appendix A in their paper). Their model visibilities, as shown in Fig. 13, for Elias 20 are thus converged to 0.66 mJy, which corresponds to the flux of this point-source emission.

The brightness profiles and model visibilities of the proposed method and that of frank were mostly consistent; however, there existed slight or moderate differences.

In the case of Elias 20, the model visibilities are horizontably offset, and this is due to the difference in the cosi𝑖\cos iroman_cos italic_i. In addition, the models using the proposed method gradually converged to zero at high spatial frequencies, whereas those from frank sharply converged to the flux of the point source around 5555 Mλ𝜆\lambdaitalic_λ. The locations of the tipping points for the convergence were determined by γ𝛾\gammaitalic_γ in our method or hyperparameters in frank, and the differences in the way of convergence were due to the different choices for the regularization.

The notable difference was observed in the peak brightness values near r=0𝑟0r=0\arcsecitalic_r = 0 ″.The peak brightness is generally hard to estimate accurately because of the small flux at the small radii. This is mainly due to the point-source correction with 0.66 mJy adopted in Jennings et al. (2022a), which was however not included in their brightness profile. The innermost brightness I(r1)𝐼subscript𝑟1I(r_{1})italic_I ( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) in our model corresponding to the flux 0.66 mJy is 6×1010similar-toabsent6superscript1010\sim 6\times 10^{10}∼ 6 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPTJy sr-1, which can largely explain the observed difference of about 10×101010superscript101010\times 10^{10}10 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT Jy sr-1 between this study and Jennings et al. (2022a). Another potential explanation could be the difference in the strength of the regularization for smoothing, although identifying the superior model remains challenging at this stage.

In the case of AS 209, the brightness profiles and model visibilities were mostly consistent for our method and that of frank. The peak brightness values, on the other hand, showed the difference of about 2.5×10102.5superscript10102.5\times 10^{10}2.5 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT Jy sr-1, which is yet smaller than that of Elias 20. The discrepancy potentially arises from the difference in regularization strengths or the non-negative condition adopted in Jennings et al. (2020). Overall, our method exhibited the same capability at recovering the brightness profiles as that of frank.

The estimation of the residual images was also improved using our updated geometry. In case of Elias 20, the residual image derived with the geometry from Huang et al. (2018a) exhibits the m=2𝑚2m=2italic_m = 2 pattern, which is mainly attributed to the inclination offset, as shown in Figs. 4 and 5. This coherent pattern, also seen in Jennings et al. (2020), mostly disappears in our update image, suggesting that our method suppresses the artificial pattern owing to wrong geometric parameters. In case of AS 209, the previous literature identified significant residual patterns (Guzmán et al., 2018; Jennings et al., 2020). The residual image with the updated geometry is less noisy than the previous estimate; however, structured patterns were still present. We thus conclude that the residual pattern for AS 209 cannot be removed by solely optimizing the geometric parameters for a geometrically-thin axisymmetric model.

There can be multiple possible explanations for the residual pattern in AS 209. It may be due to the real non-axisymmetry in the physical parameters, or because of the geometric effect. The individual rings may have different geometric parameters, that is, misalignment or positional offsets, where the latter case was reported for HL Tau (ALMA Partnership et al., 2015). Further, the vertical structure may also render the residual pattern complicated, although the observed residual image is not perfectly consistent with this hypothesis, which predicts the symmetric pattern with respect to the x𝑥xitalic_x axis (minor axis). To unveil the origin of the residual pattern for AS 209, we require a more complicated model with multiple geometric parameters or vertical structures; however this is beyond the scope of this paper.

Table 1: Hyperparameters for Gaussian Process and geometric parameters of Elias 20, AS 209, and PDS 70. Geometric parameters assumed in previous studies are also shown for comparison.
Name γ𝛾\gammaitalic_γ ["] log10αsubscript10𝛼\log_{10}\alpharoman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_α ΔxcenΔsubscript𝑥cen\Delta x_{\rm cen}roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT [mas] ΔycenΔsubscript𝑦cen\Delta y_{\rm cen}roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT [mas] cosi𝑖\cos iroman_cos italic_i PA [deg]
Elias 20 (this study) 0.0380.001+0.001subscriptsuperscript0.0380.0010.0010.038^{+0.001}_{-0.001}0.038 start_POSTSUPERSCRIPT + 0.001 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.001 end_POSTSUBSCRIPT 0.8000.150+0.161subscriptsuperscript0.8000.1610.1500.800^{+0.161}_{-0.150}0.800 start_POSTSUPERSCRIPT + 0.161 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.150 end_POSTSUBSCRIPT 53.9190.065+0.064subscriptsuperscript53.9190.0640.065-53.919^{+0.064}_{-0.065}- 53.919 start_POSTSUPERSCRIPT + 0.064 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.065 end_POSTSUBSCRIPT 488.8490.073+0.071subscriptsuperscript488.8490.0710.073-488.849^{+0.071}_{-0.073}- 488.849 start_POSTSUPERSCRIPT + 0.071 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.073 end_POSTSUBSCRIPT 0.5840.001+0.001subscriptsuperscript0.5840.0010.0010.584^{+0.001}_{-0.001}0.584 start_POSTSUPERSCRIPT + 0.001 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.001 end_POSTSUBSCRIPT 153.9300.060+0.060subscriptsuperscript153.9300.0600.060153.930^{+0.060}_{-0.060}153.930 start_POSTSUPERSCRIPT + 0.060 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.060 end_POSTSUBSCRIPT
Elias 20 (Huang et al. (2018a)) -54.5 -491.0 0.656 153.2
AS 209 (this study) 0.0280.001+0.001subscriptsuperscript0.0280.0010.0010.028^{+0.001}_{-0.001}0.028 start_POSTSUPERSCRIPT + 0.001 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.001 end_POSTSUBSCRIPT 1.2020.087+0.092subscriptsuperscript1.2020.0920.087-1.202^{+0.092}_{-0.087}- 1.202 start_POSTSUPERSCRIPT + 0.092 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.087 end_POSTSUBSCRIPT 0.7670.066+0.068subscriptsuperscript0.7670.0680.0660.767^{+0.068}_{-0.066}0.767 start_POSTSUPERSCRIPT + 0.068 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.066 end_POSTSUBSCRIPT 1.2250.050+0.048subscriptsuperscript1.2250.0480.050-1.225^{+0.048}_{-0.050}- 1.225 start_POSTSUPERSCRIPT + 0.048 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.050 end_POSTSUBSCRIPT 0.821>0.001<0.001subscriptsuperscript0.821absent0.001absent0.0010.821^{<0.001}_{>-0.001}0.821 start_POSTSUPERSCRIPT < 0.001 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT > - 0.001 end_POSTSUBSCRIPT 85.7660.038+0.036subscriptsuperscript85.7660.0360.03885.766^{+0.036}_{-0.038}85.766 start_POSTSUPERSCRIPT + 0.036 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.038 end_POSTSUBSCRIPT
AS 209 (Huang et al. (2018a)) 1.9 -2.5 0.819 85.8
PDS 70 (this study) 0.0640.003+0.003subscriptsuperscript0.0640.0030.0030.064^{+0.003}_{-0.003}0.064 start_POSTSUPERSCRIPT + 0.003 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.003 end_POSTSUBSCRIPT 2.3960.110+0.112subscriptsuperscript2.3960.1120.110-2.396^{+0.112}_{-0.110}- 2.396 start_POSTSUPERSCRIPT + 0.112 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.110 end_POSTSUBSCRIPT 10.8270.078+0.079subscriptsuperscript10.8270.0790.07810.827^{+0.079}_{-0.078}10.827 start_POSTSUPERSCRIPT + 0.079 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.078 end_POSTSUBSCRIPT 14.9530.105+0.101subscriptsuperscript14.9530.1010.10514.953^{+0.101}_{-0.105}14.953 start_POSTSUPERSCRIPT + 0.101 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.105 end_POSTSUBSCRIPT 0.64270.0002+0.0002subscriptsuperscript0.64270.00020.00020.6427^{+0.0002}_{-0.0002}0.6427 start_POSTSUPERSCRIPT + 0.0002 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.0002 end_POSTSUBSCRIPT 160.0030.022+0.020subscriptsuperscript160.0030.0200.022160.003^{+0.020}_{-0.022}160.003 start_POSTSUPERSCRIPT + 0.020 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.022 end_POSTSUBSCRIPT
PDS 70 Benisty et al. (2021) 12 15 0.6494 161
Refer to caption
Refer to caption
Refer to caption
Figure 13: Analysis of DSHARP data of Elias 20. (upper-left panels) Brightness profiles recovered by this study and Jennings et al. (2022a). The profiles are shown in linear and log scales in the upper and lower panels, respectively. (upper-right panels) Model and observed real visibilities and deprojected spatial frequencies. The model by Jennings et al. (2022a) is also shown for comparison. (lower panels) Residual image made with the geometry from our method in the observational frame, that in the deprojected frame, and the image made with that of Huang et al. (2018a) in the deprojected frame. In the residual images, the reference lines at r=0.5𝑟0.5r=0.5\arcsecitalic_r = 0.5 ″ are also shown. As treference, the ellipse and circles are shown. The synthesized beam sizes before and after deprojection are (0.048,0.028)0.0480.028(0.048\arcsec,0.028\arcsec)( 0.048 ″ , 0.028 ″ ) and (0.079,0.029)0.0790.029(0.079\arcsec,0.029\arcsec)( 0.079 ″ , 0.029 ″ ) are shown at the bottom left, respectively.
Refer to caption
Refer to caption
Refer to caption
Figure 14: Analysis of DSHARP data of AS 209. The format of the figure is the same as that of Fig. 13. The reference lines corresponding to gaps and rings are shown in brightness profiles and the residual images. The synthesized beam sizes before and after deprojection are (0.076,0.040)0.0760.040(0.076\arcsec,0.040\arcsec)( 0.076 ″ , 0.040 ″ ) and (0.076,0.049)0.0760.049(0.076\arcsec,0.049\arcsec)( 0.076 ″ , 0.049 ″ ), respectively.

7 Application to real data: PDS 70

7.1 Analysis with axisymmetric disc model

As a practical application to recovering a circumplanetary emission, we applied our method to the data of PDS 70 in Benisty et al. (2021). The same data were already analyzed by frank in Benisty et al. (2021). We used the combined dataset, including long, medium, and short baseline data, for continuum emission in Band 7 as used in Benisty et al. (2021). The data were then averaged in a similar manner to Sec 4 to reduce the data size.

In the continuum emission, there is a notable crescent feature in the North-West direction.
citebenisty2021 removed the asymmetric feature by following the method described in Andrews et al. (2021). Specifically, using the CLEAN model image, they defined the asymmetry model by isolating the emission of the crescent feature, and subtracted the mean radial profile outside the area from the model, leaving only the asymmetric contribution. The constructed model was then Fourier transformed, and the model visibilities are subtracted from the observed visibilities, which were analyzed using frank. As demonstrated in Andrews et al. (2021), the method is effective for extracting strong asymmetric features that hinder the detection of weak signals, such as emission from CPD (see Appendix B of their paper for details).

In this paper, we applied our method to the original data without any such prior subtraction, aiming to minimize the manual adjustment. A major concern with this simpler approach was that the localized crescent feature might bias the estimates of the geometric parameters, especially the position estimate (Δxcen,Δycen)Δsubscript𝑥cenΔsubscript𝑦cen(\Delta x_{\rm cen},\Delta y_{\rm cen})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT ), similar to that in Sec. 5.2. However, as will be shown later, the derived parameters agree well with those estimated by Benisty et al. (2021), with slight differences, such as 1mas1mas1\;{\rm mas}1 roman_mas in ΔxcenΔsubscript𝑥cen\Delta x_{\rm cen}roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT and 1deg1deg1\;{\rm deg}1 roman_deg in PA. The effect of the positional difference in the residual image is negligible in the current analysis (see Fig. 23 in Appendix F). Therefore, we simply present the result of the analysis with our method, without implementing the prior subtraction of the asymmetric features.

We employed a logarithmically spaced grid with Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500 and (xmin,xmax)=(102λ,1.1×107λ)subscript𝑥minsubscript𝑥maxsuperscript102𝜆1.1superscript107𝜆(x_{\rm min},x_{\rm max})=(10^{2}\lambda,1.1\times 10^{7}\lambda)( italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) = ( 10 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_λ , 1.1 × 10 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_λ ) to bin the data. We drew samples from the posterior of parameters using emcee with 16 walkers and 10,000 samples, and we ensured the convergence of MCMC. Table 1 lists the parameters from this study and Benisty et al. (2021) (specifically, Appendix B in their paper). The two studies yielded consistent values, while there are slight differences: approximately 1111 mas in ΔxΔ𝑥\Delta xroman_Δ italic_x, 0.0070.0070.0070.007 in cosi𝑖\cos iroman_cos italic_i, and 1superscript11^{\circ}1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT in PA.

We constructed a visibility model using the MAP solution of parameters, and subtracted it from the observed visibilities. Subsequently, the residuals were processed with CLEAN, with a threshold of nsigma=3.5absent3.5=3.5= 3.5. Note that we did not adopt JvM correction adopted in the previous literature (Czekala et al., 2021; Benisty et al., 2021; Balsalobre-Ruza et al., 2023) to prevent potential exaggeration of signal-to-noise ratios in an image (Casassus & Cárcamo, 2022). In the imaging process, we experimented with various robust parameters, specifically setting them to be 0.0, 0.5, 1.0, and 1.5. We found that a setting of 1.0 offers a small standard deviation image while maintaining relatively high angular resolution. For comparison, we also generated a residual image using the geometric parameters in Benisty et al. (2021) while adopting the hyperparameters in the MAP solution from our modeling.

Figure 15 illustrates the brightness profiles and visibilities derived from our method. For the brightness profiles, we show 30 random samples, indicated by the light orange lines. The brightness profile was characterized by the presence of the inner disc as well as the outer disc with two local maxima, consistently with previous studies (Keppler et al., 2019; Benisty et al., 2021). At the outer disc, there is another shoulder at 0.850.850.85\arcsec0.85 ″, as also reported in Benisty et al. (2021). A slight positive excess was also observed in the cavity region, between the inner and outer discs. The flux would arise from emission around PDS 70 b&c and their Lagrange points (Benisty et al., 2021; Balsalobre-Ruza et al., 2023).

Figure 15 shows the residual images for PDS 70; the left figure in the lower panels shows the residual map based on our study in the observational frame, the middle one is shown in the deprojected frame, and the right one is based on the geometry in Benisty et al. (2021). The residual images from both geometries are consistent, but there is a slight discrepancy due to difference in geometry. We investigated which parameter made the large difference, and found that the difference in PA largely accounts for the discrepancy (also see Appendix F).

The upper panels in Figure 16 shows the residual images in deprojected coordinate and the polar coordinate. Our methods successfully recovered the circumplanetary emission around PDS 70 c, in addition to the crescent feature. The emission of the circumplanetary disc remains unresolved, consistently with the analysis in Benisty et al. (2021). We measured the brightness of the circumplanetary emission around PDS 70 c. The estimated peak brightness of the emission were 95.2±17.7plus-or-minus95.217.795.2\pm 17.795.2 ± 17.7 μ𝜇\muitalic_μJy beam-1, 89.8±13.1plus-or-minus89.813.189.8\pm 13.189.8 ± 13.1 μ𝜇\muitalic_μJy beam-1, and 91.2±11.9plus-or-minus91.211.991.2\pm 11.991.2 ± 11.9 μ𝜇\muitalic_μJye beam-1 for robust=0,0.5,1.0robust00.51.0{\rm robust}=0,0.5,1.0roman_robust = 0 , 0.5 , 1.0, respectively. Our estimates for robust=0robust0\rm{robust}=0roman_robust = 0 and 0.50.50.50.5 were consistent with the result from Benisty et al. (2021), which reported 86±10plus-or-minus861086\pm 1086 ± 10 μ𝜇\muitalic_μJy beam-1 and 79±7plus-or-minus79779\pm 779 ± 7 μ𝜇\muitalic_μJy beam-1. It should be noted that in our study, the minor/major beam sizes exhibit a slight deviation from those reported in Benisty et al. (2021) (see Table 4) with discrepancies ranging between 0.01 and 0.03\arcsec. This minor variation is likely due to the time and spectral averaging processes applied to our measurement sets, but the discrepancies give negligible impacts on the measurements of intensities for PDS 70 c. On the other hand, our estimate for robust=1.0robust1.0{\rm robust}=1.0roman_robust = 1.0 significantly differed from 170±4plus-or-minus1704170\pm 4170 ± 4 μ𝜇\muitalic_μJy beam-1 reported in Benisty et al. (2021). The discrepancy likely arises from the contamination from the main disc emission, which is however mostly mitigated by our method.

An extended emission around PDS 70 b was reported in Benisty et al. (2021). Our analyses similarly identified a positive excess around it, and the enclosed flux for each robust parameter was around 50 μ𝜇\muitalic_μJy, comparable to 38 μ𝜇\muitalic_μJy reported by Benisty et al. (2021), although our measured flux was influenced by the cumulative noise within the region. Additionally, there was a claim on the tentative co-orbital emission near PDS 70 b’s L5 Lagrangian point (Balsalobre-Ruza et al., 2023). We found the signal to be visually insignificant in the images. Indeed, the peak emission in this region showed significance of 2.0-2.7σ𝜎\sigmaitalic_σ for robust=(0,0.5,1)robust00.51\rm{robust}=(0,0.5,1)roman_robust = ( 0 , 0.5 , 1 ), slightly lower than the significance around 3.3-3.4σ𝜎\sigmaitalic_σ reported in Balsalobre-Ruza et al. (2023) for the robust parameters that we considered. The slight discrepancy might be due to the subtraction of the annular brightness in our study. On the other hand, Balsalobre-Ruza et al. (2023) observed higher significance at 5-6σ𝜎\sigmaitalic_σ if they employed robust1.5robust1.5{\rm robust}\geq 1.5roman_robust ≥ 1.5 and JvM correction, which were not investigated in this study. The further observations and analyses of these marginal signals will be needed.

Refer to caption
Refer to caption
Refer to caption
Figure 15: Analysis for PDS 70. The format of the figure is the same as that of Fig. 13 except that the zoom-up view of the radial profile is shown in the linear scale. The reference lines are shown at r=(0.4,0.6,0.8)𝑟0.40.60.8r=(0.4\arcsec,0.6\arcsec,0.8\arcsec)italic_r = ( 0.4 ″ , 0.6 ″ , 0.8 ″ ) in the panels of the brightness profile and the residual maps. The synthesized beam sizes before and after deprojection are (0.063(0.063\arcsec( 0.063 ″ , 0.0530.0530.053\arcsec0.053 ″) and (0.098(0.098\arcsec( 0.098 ″ , 0.053)0.053\arcsec)0.053 ″ ), respectively.

7.2 Additional analysis on residual images by subtracting crescent model

Apart from the crescent feature and circumplanetary emission, residual features still persist in the image. Benisty et al. (2021) also identified the residual features in the image after subtracting the axisymmetric component and asymmetric feature including the crescent feature (rightmost end of Fig. 8 in their paper), although the detailed discussion was not presented. Here, we revisited the residual feature in more detail using our updated residual image.

For the further investigation, the high contrast of the bright crescent feature hinders the search of faint features in the images. In addition, the bright crescent unnecessarily introduces the negative flux, because the residual image is supposed to have zero flux when averaged over the polar direction. To avoid the latter problem, Benisty et al. (2021) removed the asymmetric feature in the image-based analysis before the visibility analysis. To mitigate these effects, we instead attempted to subtract the bright crescent feature by employing a provisional analytical model. Specifically, we considered a model comprising of the super Gaussian function in the radial direction and the von Mises distribution in the polar direction as follows:

Icrescent(r,ϕ)subscript𝐼crescent𝑟italic-ϕ\displaystyle I_{\rm crescent}(r,\phi)italic_I start_POSTSUBSCRIPT roman_crescent end_POSTSUBSCRIPT ( italic_r , italic_ϕ ) =Icrescent,ampexp(((rr0)22σr2)α)absentsubscript𝐼crescentampsuperscriptsuperscript𝑟subscript𝑟022superscriptsubscript𝜎𝑟2𝛼\displaystyle=I_{{\rm crescent,amp}}\exp\left(-\left(\frac{(r-r_{0})^{2}}{2% \sigma_{r}^{2}}\right)^{\alpha}\right)= italic_I start_POSTSUBSCRIPT roman_crescent , roman_amp end_POSTSUBSCRIPT roman_exp ( - ( divide start_ARG ( italic_r - italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT )
×(exp(κcos(ϕμ))2πIn=0,modfiedbessel(β)12π),absent𝜅italic-ϕ𝜇2𝜋subscript𝐼𝑛0modfiedbessel𝛽12𝜋\displaystyle\times\left(\frac{\exp(\kappa\cos(\phi-\mu))}{2\pi I_{n=0,{\rm modfied% \;bessel}}(\beta)}-\frac{1}{2\pi}\right),× ( divide start_ARG roman_exp ( italic_κ roman_cos ( italic_ϕ - italic_μ ) ) end_ARG start_ARG 2 italic_π italic_I start_POSTSUBSCRIPT italic_n = 0 , roman_modfied roman_bessel end_POSTSUBSCRIPT ( italic_β ) end_ARG - divide start_ARG 1 end_ARG start_ARG 2 italic_π end_ARG ) , (78)

where In,modfiedbesselsubscript𝐼𝑛modfiedbesselI_{n,{\rm modfied\;bessel}}italic_I start_POSTSUBSCRIPT italic_n , roman_modfied roman_bessel end_POSTSUBSCRIPT is the modified Bessel function of the first kind, Icrescent,ampsubscript𝐼crescentampI_{{\rm crescent,amp}}italic_I start_POSTSUBSCRIPT roman_crescent , roman_amp end_POSTSUBSCRIPT determines the amplitude of the crescent model, α𝛼\alphaitalic_α is the exponent for the super Gaussiaan function, r0subscript𝑟0r_{0}italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denotes the radial position for the crescent peak, σrsubscript𝜎𝑟\sigma_{r}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT indicates the radial width, κ𝜅\kappaitalic_κ specifies the azimuthal concentration, and μ𝜇\muitalic_μ determines the azimuthal location for the peak. The integral of the equation (78) along ϕitalic-ϕ\phiitalic_ϕ is zero, consistent with the construction of the residual image. The adopted model does not perfectly explain the crescent feature, but it is still useful for removing the bright component of the crescent. We fitted the model to the residual image in Sec 7.1, and the optimization was performed using curve_fit in scipy, yielding (Icrescent,amp,α,r0,σr,κ,μ)=(0.339mJy/beam,0.793,0.623,0.0723,5.88,0.960rad)subscript𝐼crescentamp𝛼subscript𝑟0subscript𝜎𝑟𝜅𝜇0.339mJybeam0.7930.6230.07235.880.960rad(I_{{\rm crescent,amp}},\alpha,r_{0},\sigma_{r},\kappa,\mu)=(0.339\;{\rm mJy/% beam},0.793,0.623\arcsec,0.0723\arcsec,5.88,-0.960\;{\rm rad})( italic_I start_POSTSUBSCRIPT roman_crescent , roman_amp end_POSTSUBSCRIPT , italic_α , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_κ , italic_μ ) = ( 0.339 roman_mJy / roman_beam , 0.793 , 0.623 ″ , 0.0723 ″ , 5.88 , - 0.960 roman_rad ).

The lower panels in Figure 16 shows the residual images with the crescent models being subtracted in deprojected coordinate and the polar coordinate. We confirmed the presence of emission from PDS 70 c, while no significant detection for point-source emission was observed from PDS 70 b and its L5 point.

The subtraction process facilitated the identification of faint features in the residual images. We here briefly summarize their characteristics in the following:

  • (a)

    Crescent is trailing : As evident in the residual images with and without the subtraction in a polar coordinate, the observed crescent feature appears to exhibit a trailing pattern rather than circular shape. Here the disc rotation is clockwise, as we deduced from the velocity gradient map and the emission surface of the gas (Keppler et al., 2019). One interesting possibility is that a planetary gravity perturbs the crescent, resulting in the trailing shape.

  • (b)

    Extended emission from crescent to PDS 70 c?: We also observed that the flux excess at the crescent appears to extend toward the vicinity of PDS 70 c. One interesting possibility is that this feature implies a dust accretion flow to PDS 70 c from the outer disc.

  • (c)

    Faint arm stemming from inner end of crescent: A faint leading arm is observed to stem from the inner end of the crescent. This feature is also faintly discernible in the residual images without the subtraction.

  • (d)

    Arm/Vortex-like structure around 0.750.750.75\arcsec0.75 ″ stemming from crescent: Another arc or vortex stemming from the crescent was observed, and it is radially concentrated around r=0.75𝑟0.75r=0.75\arcsecitalic_r = 0.75 ″. This feature is more pronounced in our updated residual image than the residual image based on the geometry from Benisty et al. (2021), due to the difference in the adopted geometries, especially in PA (also see Appendix F). The base of the arm is overlapped with the crescent model in the lower panels, potentially resulting in over-subtraction of the flux of the arm near the region.

  • (e)

    Depletion near crescent: We observed a strong negative feature near the crescent feature. It is visible on both residual maps, with and without subtraction of crescent models. This might be attributed to the counter effect of the dust concentration at the crescent.

  • (f)

    Blobs in crescent: We observed blobs inside the crescent, suggesting that the crescent is not unimodal but rather multimodal. There are at least two visible blobs in the crescent feature, located at (x′′,y′′)=(0.53,0.24)superscript𝑥′′superscript𝑦′′0.530.24(x^{\prime\prime},y^{\prime\prime})=(0.53\arcsec,-0.24\arcsec)( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) = ( 0.53 ″ , - 0.24 ″ ) and (x′′,y′′)=(0.58,0.02)superscript𝑥′′superscript𝑦′′0.580.02(x^{\prime\prime},y^{\prime\prime})=(0.58\arcsec,0.02\arcsec)( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) = ( 0.58 ″ , 0.02 ″ ) in the deprojected frames. The blobs exhibited an excess of 3-5σ𝜎\sigmaitalic_σ with respect to the background level of the the crescent feature.

All of the features appear to be associated with the crescent feature. In Appendix F, we presented the difference in the residual images derived with geometries from our study and Benisty et al. (2021). Apart from the possible arm (d), all of the features were similarly identified in both of the residual images, reinforcing the coherence of the signals within the current data set.

Substructures in the PDS 70 disc have also been reported in different bands. The arm-like structure was found in the northwest direction in the near-infrared bands, near the crescent observed in the radio band (Müller et al., 2018; Juillard et al., 2022; Christiaens et al., 2024). Juillard et al. (2022) investigated the motion of the arm over six years. However, they did not find the anticipated rotation expected if the arm were comoving with PDS 70 c. This suggests that the arm might be a vortex rather than a spiral. Indeed, as demonstrated in Fig 4 of Marr & Dong (2022), such a circular arc/vortex may be misunderstood as one-armed spiral in the near-infrared band, where the disc thickness becomes significant. On the other hand, the crescent observed in the radio band in this study displays a trailing pattern, which is unlikely to result from a purely geometric effect. One alternative explanation for the arm might be the presence of an undetected companion in the outer disc. Christiaens et al. (2024) set an upper limit on the mass of such a potential planet, obtaining limits 1-4 Jupiter masses from the JWST observation in the near-infrared band.

Additionally, Christiaens et al. (2024) identified a spiral accretion stream to the vicinity of PDS 70 c in the near infrared band by the JWST observation. It might be related to the extended emission excess near PDS 70 in the radio band (feature b in this study), but the connection between the features is not clear at this point.

The present findings rely on the assumption that the disc structure is well approximated by the thin axisymmetric disc with a single set of geometric parameters. This underlying assumption may be inadequate if the inner and outer discs are misaligned, or the disc thickness is substantial. In addition, there is still room for improvement for the modeling of the crescent feature, and the model might be better to be incorporated into the framework of the axisymmetric modelling. A comprehensive analysis of these points is essential for the robust detection the features, and we leave this issues for the future study.

Refer to caption
Figure 16: The residual images for PDS 70 in the deprojected coordinate (left panels) and in the polar coordinate (right panels). The upper and lower panels show the residual images without and with the subtraction of the crescent models, as discussed in Sec 7.2. Contours of the crescent models are also overlaid as black lines with levels of (50,100,200)50100200(50,100,200)( 50 , 100 , 200 ) μ𝜇\muitalic_μJy beam-1. Annotations in the upper panels show the locations of the protoplanets and the L5 point of PDS 70 b, as provided by Balsalobre-Ruza et al. (2023). Annotations in the lower panels specify the possible structures identified in this system, and the further explanations for the features (a)-(f) are provided in Sec 7.2.

8 Summary

This study proposed a scheme for estimating the geometry, hyperparameters, central position, and brightness profiles assuming a geometrically thin disc in radio interferometry. Our approach is less susceptible to human biases due to manual tuning of parameters in contrast to frank, where the non-linear parameters need to be determined a priori.

Simulating observations for an axisymmetric disc, we demonstrated that the proposed method can successfully retrieve geometric parameters in a more precise manner than that using Gaussian fitting. Additionally, we performed injection and recovery tests for the non-axisymmetric perturbations to the simulated data, and showed that the proposed method can reasonably recover the injected structures. However, the estimated geometric parameters were slightly shifted from the assumed values. This is attributed to the degeneracy between the non-axisymmetric perturbations and the residuals caused by biases in the geometric parameters.

The model was then applied to the real data for Elias 20 and AS 209, and the ability of the method to determine the disc geometry and brightness profile was demonstrated especially for Elias 20. Moreover, the data for the continuum emission of the PDS 70 were reanalyzed with our method. Our methods successfully identified the circumplanetary emission from PDS 70 c and the crescent feature. Furthermore, we tentatively identified several new features, including trailing nature of crescent, extended emission near PDS 70 c, arm-like structures, dust depletion near crescent, and blobs. The origins of these features are unclear, and a further modeling is needed.

The current methodology can be applied to any type of continuum data in radio interferometric observations, and future studies will analyse the DSHARP data (Andrews et al., 2018) to explore their non-axisymmetric structures.

One of the notable strengths of this research is its capacity to adapt to more complex problems involving a multitude of geometric or hyper parameters, which are difficult to adjust manually. Below, we present the potential directions for extending our study:

  • The model can be extended to cover multiple rings/gaps with different central positions, inclinations, or position angles as observed in GW Ori (Bi et al., 2020). This can be realized by considering multiple sets of geometric parameters, each of which is applied to one of the separate ranges.

  • The current model can be extended to incorporate frequency-dependent radial profiles for multi-frequency data. We can straightforwardly expand our formulation to include the frequency dependence in a linearized formulation by Taylor expansions in a manner similar to multi-frequency CLEAN (Rau & Cornwell, 2011):

    I(r;ν)=t(νν0ν0)tIt,ν0(r),𝐼𝑟𝜈subscript𝑡superscript𝜈subscript𝜈0subscript𝜈0𝑡subscript𝐼𝑡subscript𝜈0𝑟\displaystyle I(r;\nu)=\sum_{t}\left(\frac{\nu-\nu_{0}}{\nu_{0}}\right)^{t}I_{% t,\nu_{0}}(r),italic_I ( italic_r ; italic_ν ) = ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( divide start_ARG italic_ν - italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_t , italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_r ) , (79)

    where I(r;ν)𝐼𝑟𝜈I(r;\nu)italic_I ( italic_r ; italic_ν ) is the radial brightness at the frequency ν𝜈\nuitalic_ν, ν0subscript𝜈0\nu_{0}italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the reference frequency, and It,ν0(r)subscript𝐼𝑡subscript𝜈0𝑟I_{t,\nu_{0}}(r)italic_I start_POSTSUBSCRIPT italic_t , italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_r ) is the coefficient of t𝑡titalic_t-th Taylor expansions. With this expression, we can analytically derive the model parameters in the same manner as that presented in the current study.

  • The current study focuses on the axisymmetric component I(r)𝐼𝑟I(r)italic_I ( italic_r ); however, we may be able to include non-axisymmetric components (Im(r),ϕm(r))subscript𝐼𝑚𝑟subscriptitalic-ϕ𝑚𝑟(I_{m}(r),\phi_{m}(r))( italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) , italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_r ) ) in the model, and directly solve them. Such modelling efforts might ease the degeneracy between the geometric parameters and non-axisymmetric structures.

  • Although we created residual images with CLEAN, we can also use other imaging techniques, for example, sparse modelling, to produce images alternatively (Honma et al., 2014; Nakazato & Ikeda, 2020; Yamaguchi et al., 2020; Aizawa et al., 2020). In particular, sparse modelling favours solutions with many zeros, and it can achieve better angular resolutions; thus it will be helpful for resolving or identifying the new substructures, including spirals and circumplanetary emission.

  • A more comprehensive analyse for PDS 70 will be essential for the reliable detection of the discovered structures. Incorporating the crescent model into our model will allow us to separate the residual signal accurately. Exploring more complicated models that consider a disc thickness or misalignment between inner and outer disc planes will be rewarding. The multi-wavelength data will be also useful for assessing the consistency of the signals across different wavelengths.

These will be considered in future studies.

Acknowledgments

We thank Kazuhiro Kanagawa, Hongping Deng, and Ruobing Dong for their insightful discussions. We also thank the anonymous referee for constructive comments, which significantly improved the quality of the paper. Furthermore, we would like to extend our gratitude to Myriam Benisty for sharing the data for PDS 70 used in this research.

This work was supported by JSPS KAKENHI grant numbers 17H01103, 18H05441, and 23K03463. M.A. is supported by Special Postdoctoral Researcher Program at RIKEN. T.M. is supported by Yamada Science Foundation Overseas Research Support Program.

Data analysis was in part carried out on the Multi-wavelength Data Analysis System operated by the Astronomy Data Center (ADC), National Astronomical Observatory of Japan. This research is based on the following ALMA data #2013.1.00226.S, #2015.1.00486.S, and #2016.1.00484.L. ALMA is a partnership of ESO (representing its member states), NSF (USA) and NINS (Japan), together with NRC (Canada), MOST and ASIAA (Taiwan), and KASI (Republic of Korea), in cooperation with the Republic of Chile. The Joint ALMA Observatory is operated by ESO, AUI/NRAO and NAOJ.

Software: Astropy (Astropy Collaboration et al., 2013), CASA (McMullin et al., 2007), corner (Foreman-Mackey, 2016), emcee (Foreman-Mackey et al., 2013), Jupyter Notebook (Kluyver et al., 2016), Matplotlib (Hunter, 2007), NumPy (van der Walt et al., 2011), Pandas (McKinney, 2010), PyNUFFT (Lin, 2018; Lin & Chung, 2017), SciPy (Virtanen et al., 2020).

Data Availability

The DSHARP data used in this article are available in the DSHARP Data Release at https://bulk.cv.nrao.edu/almadata/lp/DSHARP. The data underlying this article will be shared on reasonable request to the corresponding author.

References

  • ALMA Partnership et al. (2015) ALMA Partnership et al., 2015, ApJ, 808, L3
  • Aizawa et al. (2020) Aizawa M., Suto Y., Oya Y., Ikeda S., Nakazato T., 2020, ApJ, 899, 55
  • Andrews et al. (2018) Andrews S. M., et al., 2018, ApJ, 869, L41
  • Andrews et al. (2021) Andrews S. M., et al., 2021, ApJ, 916, 51
  • Astropy Collaboration et al. (2013) Astropy Collaboration et al., 2013, A&A, 558, A33
  • Balsalobre-Ruza et al. (2023) Balsalobre-Ruza O., et al., 2023, A&A, 675, A172
  • Benisty et al. (2021) Benisty M., et al., 2021, ApJ, 916, L2
  • Bi et al. (2020) Bi J., et al., 2020, ApJ, 895, L18
  • Binney & Tremaine (2008) Binney J., Tremaine S., 2008, Galactic Dynamics: Second Edition
  • Casassus & Cárcamo (2022) Casassus S., Cárcamo M., 2022, MNRAS, 513, 5790
  • Christiaens et al. (2024) Christiaens V., et al., 2024, arXiv e-prints, p. arXiv:2403.04855
  • Currie et al. (2022) Currie T., et al., 2022, Nature Astronomy, 6, 751
  • Czekala et al. (2021) Czekala I., et al., 2021, ApJS, 257, 2
  • Dong et al. (2016) Dong R., Zhu Z., Fung J., Rafikov R., Chiang E., Wagner K., 2016, ApJ, 816, L12
  • Dong et al. (2018) Dong R., Li S., Chiang E., Li H., 2018, ApJ, 866, 110
  • Flock et al. (2015) Flock M., Ruge J. P., Dzyurkevich N., Henning T., Klahr H., Wolf S., 2015, A&A, 574, A68
  • Foreman-Mackey (2016) Foreman-Mackey D., 2016, The Journal of Open Source Software, 1, 24
  • Foreman-Mackey et al. (2013) Foreman-Mackey D., Hogg D. W., Lang D., Goodman J., 2013, PASP, 125, 306
  • Goldreich & Tremaine (1980) Goldreich P., Tremaine S., 1980, ApJ, 241, 425
  • Grady et al. (2013) Grady C. A., et al., 2013, ApJ, 762, 48
  • Guzmán et al. (2018) Guzmán V. V., et al., 2018, ApJ, 869, L48
  • Hammond et al. (2023) Hammond I., Christiaens V., Price D. J., Toci C., Pinte C., Juillard S., Garg H., 2023, MNRAS, 522, L51
  • Hashimoto et al. (2021) Hashimoto J., Muto T., Dong R., Liu H. B., van der Marel N., Francis L., Hasegawa Y., Tsukagoshi T., 2021, ApJ, 911, 5
  • Honma et al. (2014) Honma M., Akiyama K., Uemura M., Ikeda S., 2014, PASJ, 66, 95
  • Huang et al. (2018a) Huang J., et al., 2018a, ApJ, 869, L42
  • Huang et al. (2018b) Huang J., et al., 2018b, ApJ, 869, L43
  • Hunter (2007) Hunter J. D., 2007, Computing in Science and Engineering, 9, 90
  • Jennings et al. (2020) Jennings J., Booth R. A., Tazzari M., Rosotti G. P., Clarke C. J., 2020, MNRAS, 495, 3209
  • Jennings et al. (2022a) Jennings J., Booth R. A., Tazzari M., Clarke C. J., Rosotti G. P., 2022a, MNRAS, 509, 2780
  • Jennings et al. (2022b) Jennings J., Tazzari M., Clarke C. J., Booth R. A., Rosotti G. P., 2022b, MNRAS, 514, 6053
  • Juillard et al. (2022) Juillard S., Christiaens V., Absil O., 2022, A&A, 668, A125
  • Kanagawa et al. (2021) Kanagawa K. D., et al., 2021, ApJ, 909, 212
  • Kawahara & Masuda (2020) Kawahara H., Masuda K., 2020, ApJ, 900, 48
  • Kawahara et al. (2022) Kawahara H., Kawashima Y., Masuda K., Crossfield I. J. M., Pannier E., van den Bekerom D., 2022, ApJS, 258, 31
  • Keppler et al. (2018) Keppler M., et al., 2018, A&A, 617, A44
  • Keppler et al. (2019) Keppler M., et al., 2019, A&A, 625, A118
  • Kluyver et al. (2016) Kluyver T., et al., 2016, in , IOS Press. pp 87–90, doi:10.3233/978-1-61499-649-1-87
  • Lin (2018) Lin J.-M., 2018, Journal of Imaging, 4
  • Lin & Chung (2017) Lin J.-M., Chung H.-W., 2017, Building Bridges in Medical Sciences
  • Lin & Papaloizou (1979) Lin D. N. C., Papaloizou J., 1979, MNRAS, 186, 799
  • Marr & Dong (2022) Marr M., Dong R., 2022, ApJ, 930, 80
  • McKinney (2010) McKinney W., 2010, in van der Walt S., Millman J., eds, Proceedings of the 9th Python in Science Conference. pp 51 – 56
  • McMullin et al. (2007) McMullin J. P., Waters B., Schiebel D., Young W., Golap K., 2007, in Shaw R. A., Hill F., Bell D. J., eds, Astronomical Society of the Pacific Conference Series Vol. 376, Astronomical Data Analysis Software and Systems XVI. p. 127
  • Müller et al. (2018) Müller A., et al., 2018, A&A, 617, L2
  • Muto et al. (2012) Muto T., et al., 2012, ApJ, 748, L22
  • Nakazato & Ikeda (2020) Nakazato T., Ikeda S., 2020, PRIISM: Python module for Radio Interferometry Imaging with Sparse Modeling, Astrophysics Source Code Library, record ascl:2006.002 (ascl:2006.002)
  • Okuzumi et al. (2016) Okuzumi S., Momose M., Sirono S.-i., Kobayashi H., Tanaka H., 2016, ApJ, 821, 82
  • Pinte et al. (2018) Pinte C., et al., 2018, ApJ, 860, L13
  • Pinte et al. (2020) Pinte C., et al., 2020, ApJ, 890, L9
  • Pinte et al. (2023) Pinte C., et al., 2023, arXiv e-prints, p. arXiv:2301.08759
  • Rau & Cornwell (2011) Rau U., Cornwell T. J., 2011, A&A, 532, A71
  • Reggiani et al. (2014) Reggiani M., et al., 2014, ApJ, 792, L23
  • Speedie & Dong (2022) Speedie J., Dong R., 2022, arXiv e-prints, p. arXiv:2211.05757
  • Takahashi & Inutsuka (2014) Takahashi S. Z., Inutsuka S.-i., 2014, ApJ, 794, 55
  • Tazzari et al. (2017) Tazzari M., et al., 2017, A&A, 606, A88
  • Thompson et al. (2017) Thompson A. R., Moran J. M., Swenson George W. J., 2017, Interferometry and Synthesis in Radio Astronomy, 3rd Edition, doi:10.1007/978-3-319-44431-4.
  • Virtanen et al. (2020) Virtanen P., et al., 2020, Nature Methods, 17, 261
  • Wagner et al. (2015) Wagner K., Apai D., Kasper M., Robberto M., 2015, ApJ, 813, L2
  • Wagner et al. (2018) Wagner K., et al., 2018, ApJ, 863, L8
  • Wagner et al. (2023) Wagner K., et al., 2023, Nature Astronomy, 7, 1208
  • Yamaguchi et al. (2020) Yamaguchi M., et al., 2020, ApJ, 895, 84
  • Zhang et al. (2015) Zhang K., Blake G. A., Bergin E. A., 2015, ApJ, 806, L7
  • van der Walt et al. (2011) van der Walt S., Colbert S. C., Varoquaux G., 2011, Computing in Science and Engineering, 13, 22

Appendix A Efficient computation of evidence given geometry

The direct computation of 𝒩(𝒅|0,𝚺¯d+H¯𝚺¯aH¯T)𝒩conditional𝒅0subscript¯𝚺𝑑¯𝐻subscript¯𝚺𝑎superscript¯𝐻𝑇\mathcal{N}(\bm{d}|0,\bar{\bm{\Sigma}}_{d}+\bar{H}\bar{\bm{\Sigma}}_{a}\bar{H}% ^{T})caligraphic_N ( bold_italic_d | 0 , over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) in equation (58) requires the inverse matrix of 𝚺¯d+H¯𝚺¯a𝑯¯Tsubscript¯𝚺𝑑¯𝐻subscript¯𝚺𝑎superscript¯𝑯𝑇\bar{\bm{\Sigma}}_{d}+\bar{H}\bar{\bm{\Sigma}}_{a}\bar{\bm{H}}^{T}over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, whose computational cost can be 𝒪(M3)𝒪superscript𝑀3\mathcal{O}(M^{3})caligraphic_O ( italic_M start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) with M𝑀Mitalic_M the number of data points. As 𝒪(M3)𝒪superscript𝑀3\mathcal{O}(M^{3})caligraphic_O ( italic_M start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) can be very large for the usual case in interferometric observations M>1056𝑀superscript1056M>10^{5-6}italic_M > 10 start_POSTSUPERSCRIPT 5 - 6 end_POSTSUPERSCRIPT, we aimed at reducing this computation through the formula transformation.

Using the Woodbury identity (A+UWV)1=A1A1U(W1+VA1U)1VA1superscript𝐴𝑈𝑊𝑉1superscript𝐴1superscript𝐴1𝑈superscriptsuperscript𝑊1𝑉superscript𝐴1𝑈1𝑉superscript𝐴1(A+UWV)^{-1}=A^{-1}-A^{-1}U(W^{-1}+VA^{-1}U)^{-1}VA^{-1}( italic_A + italic_U italic_W italic_V ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_U ( italic_W start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_V italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_U ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_V italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, we obtain

(𝚺¯d+𝑯¯𝚺¯a𝑯¯T)1=𝚺¯d1𝚺¯d1𝑯¯𝚺¯a|d𝑯¯T𝚺¯d1.superscriptsubscript¯𝚺𝑑¯𝑯subscript¯𝚺𝑎superscript¯𝑯𝑇1superscriptsubscript¯𝚺𝑑1superscriptsubscript¯𝚺𝑑1¯𝑯subscript¯𝚺conditional𝑎𝑑superscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1\displaystyle(\bar{\bm{\Sigma}}_{d}+\bar{\bm{H}}\bar{\bm{\Sigma}}_{a}\bar{\bm{% H}}^{T})^{-1}=\bar{\bm{\Sigma}}_{d}^{-1}-\bar{\bm{\Sigma}}_{d}^{-1}\bar{\bm{H}% }\bar{\bm{\Sigma}}_{a|d}\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}.( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (80)

With this equation, we can compute the log probability for p(𝒅|𝒈,𝜽)𝑝conditional𝒅𝒈𝜽p(\bm{d}|\bm{g},\bm{\theta})italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) as follows:

2log(p(𝒅|𝒈,𝜽))2𝑝conditional𝒅𝒈𝜽\displaystyle-2\log(p(\bm{d}|\bm{g},\bm{\theta}))- 2 roman_log ( italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) ) =\displaystyle== log|det((𝚺¯d+𝑯¯𝚺¯a𝑯¯T))|+𝒅T𝚺¯d1𝒅detsubscript¯𝚺𝑑¯𝑯subscript¯𝚺𝑎superscript¯𝑯𝑇superscript𝒅𝑇superscriptsubscript¯𝚺𝑑1𝒅\displaystyle\log|{\rm det}((\bar{\bm{\Sigma}}_{d}+\bar{\bm{H}}\bar{\bm{\Sigma% }}_{a}\bar{\bm{H}}^{T}))|+\bm{d}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bm{d}roman_log | roman_det ( ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) ) | + bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_d (81)
\displaystyle-- 𝒅T𝚺¯d1𝑯¯𝚺¯a|d𝑯¯T𝚺¯d1𝒅.superscript𝒅𝑇superscriptsubscript¯𝚺𝑑1¯𝑯subscript¯𝚺conditional𝑎𝑑superscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1𝒅\displaystyle\bm{d}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bar{\bm{H}}\bar{\bm{\Sigma}}% _{a|d}\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bm{d}.bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_d .

Using another identity det(A+UWV)=det(W1+VA1U)det(W)det(A)det𝐴𝑈𝑊𝑉detsuperscript𝑊1𝑉superscript𝐴1𝑈det𝑊det𝐴{\rm det}(A+UWV)={\rm det}(W^{-1}+VA^{-1}U){\rm det}(W){\rm det}(A)roman_det ( italic_A + italic_U italic_W italic_V ) = roman_det ( italic_W start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_V italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_U ) roman_det ( italic_W ) roman_det ( italic_A ), we can compute the first term as follows:

log|det((𝚺¯d+𝑯¯𝚺¯a𝑯¯T))|=log|det(𝚺¯a|d1)|detsubscript¯𝚺𝑑¯𝑯subscript¯𝚺𝑎superscript¯𝑯𝑇detsuperscriptsubscript¯𝚺conditional𝑎𝑑1\displaystyle\log|{\rm det}((\bar{\bm{\Sigma}}_{d}+\bar{\bm{H}}\bar{\bm{\Sigma% }}_{a}\bar{\bm{H}}^{T}))|=\log|{\rm det}(\bar{\bm{\Sigma}}_{a|d}^{-1})|roman_log | roman_det ( ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) ) | = roman_log | roman_det ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) | (82)
+\displaystyle++ log|det(𝚺¯d)|+log|det(𝚺¯a)|.detsubscript¯𝚺𝑑detsubscript¯𝚺𝑎\displaystyle\log|{\rm det}(\bar{\bm{\Sigma}}_{d})|+\log|{\rm det}(\bar{\bm{% \Sigma}}_{a})|.roman_log | roman_det ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) | + roman_log | roman_det ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) | .

Consequently, we find the following expression:

2log(p(𝒅|𝒈,𝜽))2𝑝conditional𝒅𝒈𝜽\displaystyle-2\log(p(\bm{d}|\bm{g},\bm{\theta}))- 2 roman_log ( italic_p ( bold_italic_d | bold_italic_g , bold_italic_θ ) ) =\displaystyle== log|det(𝚺¯a|d1)|+log|det(𝚺¯a)|detsuperscriptsubscript¯𝚺conditional𝑎𝑑1detsubscript¯𝚺𝑎\displaystyle\log|{\rm det}(\bar{\bm{\Sigma}}_{a|d}^{-1})|+\log|{\rm det}(\bar% {\bm{\Sigma}}_{a})|roman_log | roman_det ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) | + roman_log | roman_det ( over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) | (83)
\displaystyle-- 𝒅T𝚺¯d1𝑯¯𝚺¯a|d𝑯¯T𝚺¯d1𝒅+c,superscript𝒅𝑇superscriptsubscript¯𝚺𝑑1¯𝑯subscript¯𝚺conditional𝑎𝑑superscript¯𝑯𝑇superscriptsubscript¯𝚺𝑑1𝒅𝑐\displaystyle\bm{d}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bar{\bm{H}}\bar{\bm{\Sigma}}% _{a|d}\bar{\bm{H}}^{T}\bar{\bm{\Sigma}}_{d}^{-1}\bm{d}+c,bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG bold_italic_H end_ARG over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_a | italic_d end_POSTSUBSCRIPT over¯ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over¯ start_ARG bold_Σ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_d + italic_c ,

where c𝑐citalic_c corresponds to the terms independent on (𝒈,𝜽)𝒈𝜽(\bm{g},\bm{\theta})( bold_italic_g , bold_italic_θ ). In the case of MNmuch-greater-than𝑀𝑁M\gg Nitalic_M ≫ italic_N, this incurs a computational cost that is much lesser than that using equation (58).

Appendix B Revisiting visibility binning with uniform and log gridding

The computational cost strongly depends on the number of visibilities. Here, we discuss the data binning in an interferometric observation assuming linear and log grids.

B.1 Formulation

We consider the data 𝒅obssubscript𝒅obs\bm{d}_{\rm obs}bold_italic_d start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT with the data weights 𝒘obssubscript𝒘obs\bm{w}_{\rm obs}bold_italic_w start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT. We binned them on a grid specified by bin edges {(Ei,Ej)}subscript𝐸isubscript𝐸j\{(E_{\rm i},E_{\rm j})\}{ ( italic_E start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT , italic_E start_POSTSUBSCRIPT roman_j end_POSTSUBSCRIPT ) } with i=(Nbin+1),Nbin,,Nbin,Nbin+1𝑖subscript𝑁bin1subscript𝑁binsubscript𝑁binsubscript𝑁bin1i=-(N_{\rm bin}+1),-N_{\rm bin},...,N_{\rm bin},N_{\rm bin}+1italic_i = - ( italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1 ) , - italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT , … , italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT , italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1, wherein Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT determines the number of grids. For a cell with Ei1<uk<Eisubscript𝐸𝑖1subscript𝑢𝑘subscript𝐸𝑖E_{i-1}<u_{k}<E_{i}italic_E start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Ej1<vk<Ejsubscript𝐸𝑗1subscript𝑣𝑘subscript𝐸𝑗E_{j-1}<v_{k}<E_{j}italic_E start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < italic_E start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, we define the summing operation for a vector 𝒙𝒙\bm{x}bold_italic_x with the same length as 𝒅𝒅\bm{d}bold_italic_d as follows:

Bin(𝒙,i,j)=Ei1<uk<EiEj1<vk<Ejxk,Bin𝒙𝑖𝑗subscriptFRACOPsubscript𝐸𝑖1subscript𝑢𝑘subscript𝐸𝑖subscript𝐸𝑗1subscript𝑣𝑘subscript𝐸𝑗subscript𝑥𝑘\displaystyle{\rm Bin}(\bm{x},i,j)=\sum_{E_{i-1}<u_{k}<E_{i}\atop E_{j-1}<v_{k% }<E_{j}}x_{k},roman_Bin ( bold_italic_x , italic_i , italic_j ) = ∑ start_POSTSUBSCRIPT FRACOP start_ARG italic_E start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_E start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < italic_E start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (84)

where Eisubscript𝐸𝑖E_{i}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes either Euniform,isubscript𝐸uniform𝑖E_{{\rm uniform},i}italic_E start_POSTSUBSCRIPT roman_uniform , italic_i end_POSTSUBSCRIPT or Elog,isubscript𝐸log𝑖E_{{\rm log},i}italic_E start_POSTSUBSCRIPT roman_log , italic_i end_POSTSUBSCRIPT. Bin edges for a uniform gridding are defined as follows:

Euniform,i={xmin+(i1)(xmaxxminNbin)i=1,2,,Nbin+10i=0xmin+(i+1)(xmaxxminNbin)i=1,2,,(Nbin+1),subscript𝐸uniform𝑖casessubscript𝑥min𝑖1subscript𝑥maxsubscript𝑥minsubscript𝑁bin𝑖12subscript𝑁bin10𝑖0subscript𝑥min𝑖1subscript𝑥maxsubscript𝑥minsubscript𝑁bin𝑖12subscript𝑁bin1\displaystyle E_{{\rm uniform},i}=\left\{\begin{array}[]{cc}x_{\rm min}+(i-1)% \left(\frac{x_{\rm max}-x_{\rm min}}{N_{\rm bin}}\right)&i=1,2,...,N_{\rm bin}% +1\\ 0&i=0\\ -x_{\rm min}+(i+1)\left(\frac{x_{\rm max}-x_{\rm min}}{N_{\rm bin}}\right)&i=-% 1,-2,...,-(N_{\rm bin}+1),\\ \end{array}\right.italic_E start_POSTSUBSCRIPT roman_uniform , italic_i end_POSTSUBSCRIPT = { start_ARRAY start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT + ( italic_i - 1 ) ( divide start_ARG italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT end_ARG ) end_CELL start_CELL italic_i = 1 , 2 , … , italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_i = 0 end_CELL end_ROW start_ROW start_CELL - italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT + ( italic_i + 1 ) ( divide start_ARG italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT end_ARG ) end_CELL start_CELL italic_i = - 1 , - 2 , … , - ( italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1 ) , end_CELL end_ROW end_ARRAY (88)

where (xminsubscript𝑥minx_{\rm min}italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, xmaxsubscript𝑥maxx_{\rm max}italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT) are values that determine the limits of the grid.

Similarly, we define bin edges for a log grid as follows:

Elog,i={xmin×10(i1)Δlogi=1,2,,Nbin+10i=0xmin×10(i+1)Δlogi=1,2,,(Nbin+1)subscript𝐸log𝑖casessubscript𝑥minsuperscript10𝑖1subscriptΔlog𝑖12subscript𝑁bin10𝑖0subscript𝑥minsuperscript10𝑖1subscriptΔlog𝑖12subscript𝑁bin1\displaystyle E_{{\rm log},i}=\left\{\begin{array}[]{cc}x_{\rm min}\times 10^{% (i-1)\Delta_{\rm log}}&i=1,2,...,N_{\rm bin}+1\\ 0&i=0\\ -x_{\rm min}\times 10^{-(i+1)\Delta_{\rm log}}&i=-1,-2,...,-(N_{\rm bin}+1)% \end{array}\right.italic_E start_POSTSUBSCRIPT roman_log , italic_i end_POSTSUBSCRIPT = { start_ARRAY start_ROW start_CELL italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT ( italic_i - 1 ) roman_Δ start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL italic_i = 1 , 2 , … , italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_i = 0 end_CELL end_ROW start_ROW start_CELL - italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT - ( italic_i + 1 ) roman_Δ start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL italic_i = - 1 , - 2 , … , - ( italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT + 1 ) end_CELL end_ROW end_ARRAY (93)

where we define a spacing ΔlogsubscriptΔlog\Delta_{\rm log}roman_Δ start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT for the log grid as follows:

Δlog=log10(xmax)log10(xmin)Nbin.subscriptΔlogsubscript10subscript𝑥maxsubscript10subscript𝑥minsubscript𝑁bin\displaystyle\Delta_{\rm log}=\frac{\log_{10}(x_{\rm max})-\log_{10}(x_{\rm min% })}{N_{\rm bin}}.roman_Δ start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT = divide start_ARG roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) - roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ) end_ARG start_ARG italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT end_ARG . (94)

In each cell, we computed a weighted average 𝒅binsubscript𝒅bin\bm{d}_{{\rm bin}}bold_italic_d start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT, a total sum of weights 𝒘binsubscript𝒘bin\bm{w}_{{\rm bin}}bold_italic_w start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT, standard deviations for the noise σbin,i,jsubscript𝜎bin𝑖𝑗\sigma_{{\rm bin},i,j}italic_σ start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT, and weighted average of spatial frequencies (𝒖bin,𝒗bin)subscript𝒖binsubscript𝒗bin(\bm{u}_{{\rm bin}},\bm{v}_{{\rm bin}})( bold_italic_u start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT , bold_italic_v start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT ) as follows:

wbin,i,jsubscript𝑤bin𝑖𝑗\displaystyle w_{{\rm bin},i,j}italic_w start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT =\displaystyle== Bin(𝒘,i,j)Bin𝒘𝑖𝑗\displaystyle{\rm Bin}(\bm{w},i,j)roman_Bin ( bold_italic_w , italic_i , italic_j ) (95)
σbin,i,jsubscript𝜎bin𝑖𝑗\displaystyle\sigma_{{\rm bin},i,j}italic_σ start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT =\displaystyle== 1wbin,i,j1subscript𝑤bin𝑖𝑗\displaystyle\frac{1}{\sqrt{w_{{\rm bin},i,j}}}divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_w start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT end_ARG end_ARG (96)
dbin,i,jsubscript𝑑bin𝑖𝑗\displaystyle d_{{\rm bin},i,j}italic_d start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT =\displaystyle== Bin(𝒅𝒘,i,j)wbin,i,jBin𝒅𝒘𝑖𝑗subscript𝑤bin𝑖𝑗\displaystyle\frac{{\rm Bin}(\bm{d}\circ\bm{w},i,j)}{w_{{\rm bin},i,j}}divide start_ARG roman_Bin ( bold_italic_d ∘ bold_italic_w , italic_i , italic_j ) end_ARG start_ARG italic_w start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT end_ARG (97)
ubin,i,jsubscript𝑢bin𝑖𝑗\displaystyle u_{{\rm bin},i,j}italic_u start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT =\displaystyle== Bin(𝒖𝒘,i,j)wbin,i,jBin𝒖𝒘𝑖𝑗subscript𝑤bin𝑖𝑗\displaystyle\frac{{\rm Bin}(\bm{u}\circ\bm{w},i,j)}{w_{{\rm bin},i,j}}divide start_ARG roman_Bin ( bold_italic_u ∘ bold_italic_w , italic_i , italic_j ) end_ARG start_ARG italic_w start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT end_ARG (98)
vbin,i,jsubscript𝑣bin𝑖𝑗\displaystyle v_{{\rm bin},i,j}italic_v start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT =\displaystyle== Bin(𝒗𝒘,i,j)wbin,i,j,Bin𝒗𝒘𝑖𝑗subscript𝑤bin𝑖𝑗\displaystyle\frac{{\rm Bin}(\bm{v}\circ\bm{w},i,j)}{w_{{\rm bin},i,j}},divide start_ARG roman_Bin ( bold_italic_v ∘ bold_italic_w , italic_i , italic_j ) end_ARG start_ARG italic_w start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT end_ARG , (99)

where \circ denotes the Hadamard product for vectors 𝒙𝒙\bm{x}bold_italic_x and 𝒚𝒚\bm{y}bold_italic_y: (𝒙𝒚)i=xiyisubscript𝒙𝒚𝑖subscript𝑥𝑖subscript𝑦𝑖(\bm{x}\circ\bm{y})_{i}=x_{i}y_{i}( bold_italic_x ∘ bold_italic_y ) start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

B.2 Quantifying binning errors with uniform and log grids

We quantified binning errors by simulating an observation of a proto-planetary disc. The simulation setup was same as simulated case of AS 209 in Sec 4.

The data were binned with Nbin=(125,250,500,1000,2000,4000)subscript𝑁bin125250500100020004000N_{\rm bin}=(125,250,500,1000,2000,4000)italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = ( 125 , 250 , 500 , 1000 , 2000 , 4000 ), xmin=102subscript𝑥minsuperscript102x_{\rm min}=10^{2}italic_x start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT λ𝜆\lambdaitalic_λ, and xmax=107subscript𝑥maxsuperscript107x_{\rm max}=10^{7}italic_x start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT λ𝜆\lambdaitalic_λ. At the binned spatial frequencies (𝒖bin,𝒗bin)subscript𝒖binsubscript𝒗bin(\bm{u}_{\rm bin},\bm{v}_{\rm bin})( bold_italic_u start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT , bold_italic_v start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT ), we computed model visibilities 𝒅bin,modelsubscript𝒅binmodel\bm{d}_{\rm bin,model}bold_italic_d start_POSTSUBSCRIPT roman_bin , roman_model end_POSTSUBSCRIPT, and quantified a binning error relative to an observational error as follows:

χbin,i,j=dbin,i,jdmodel,i,jσbin,i,j,subscript𝜒bin𝑖𝑗subscript𝑑bin𝑖𝑗subscript𝑑model𝑖𝑗subscript𝜎bin𝑖𝑗\displaystyle\chi_{{\rm bin},i,j}=\frac{d_{{\rm bin},i,j}-d_{{\rm model},i,j}}% {\sigma_{{\rm bin},i,j}},italic_χ start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT = divide start_ARG italic_d start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT - italic_d start_POSTSUBSCRIPT roman_model , italic_i , italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT end_ARG , (100)

which is the ratio of the binning error with respect to the noise amplitude. Additionally, we also computed the mean squared binning errors χbinerror,mean2superscriptsubscript𝜒binerrormean2\chi_{{\rm binerror,mean}}^{2}italic_χ start_POSTSUBSCRIPT roman_binerror , roman_mean end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT:

χbin,mean2=1Nd,binχbin,i,j2,superscriptsubscript𝜒binmean21subscript𝑁dbinsuperscriptsubscript𝜒bin𝑖𝑗2\displaystyle\chi_{{\rm bin,mean}}^{2}=\frac{1}{N_{\rm d,bin}}\sum\chi_{{\rm bin% },i,j}^{2},italic_χ start_POSTSUBSCRIPT roman_bin , roman_mean end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_N start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT end_ARG ∑ italic_χ start_POSTSUBSCRIPT roman_bin , italic_i , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (101)

where Md,binsubscript𝑀dbinM_{\rm d,bin}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT is the number of data after binning.

We computed the binning errors for the linear and log grids by adopting Nbin=(125,250,500,1000,2000,4000)subscript𝑁bin125250500100020004000N_{\rm bin}=(125,250,500,1000,2000,4000)italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = ( 125 , 250 , 500 , 1000 , 2000 , 4000 ). The left panel in Fig. 17 shows the relation between the mean squared binning errors χbin,mean2superscriptsubscript𝜒binmean2\chi_{\rm bin,mean}^{2}italic_χ start_POSTSUBSCRIPT roman_bin , roman_mean end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and the number of binned data Md,binsubscript𝑀dbinM_{\rm d,bin}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT, which determines the computational time. As Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT increases, the binning errors became small, and Md,binsubscript𝑀dbinM_{\rm d,bin}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT became large as expected, thereby increasing the computational cost. On average, uniform and log grids yielded similar binning errors with Md,binsubscript𝑀dbinM_{\rm d,bin}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT being fixed as shown in Fig. 17.

The right panel in Fig. 17 shows the binning errors |χbin|subscript𝜒bin|\chi_{\rm bin}|| italic_χ start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT | at deprojected spatial frequencies q𝑞qitalic_q. To obtain a similar number of data points Md,bin105similar-to-or-equalssubscript𝑀dbinsuperscript105M_{\rm d,bin}\simeq 10^{5}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT ≃ 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT, we assumed Nbin=2000subscript𝑁bin2000N_{\rm bin}=2000italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 2000 for the uniform grid and Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500 for the log grid (see left panel in Fig. 17). The binning errors in the uniform grid increased at low spatial frequencies, and they reduced at higher frequencies. With Nbin=2000subscript𝑁bin2000N_{\rm bin}=2000italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 2000 for the uniform grid, the binning error can reach 40404040 per cent at most, and such errors might be problematic to estimations on large-scale emissions. The binning errors increase at small spatial frequencies because the binning errors in the uniform grid are proportional to the first derivative of the radial visibility profile, which tends to be large at smaller scales. This is supported by the small binning errors at large spatial frequencies. In the case of the log grid, the binning errors are suppressed at small spatial frequencies, in contrast to the uniform grid. This accords with the narrow bin widths at small spatial frequencies in the log grid; the grid is sufficiently fine to resolve the visibility profile. At the middle to high spatial frequencies, the binning errors increased and decreased, with a peak at 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT Mλ𝜆\lambdaitalic_λ. At the higher spatial frequencies, the grids become coarse, while the first derivative of the radial visibility profile becomes small. These two different trends result in this dependency.

Comparing the two grids, the log grid tends to yield moderate binning errors, whereas the uniform grid yields errors in a broad range. Considering the robustness, we adopted the log grid in the analysis of the main text, because the large binning errors at small spatial frequencies for the uniform grid can degrade the estimated accuracy of the flux on a large scale.

Refer to caption
Refer to caption
Figure 17: Comparison of binning errors for log (orange line) and linear (blue line) grids. (left) The average binning error χbin,mean2superscriptsubscript𝜒binmean2\chi_{{\rm bin,mean}}^{2}italic_χ start_POSTSUBSCRIPT roman_bin , roman_mean end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and the number of binned data Md,binsubscript𝑀dbinM_{\rm d,bin}italic_M start_POSTSUBSCRIPT roman_d , roman_bin end_POSTSUBSCRIPT. We vary Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT in the range of 125125125125-4000400040004000. (right) Individual binning error |χbin|subscript𝜒bin|\chi_{\rm bin}|| italic_χ start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT | and deprojected spatial frequency q𝑞qitalic_q. We assume Nbin=2000subscript𝑁bin2000N_{\rm bin}=2000italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 2000 for the uniform grid and Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500 for the log grid.

B.3 Dependence of geometric parameters and hyperparameters on number of grids

In the main text, we adopted the log grid with Nbin=500subscript𝑁bin500N_{\rm bin}=500italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 500. Here, we show that the choice of Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT does not affect the estimation on non-linear parameters. Simulating the same observational setup and the brightness profile as in the simulated case for AS 209 in Sec 4, we derived the geometric parameters by varying the number of grids, Nbin=subscript𝑁binabsentN_{\rm bin}=italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 125, 250, 500, and 1000. We injected (Δxcen,Δycen,cosi,PA)=(0,0,0.75,45)Δsubscript𝑥cenΔsubscript𝑦cen𝑖PA000.75superscript45(\Delta x_{\rm cen},\Delta y_{\rm cen},\cos i,{\rm PA})=(0\arcsec,0\arcsec,0.7% 5,45^{\circ})( roman_Δ italic_x start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_Δ italic_y start_POSTSUBSCRIPT roman_cen end_POSTSUBSCRIPT , roman_cos italic_i , roman_PA ) = ( 0 ″ , 0 ″ , 0.75 , 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT ). Fig. 18 shows the results for recovered geometric parameters and hyperparameters. The results appear to be unaffected by the choice of Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 18: Recovered geometric and hyperparameters with different numbers of bins Nbinsubscript𝑁binN_{\rm bin}italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT.

Appendix C Dependence of residual images on robust parameters

We varied the robust parameters as [0, 0.5, 1] in Briggs gridding, and investigated the dependence of the resultant residual images. Fig. 19 shows the deprojected residual images adopting three different robust parameters. They were constructed from the same residual visibilities as those used in the case of the even-symmetric spiral in Fig. 10. Visual inspections indicated that the robust parameter of 0.5 yielded the most balanced image in terms of both sensitivity and resolution.

Refer to caption
Refer to caption
Refer to caption
Figure 19: Dependence of residual images on robust parameters [0, 0.5, 1.0]. The synthesized beam sizes (0.067,0.031)0.0670.031(0.067\arcsec,0.031\arcsec)( 0.067 ″ , 0.031 ″ ), (0.11,0.041)0.110.041(0.11\arcsec,0.041\arcsec)( 0.11 ″ , 0.041 ″ ), and (0.16,0.063)0.160.063(0.16\arcsec,0.063\arcsec)( 0.16 ″ , 0.063 ″ ) are shown at the bottom left of panels.

Appendix D Recovered spatial scales for different angular resolutions and brightness profiles

We changed the length scales for the input profile I(r)𝐼𝑟I(r)italic_I ( italic_r ) and observational spatial frequencies {uj,vj}subscript𝑢𝑗subscript𝑣𝑗\{u_{j},v_{j}\}{ italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } to investigate the variations of recovered length parameters γ𝛾\gammaitalic_γ. Specifically, we considered a modified brightness profile Ia(r)=I(ar)subscript𝐼𝑎𝑟𝐼𝑎𝑟I_{a}(r)=I(ar)italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( italic_r ) = italic_I ( italic_a italic_r ) and modified spatial frequencies {ub,j,vb,j}={buj,bvj}subscript𝑢𝑏𝑗subscript𝑣𝑏𝑗𝑏subscript𝑢𝑗𝑏subscript𝑣𝑗\{u_{b,j},v_{b,j}\}=\{bu_{j},bv_{j}\}{ italic_u start_POSTSUBSCRIPT italic_b , italic_j end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_b , italic_j end_POSTSUBSCRIPT } = { italic_b italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_b italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT }, where we introduced scaling parameters (a,b)𝑎𝑏(a,b)( italic_a , italic_b ). We considered the same observational setup and brightness profile as that of AS 209 in Sec 4, simulated the data, and derived the posterior distribution for parameters including γ𝛾\gammaitalic_γ. Fig. 20 shows the result with varying scale of brightness profile a=(0.5,1,1.5)𝑎0.511.5a=(0.5,1,1.5)italic_a = ( 0.5 , 1 , 1.5 ) and fixed observed spatial frequency b=1𝑏1b=1italic_b = 1. The optimized length scale γ𝛾\gammaitalic_γ positively scaled with a𝑎aitalic_a. Beyond the optimized length scale, the power of model visibilities was suppressed and exhibited damped oscillations. Fig. 21 shows the result with varying scale for spatial frequency for the data b=(0.5,1,1.5)𝑏0.511.5b=(0.5,1,1.5)italic_b = ( 0.5 , 1 , 1.5 ) and the same brightness profile a=1𝑎1a=1italic_a = 1. The optimized length scale was unchanged for b=1,1.5𝑏11.5b=1,1.5italic_b = 1 , 1.5; however, it reduced for b=0.5𝑏0.5b=0.5italic_b = 0.5. This indicates that if the structure is already well resolved, the improvement in the angular resolution does not change the optimized length scale γ𝛾\gammaitalic_γ. However, if we adopt the worse angular resolution, the resolution cannot be sufficient to resolve the structure. This results in the larger value of γ𝛾\gammaitalic_γ. From this discussion, we can conclude that the optimized length scale γ𝛾\gammaitalic_γ is determined by both the brightness profile and the UV-coverage.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 20: Simulation result with varying length scales of injected brightness profile fa(r)=f(ar)subscript𝑓𝑎𝑟𝑓𝑎𝑟f_{a}(r)=f(ar)italic_f start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( italic_r ) = italic_f ( italic_a italic_r ) with a=(0.5,1,1.5)𝑎0.511.5a=(0.5,1,1.5)italic_a = ( 0.5 , 1 , 1.5 ). (left upper) Posterior distribution of recovered length scale γ𝛾\gammaitalic_γ for a=(0.5,1,1.5)𝑎0.511.5a=(0.5,1,1.5)italic_a = ( 0.5 , 1 , 1.5 ). (right upper) Result with a=0.5𝑎0.5a=0.5italic_a = 0.5. Injected and recovered brightness profile in the upper panel, and model and observed visibilities in the lower panel. The observed visibilities are binned with a log grid with Nbin=2000subscript𝑁bin2000N_{\rm bin}=2000italic_N start_POSTSUBSCRIPT roman_bin end_POSTSUBSCRIPT = 2000. (lower panels) Result with a=(1,1.5)𝑎11.5a=(1,1.5)italic_a = ( 1 , 1.5 ).
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 21: Same figure as Fig. 20 except that we vary b=(0.5,1,1.5)𝑏0.511.5b=(0.5,1,1.5)italic_b = ( 0.5 , 1 , 1.5 ) and fix a=1𝑎1a=1italic_a = 1.

Appendix E Mean and variance of residual images

Assuming the simulated data for AS 209 in Sec 4, we produced 10 different residual images by drawing samples from posterior distribution of (𝒈,𝜽,𝒂)𝒈𝜽𝒂(\bm{g},\bm{\theta},\bm{a})( bold_italic_g , bold_italic_θ , bold_italic_a ). We computed the mean and the standard deviation for 10 residual images, and calculated the differences of two random residual images from the mean image. Fig. 22 shows the result. The mean residual image was consistent with that shown in Fig. 4. The image for the standard deviation was nearly axisymmetric, and the amplitude was 105superscript10510^{-5}10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT Jy beam-1 at most in the innermost part; whereas, it was 24×10624superscript1062-4\times 10^{-6}2 - 4 × 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT Jy beam-1, which is a few times smaller than the observational error in the residual image, for most of the disc plane. In the lower panels, we show the differences from the mean image, and they are nearly axisymmetric as well. The axisymmetry would arise from the larger variances of the brightness profile than those of geometry and hyperparmaeters. Considering the small amplitudes and axisymmetry of the differential images, it is reasonable to conclude that their effects would be negligible in searching for non-axisymmetric structures, at least in the current observational setup.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 22: Mean and variance of residual images produced with 10 samples of parameters derived from simulated data of AS 209 in Sec 4. Upper and left panels show the mean and standard deviation of the residual images. The lower panels show the difference maps of two random residual images from the mean residual image.

Appendix F Comparison of residual images from two geometries

The upper left panel in Fig 23 illustrates the difference in the deprojected residual images derived from geometries in our study and Benisty et al. (2021) (see Fig 15). We observed a noticeable m=2𝑚2m=2italic_m = 2 pattern in the difference, primarily attributed to the discrepancy in PA by 1superscript11^{\circ}1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. Additionally, adopting the geometry in Benisty et al. (2021), we also subtracted the crescent model from the residual image following the same procedure described in Sec 7.2, while the model was re-optimized. In the upper right and lower panels in Fig 23 present the subtracted residual images in a deprojected coordinate and a polar coordinate.

The estimate of PA appears to be influenced by large-scale faint residuals rather than localized emission (also see Fig F1). Whether to adopt the removal of visibility of localized emission in a specific region as in Benisty et al. (2021) might affect the estimation. However, both the removal and non-removal can equally introduce bias, because the removal can result in the loss of information in the data. Thus, although our estimate is obtained by optimization of the data, we have not conclusively determined which estimation is more accurate.

A major difference is that the residual image derived from geometry in Benisty et al. (2021) exhibits a wide leading arm extended from the crescent to the point around (x′′,y′′)=(0.75,0.4)superscript𝑥′′superscript𝑦′′0.750.4(x^{\prime\prime},y^{\prime\prime})=(-0.75\arcsec,0.4\arcsec)( italic_x start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) = ( - 0.75 ″ , 0.4 ″ ). In addition, the significance of an arm (feature (d) in Sec 7.2) appears to be diminished compared to the image in Fig 16. On the other hand, all of the other features (a-c,e,f) described in Sec 7.2 are consistently identified in the two residual images with different geometries.

Refer to caption
Refer to caption
Refer to caption
Figure 23: Difference in residual images derived from geometry in our study and Benisty et al. (2021), and residual images based on the geometry from Benisty et al. (2021) with the crescent model being subtracted. (upper left) Difference in the residual images based on geometries from two studies in Fig 15. (upper right) The residual image with the subtraction of the crescent model based on geometry from Benisty et al. (2021). The format is same as that in Fig 16. (lower) The image with the crescent mode being subtracted in polar coordinate.